Back to all jobs
T

Research Engineer - Scalable Interpretability

Transluce

San FranciscoOn-site1w ago
Employment
Full-time

About the role

  • Creating diverse evaluations that range in difficulty. This involves finding naturally occurring interesting and undesirable behaviors exhibited by open-source models.
  • Developing novel architectures and objectives for training interpretability assistants.
  • Scaling up the training and inference pipelines to support up to 1T-scale models.
  • Experience with fine-tuning language models, designing new architectures, and creating evaluations.
  • Reliable results: good experimental design, epistemic self-awareness and transparency
  • Generativeness: coming up with original, productive ideas for unblocking progress
  • Curiosity: a desire to understand ML systems and how they work
  • Strong programming ability, including navigating trade-offs between prototyping speed and maintainability
  • Strong communication skills, low ego, openness to giving and receiving feedback

764,000+ hidden jobs like this

Transluce and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.