Research Engineer - Scalable Interpretability

Transluce

San Francisco | On-site | 1w ago

Employment: Full-time

About the role

Creating diverse evaluations that range in difficulty. This involves finding naturally occurring interesting and undesirable behaviors exhibited by open-source models.
Developing novel architectures and objectives for training interpretability assistants.
Scaling up the training and inference pipelines to support up to 1T-scale models.

Experience with fine-tuning language models, designing new architectures, and creating evaluations.
Reliable results: good experimental design, epistemic self-awareness and transparency
Generativeness: coming up with original, productive ideas for unblocking progress
Curiosity: a desire to understand ML systems and how they work
Strong programming ability, including navigating trade-offs between prototyping speed and maintainability
Strong communication skills, low ego, openness to giving and receiving feedback
