Back to all jobs
M

Research Engineer

Metis, Inc.

San Francisco8mo ago

About the role

<h2><strong>About the Role</strong></h2> <p data-start="342" data-end="724">As a Research Engineer at Metis, you’ll work on building the next generation of autonomous post-training systems that leverage our Mantis platform. You’ll operate at the intersection of cutting-edge ML research and scalable engineering, designing, implementing, and deploying algorithms that improve how AI agents learn from feedback, synthetic data, and real-world interactions.</p> <p data-start="342" data-end="724">You’ll move seamlessly between papers and production, leading large-scale experiments, creating optimized training pipelines, and helping shape the future of post-training autonomy. You’ll have significant ownership, high compute budgets, and the mandate to push the state of the art in applied reinforcement and preference optimization.</p> <h2><strong>What You'll Do</strong></h2> <ul> <li data-stringify-indent="0" data-stringify-border="0">Research and help build an autonomous post-training agent leveraging the Mantis platform</li> <li data-stringify-indent="0" data-stringify-border="0">Design and execute large-scale experiments on synthetic data generation and algorithmic architecture</li> <li data-stringify-indent="0" data-stringify-border="0">Develop and refine methods for reinforcement learning, reward modeling, and human feedback integration</li> <li data-stringify-indent="0" data-stringify-border="0">Collaborate cross-functionally with Core and Platform Engineering to deploy and evaluate models in production settings</li> <li data-stringify-indent="0" data-stringify-border="0">Publish or contribute to leading-edge research in the post-training domain</li> <li data-stringify-indent="0" data-stringify-border="0">Use tooling and compute efficiently to iterate on experimental pipelines and accelerate research velocity</li> </ul> <h2><strong>Requirements</strong></h2> <ul> <li data-stringify-indent="0" data-stringify-border="0">Deep experience in machine learning, preferably reinforcement learning, post-training, or alignment research</li> <li data-stringify-indent="0" data-stringify-border="0">Demonstrated research contributions; ideally published papers (ICML, NeurIPS) or public implementations</li> <li data-stringify-indent="0" data-stringify-border="0">Strong proficiency in Python and ML frameworks (PyTorch, JAX, or TensorFlow)</li> <li data-stringify-indent="0" data-stringify-border="0">Comfort with distributed training, high-throughput data pipelines, and large-scale experiment management</li> <li data-stringify-indent="0" data-stringify-border="0">Ability to reason independently, formulate hypotheses, and run experiments from idea → insight → product impact</li> </ul> <h2><strong>Compensation &amp; Benefits</strong></h2> <ul> <li>Base: $200,000–$1,000,000</li> <li>Significant Equity</li> <li>Full medical, dental, and vision</li> <li>Wellness &amp; L&amp;D stipend</li> <li>Equinox membership</li> <li>Breakfast, lunch, and dinner provided (Unlimited Doordash)</li> <li>$25,000 housing stipend</li> </ul> <h2><strong>About Metis</strong></h2> <p>Metis helps enterprises and labs build the most reliable AI agents by leveraging post-training. Our platform enables the creation, improvement, and deployment of the most capable frontier agents designed for rigorous, real-world workflows.</p> <div class="p-rich_text_section"><strong data-stringify-type="bold">Momentum</strong></div> <ul class="p-rich_text_list p-rich_text_list__bullet p-rich_text_list--nested" data-stringify-type="unordered-list" data-list-tree="true" data-indent="0" data-border="0"> <li data-stringify-indent="0" data-stringify-border="0">0 → six-figure monthly revenue in the last six weeks</li> <li data-stringify-indent="0" data-stringify-border="0">Working with several Fortune 500 enterprises &amp; frontier AI labs</li> <li data-stringify-indent="0" data-stringify-border="0">Growing 150%+ MoM</li> </ul> <div class="p-rich_text_section"><strong data-stringify-type="bold">Backed by</strong></div> <div class="p-rich_text_section">&nbsp;</div> <div class="p-rich_text_section">Y Combinator, CRV, and executives from OpenAI, Google, Mercor, NVIDIA, and others.</div>

Perks & benefits

  • Equity Compensation

731,000+ hidden jobs like this

Metis, Inc. and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.