Back to all jobs

- Seniority
- Staff
About the role
<div class="content-intro"><h2><strong>About Inflection AI</strong></h2>
<p>Inflection AI is a Public Benefit Corporation empowering people with human-centered, emotionally intelligent AI. We’re shaping the future of AI by combining emotional intelligence (EQ) and raw intelligence (IQ) to elevate people’s potential.<br><br>Inflection AI created Pi, the world’s first emotionally intelligent AI, to help people work through decisions, emotions, and challenges. Pi is a personal AI agent powered by Inflection AI’s foundation model, proving that AI can be personal, empathetic, and contextually aware.</p></div><h2>About the Role</h2>
<p>As a Model Training engineer, you will design, build, and scale the post-training pipelines that turn a general LLM into a brand-fluent, production-ready assistant. Your innovations in fine-tuning and preference optimization (RLHF, DPO, GRPO, RLAIF) will directly improve reliability, alignment, and cost.</p>
<p><strong>This is a good role for you if you:</strong></p>
<ul>
<li>Have hands-on experience training and fine-tuning large transformer models on multi-GPU / multi-node clusters.</li>
<li>Are fluent in PyTorch and its ecosystem tools (Torchtune, FSDP, DeepSpeed) and enjoy digging into distributed-training internals, mixed precision, and memory-efficiency tricks.</li>
<li>Have shipped or published work in RLHF, DPO, GRPO, or RLAIF and understand their practical trade-offs.</li>
<li>Care deeply about training tools, pipelines, and reproducibility—you automate the boring parts so you can iterate on the fun parts.</li>
<li>Balance research curiosity with product pragmatism—you know when to run an ablation and when to ship.</li>
<li>Communicate crisply with both technical and non-technical teammates.</li>
<li>Have a bachelor’s degree or equivalent in a related field to the offered position requirements.</li>
</ul>
<p><strong>Responsibilities include:</strong></p>
<ul>
<li>Contribute to end-to-end post-training workflows—dataset curation, hyper-parameter search, evaluation, and rollout—using PyTorch, Torchtune, FSDP/DeepSpeed, and our internal orchestration stack.</li>
<li>Prototype and compare alignment techniques (e.g., curriculum RL, multi-objective reward modeling, tool-use fine-tuning) and push the best ideas into production.</li>
<li>Automate training at scale: build robust pipeline components, tools, scripts, and dashboards so experiments are reproducible and easy to trace.</li>
<li>Define the metrics that matter; run A/B tests and iterate quickly to meet aggressive quality targets.</li>
<li>Collaborate with inference, safety, and product teams to land improvements in customer-facing systems.</li>
</ul>
<h2><strong>Employee Pay Disclosures</strong></h2>
<p>At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company. For this role, Inflection AI estimates a starting annual base salary to fall within the range of <strong>$</strong><strong>175,000</strong><strong> to $</strong><strong>350,000</strong>, depending on a candidate’s qualifications and level of experience. This role also includes a meaningful equity component, allowing employees to share in the long-term success of the company.<br><br></p>
<h3><strong>Benefits</strong></h3>
<p>Inflection AI values and supports our team’s mental and physical health. We are focused on building a positive, safe, inclusive and inspiring place to work. Our benefits include: </p>
<ul>
<li>Diverse medical, dental and vision options </li>
<li>401k matching program </li>
<li>Unlimited paid time off </li>
<li>Parental leave and flexibility for all parents and caregivers</li>
<li>Support of country-specific visa needs for international employees living in the Bay Area</li>
</ul>
Perks & benefits
- 401k
- Paid Time Off
- Pension Matching
- Equity Compensation
753,000+ hidden jobs like this
Inflection AI and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites