Research Engineer (Agentic Models)

WorldwideRemote3d ago

About the role

<p>At JetBrains, code is our passion. Ever since we started, back in 2000, we’ve been striving to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in our IDEs.</p> <p>We’re building multi-step coding agents that can understand large codebases, plan changes, call tools, and iterate with the user. As a Research Engineer in the Agentic Models team, you’ll be responsible for the models, training loops, and evaluation pipelines that power these agents.</p> <p>You’ll work at the intersection of SFT and RL-style post-training, and product-driven evaluation, using our distributed GPU and MapReduce clusters to ship models into JetBrains products.</p> <h3>As part of our team, you will:</h3> <ul> <li>Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.</li> <li>Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.</li> <li>Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.</li> <li>Design evaluation frameworks and metrics for agent behavior, analyze traces and logs, and close the loop from evaluation back into training, data, and reward design.</li> <li>Analyze training and evaluation results to propose and implement improvements to model architectures, training recipes, and datasets.</li> <li>Work with large-scale infrastructure, including distributed training on GPU clusters and large MapReduce-style data processing for pre-training and fine-tuning datasets.</li> <li>Collaborate closely with research, product, and infrastructure teams to turn high-level product visions into concrete models, experiments, and shipped features. </li> </ul> <h3>We’ll be happy to bring you on board if you have:</h3> <ul> <li>Extensive hands-on experience training LLMs (pre-training, fine-tuning, or post-training) in a research or production setting.</li> <li>Deep expertise in modern deep learning frameworks such as PyTorch, and specialized LLM training stacks (e.g. Megatron, NeMo, verl, or similar).</li> <li>Strong theoretical and practical understanding of LLM fundamentals: architectures, tokenization, data pipelines, batching, mixed precision, distributed training, and debugging unstable runs.</li> <li>The ability to own projects end to end, starting from a high-level problem or product pain point and overseeing it through the design, experimentation, implementation, and iteration phases.</li> <li>A product-aware mindset – you care about how developers actually use agents and can translate product needs and failure modes into modeling and evaluation work.</li> <li>At least 3 years of Python experience writing clean, maintainable code in modern ML codebases.</li> </ul> <h3>Our ideal candidate would have experience with:</h3> <ul> <li>ML orchestrators and workflow tools such as Kubeflow, Dagster, Airflow, ZenML, and/or job schedulers like Kubernetes or SLURM.</li> <li>Large-scale data and training pipelines, e.g. MapReduce-style clusters, multi-node GPU training, or workloads on the order of 1M+ CPU/GPU hours.</li> <li>Designing and maintaining evaluation pipelines for LLMs or agents, including metrics, dashboards, experiment tracking, and automated regression checks.</li> <li>AI agent development, such as tool-using agents, planners, or multi-step coding workflows, and familiarity with agentic frameworks or patterns.</li> <li>Experiment tracking and observability using tools like Weights & Biases, MLflow, Langfuse, or similar.</li> <li>Inference optimization and serving optimized models in production.</li> </ul> <p><span style="color: rgb(255, 255, 255);">#LI-KP1</span></p><div class="content-conclusion"><p><strong>We are an equal opportunity employer</strong><br><br>We know great ideas can come from anyone, anywhere. That’s why we do our best to create an open and inclusive workplace – one that welcomes everyone regardless of their background, identity, religion, age, accessibility needs, or orientation.</p> <p><em data-stringify-type="italic">We process the data provided in your job application in accordance with the <a href="https://www.jetbrains.com/legal/docs/privacy/privacy-recruitment/">Recruitment Privacy Policy.</a></em></p></div>

731,000+ hidden jobs like this

JetBrains and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

Unlimited applications — free stops at 5
Track every application in one place
Apply straight to the source, one click
Save & organize roles you love
Roles pulled from company boards before the big sites

Weekly

$9.99

$4.99/week

For an active search. Cancel anytime.

Get Weekly

Monthly

$24.99

$12.99/month

The smart pick. Save 35% vs weekly.

Get Monthly

Lifetime

$99

$49.99once

Pay once. Every future feature, forever.

Get Lifetime