Research Engineer, Reinforcement Learning

Tensorstax Com

San Francisco · On-site · 1y ago


Employment: Full-time

About the role

Develop and refine reward functions to optimize agent behavior for complex data engineering tasks.
Create RL gym environments for language model agents.
Fine-tune language models using reinforcement learning and preference-optimization techniques such as PPO, DPO, and KTO.
Stay at the forefront of research on RL for language models, incorporating advancements like GRPO, SWE-Gym, and SWE-RL into practical applications.
Curate and build high-quality datasets for supervised fine-tuning (SFT) and RLHF.
Design experiments to evaluate and improve the agentic capabilities of language models in data environments.

Requirements

Deep understanding of reinforcement learning, reward shaping, and optimization strategies.
Strong familiarity with LLM fine-tuning techniques (PPO, DPO, KTO) and their applications in reinforcement learning.
Knowledge of recent advancements in RL for language models (GRPO, SWE-Gym, SWE-RL).
Experience curating and constructing high-quality datasets for fine-tuning.
Strong problem-solving skills and a history of working on complex ML projects.
High agency: ability to work independently, experiment proactively, and drive research initiatives forward.

Nice to have

Experience with distributed training in PyTorch (DDP, FSDP).
Hands-on experience designing RL environments for traditional RL problems.
Contributions to open-source projects in RL, LLMs, or ML infrastructure.
Familiarity with data lakes and warehouses (Snowflake, BigQuery, Redshift).

Perks & benefits

100% employer-covered health, dental, and vision insurance.
401(k) with company match.
Access to Bay Club or Equinox in San Francisco.
