Back to all jobs
B
Senior ML/RL Engineer, Behavior Planning
Bot Auto
Houston3w ago
- Seniority
- Senior
About the role
<div class="mt-8 text-xl text-gray-800 leading-8"> </div>
<div class="mt-8 text-xl text-gray-600 leading-8">
<div data-controller="rich-text">
<div class="rich-text-container" data-rich-text-target="richTextContainer">
<h3><strong>Company Introduction</strong></h3>
<p>At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous trucks, enhancing the quality of life for communities around the globe. With the agility of a startup and the wisdom of seasoned experts, our team has achieved numerous world-firsts and unparalleled innovations. United by a shared vision, we create groundbreaking solutions that propel the future of transportation. Join us and transform your ideas into reality.</p>
<h3><strong>Role Overview</strong></h3>
<p>We are seeking a <strong>Senior ML/RL Engineer</strong> to join our Algo team and drive the development of our unified behavioral architecture. In this role, you will help bridge the gap between simulation and the real world by developing a scalable policy framework that represents both our L4 ego-policy and a diverse population of simulated agents. You will work at the intersection of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design to ensure our autonomous semi-trucks navigate highways with superhuman safety and precision.</p>
<h3><strong>Key Responsibilities</strong></h3>
<ul>
<li><strong>Behavioral Modeling:</strong> Develop and train diverse, conditioned policies that simulate realistic driving behaviors to stress-test and validate our autonomous driving stack.</li>
<li><strong>Safety-Constrained Learning:</strong> Lead the research and implementation of advanced RL algorithms to ensure safety metrics are treated as primary constraints in the learning process.</li>
<li><strong>Reward & Objective Design:</strong> Collaborate with cross-functional teams to design robust reward functions and evaluation metrics that balance safety, progress, and comfort.</li>
<li><strong>Scalable Training Pipelines:</strong> Contribute to the optimization of our large-scale, high-throughput training environments to enable rapid iteration on complex multi-agent scenarios.</li>
<li><strong>Model Architecture:</strong> Advance our state-of-the-art neural architectures to improve spatial reasoning, long-horizon planning, and interaction modeling.</li>
<li><strong>Cross-Team Collaboration:</strong> Work closely with Simulation and Planning teams to integrate research-grade models into production-quality, safety-critical software.</li>
</ul>
<h3><strong>Required Qualifications</strong></h3>
<ul>
<li><strong>Professional RL Experience:</strong> Proven track record of training and deploying deep RL algorithms (e.g., PPO, SAC) for complex, real-world robotic or autonomous systems.</li>
<li><strong>Technical Mastery:</strong> Expertise in <strong>Python</strong> and <strong>PyTorch</strong>; strong understanding of modern deep learning architectures and optimization techniques.</li>
<li><strong>Academic Background:</strong> MS or PhD in Computer Science, Robotics, or a related quantitative field.</li>
<li><strong>Scientific Intuition:</strong> Ability to diagnose and solve fundamental challenges in RL training, such as variance management and distribution shift.</li>
</ul>
<h3><strong>Preferred Qualifications</strong></h3>
<ul>
<li><strong>Safe RL Specialization:</strong> Experience with constrained optimization or safety-critical learning frameworks.</li>
<li><strong>Multi-Agent Systems:</strong> Background in MARL training stability, including self-play and decentralized execution strategies.</li>
<li><strong>Autonomous Driving Domain:</strong> Familiarity with vehicle dynamics and behavior planning, particularly for long-haul highway environments.</li>
</ul>
<h3><strong>Additional Information</strong></h3>
<ul>
<li><strong>Compensation:</strong> Competitive salary based on experience, with opportunities for performance bonuses and equity.</li>
<li><strong>Benefits:</strong> Comprehensive health insurance, paid time off, and the opportunity to work at the forefront of the autonomous trucking industry.</li>
</ul>
<h3><strong>Why Bot Auto?</strong></h3>
<p>We are a small, hyper-focused team on a mission to beat human cost-per-mile through technology. We recently successfully completed the industry’s first fully humanless commercial truckload, proving that our vision is a reality. If you are passionate about AI, safety, and transforming logistics, we want to hear from you.</p>
</div>
</div>
</div>
Perks & benefits
- Medical Insurance
- Paid Time Off
- Equity Compensation
731,000+ hidden jobs like this
Bot Auto and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites