Data Infrastructure Engineer
genesis
- Employment: Full-time
About the role
What You’ll Do
Design, build, and maintain large-scale data pipelines (batch and streaming) for robotics foundation model training and evaluation at petabyte scale
Own core data infrastructure: data model, storage systems, ingestion pipelines, transformation frameworks, and orchestration layers
Standardize data models and unify processing pipelines across real-world teleoperation and synthetic simulation datasets
Collaborate with a team of driven individuals committed to building general-purpose Physical AI
What You’ll Bring
Excellent software engineering skills (Python, Go, or similar)
Extensive experience designing, building, and maintaining large-scale data pipelines (8+ years)
Deep understanding of distributed systems (Spark, Kafka, or similar)
Extensive experience with data storage technologies (data lakes, warehouses, object stores like S3)
Experience running and maintaining production-grade infrastructure (Kubernetes, Terraform)
Bonus: Experience supporting AI systems, particularly embodied AI such as self-driving vehicles