Back to all jobs
I
Member of Technical Staff, Training Infra
Inception
Bay Area$200k–350kOn-site3mo ago
- Employment
- Full-time
- Seniority
- Staff
About the role
- Design, implement, and optimize distributed training systems that scale across thousands of GPUs and nodes.
- Develop high-performance optimizations to maximize throughput and efficiency.
- Develop reusable frameworks and libraries to improve training reproducibility, reliability, and scalability for new model architectures.
- BS/MS/PhD in Computer Science, Engineering, or a related field (or equivalent experience).
- Understanding of ML frameworks (PyTorch, TensorFlow) from a systems perspective.
- Strong engineering skills — ability to contribute performant, maintainable code and debug in complex codebases.
- Proficiency in Python and at least one systems programming language (C++/Rust/Go).
- Experience with containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines.
- Experience building and maintaining large-scale language models with tens of billions of parameters or more.
- Experience with ML workflow orchestration tools (Kubeflow, Airflow).
- Background in performance optimization and profiling of ML systems (Prometheus, Grafana, OpenTelemetry).
- Familiarity with distributed frameworks such as PyTorch/XLA, DeepSpeed, Megatron-LM.
Compensation
- Work with World-Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers
- Shape Foundational Technology: Your decisions will influence how the next generation of AI products are built and used
- Immediate Impact: Join at the ground floor where your contributions directly shape product direction and company trajectory
- Competitive salary and equity in a rapidly growing startup
- Flexible vacation and paid time off (PTO)
- Health, dental, and vision insurance
- 401k match
- Catered meals (breakfast, lunch, & dinner)
- Commuter subsidies
- A collaborative and inclusive culture
Perks & benefits
- 401k
- Vision Insurance
- Unlimited Vacation
- Paid Time Off
- Pension Matching
- Equity Compensation
764,000+ hidden jobs like this
Inception and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites