
- Employment
- Full-time
About the role
About the role
Own the platform that powers our accelerator cloud. Your scope spans bare-metal provisioning, multi-tenant Kubernetes, SLURM scheduling, control planes, and the automation and observability that keep thousands of compute nodes running as a single production system.
What you'll do
Build the control plane and APIs that unify our compute fleet
Own provisioning and lifecycle from rack bring-up to node retirement
Operate the scheduling layer for training and inference workloads
Architect multi-tenancy: isolation, quota, fairness, and accounting
Build automation that eliminates manual operations
Drive reliability, observability, and incident response across the fleet
What you'll need
BS in CS, EE, or related field, or equivalent experience
5+ years in infrastructure, platform, or backend engineering
Advanced software engineering skills: Rust, Go, or Python
Deep understanding of Linux, storage, and distributed systems
Experience with workload schedulers: SLURM, Kubernetes scheduling, or equivalent
Expertise with automation tooling: Terraform, Ansible, Helm
Experience architecting multi-tenant systems
Production SRE experience: on-call, incident response, observability
What we offer
Top-tier compensation structured to recognize and retain the best talent
Meaningful equity
Comprehensive medical, dental, vision, life, and disability insurance
Parental leave for all new parents, including adoptive and surrogate journeys
Flexible PTO
Paid Holidays
Relocation support
Equal Employment Opportunity
We're an Equal Opportunity Employer and do not discriminate on the basis of any protected status under applicable law.
Perks & benefits
- Unlimited Vacation
- Paid Time Off
- Equity Compensation
731,000+ hidden jobs like this
Material Group and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites