Back to all jobs
I
Member of Technical Staff, Backend, LLM Applications
Inception
Bay Area$200k–350kOn-site3mo ago
- Employment
- Full-time
- Seniority
- Staff
About the role
- Design, build, and operate scalable backend services and model serving infrastructure for our diffusion LLMs.
- Implement and manage load balancing, autoscaling, and traffic routing for model endpoints.
- Build systems for model versioning, canary deployments, and zero-downtime rollouts.
- Develop monitoring, alerting, and observability tooling to ensure SLA compliance and rapid incident response.
- Benchmark and evaluate serving frameworks and hardware configurations to inform infrastructure decisions.
- BS/MS/PhD in Computer Science or a related field (or equivalent experience).
- 5+ years of experience building production backend systems.
- Strong proficiency in Python, including async programming and concurrent systems.
- Solid understanding of distributed systems, networking, and load balancing at scale.
- Familiarity with Kubernetes, CI/CD pipelines, and cloud infra (AWS and/or Azure).
- Experience serving LLMs or other large generative models in production at scale.
- Experience with cloud infrastructure (AWS, Azure), including GPU instance management and cost optimization.
- Experience with infrastructure as code tools (Terraform) and deployment automation.
- Experience with monitoring and observability tools (Prometheus, Grafana).
- Familiarity with model serving frameworks (vLLM, Triton Inference Server, TensorRT-LLM).
Compensation
- Work with World-Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers
- Shape Foundational Technology: Your decisions will influence how the next generation of AI products are built and used
- Immediate Impact: Join at the ground floor where your contributions directly shape product direction and company trajectory
- Competitive salary and equity in a rapidly growing startup
- Flexible vacation and paid time off (PTO)
- Health, dental, and vision insurance
- 401k match
- Catered meals (breakfast, lunch, & dinner)
- Commuter subsidies
- A collaborative and inclusive culture
Perks & benefits
- 401k
- Vision Insurance
- Unlimited Vacation
- Paid Time Off
- Pension Matching
- Equity Compensation
764,000+ hidden jobs like this
Inception and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites