Back to all jobs
P
ML Engineer - Inference
Photalabs Com
San JoseOn-site9mo ago
- Employment
- Full-time
About the role
About us:
The role:
What you’ll do:
- Deploy and integrate researcher-trained model checkpoints into our cloud infrastructure and production pipelines.
- Conduct thorough performance profiling and benchmarking to identify and eliminate computational bottlenecks.
- Implement neural network optimization techniques including quantization, pruning, and architectural refinements while preserving model accuracy.
- Develop efficient training and fine-tuning strategies with optimal precision trade-offs and parallelism.
- Build and maintain scalable multi-GPU inference solutions with sophisticated model parallelism and serving architectures.
- Collaborate with the research team to ensure optimization integrate smoothly with model development workflows.
You may be a strong fit if you:
- Have experience deploying and optimizing deep learning models for production environments, particularly with multi-GPU inference and large-scale model serving.
- Are well-versed in cutting-edge techniques for optimizing both inference and training workloads.
- Possess strong knowledge of efficient attention mechanisms and algorithms.
- Have hands-on experience implementing model quantization and working with inference frameworks.
- Can write production-quality code and successfully integrate ML models into robust inference pipelines.
- Are familiar with various cloud platforms, storage solutions, and modern training frameworks.
Logistics:
- This role is based in San Jose, where we work in person. We believe the best ideas come from being in the same room.
- We sponsor visas. We are committed to working through the process together for the right candidates. If you're currently outside the US, we're also committed to helping you relocate to the US throughout this process.
- We offer generous health, dental, and vision coverage, unlimited PTO, paid parental leave, and relocation support as needed.
- Don't meet every single qualification? That’s okay — we care more about your trajectory than checking every box. If the role excites you and the mission resonates, we'd love to hear from you.
Perks & benefits
- Vision Insurance
- Unlimited Vacation
- Paid Time Off
764,000+ hidden jobs like this
Photalabs Com and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites