Senior/Staff ML Engineer, Performance Optimization

Comfy

San Francisco, On-site

Employment: Full-time
Seniority: Staff

The Role

We're looking for someone who loves optimizing model inference to join us in building the core of ComfyUI - the most complex and bleeding-edge part of our engine. You'll be working on making AI models run faster and more efficiently than anyone thought possible.

You are a good fit if this describes you:

You geek out about model inference, torch optimizations, and memory management
You've written production PyTorch code that pushes performance boundaries
You love diving deep into how models actually work under the hood
You get excited about making insanely optimized code that just works
You think the current state of ML deployment could be way better

What you'll do:

Build and optimize the core inference engine that powers ComfyUI
Make massive models run faster and use less memory than anyone else
Work directly with our core team on architecting new features
Tackle the hardest technical problems in the visual AI space
Help shape where we take this technology next

Bonus: If you've worked with diffusion models or LLMs before, or built custom nodes for ComfyUI, that's awesome.
