Back to all jobs
Comfy logo

Senior/Staff ML Engineer, Performance Optimization

Comfy
San FranciscoOn-site
Employment
Full-time
Seniority
Staff

About the role

The Role

We're looking for someone who loves optimizing model inference to join us in building the core of ComfyUI - the most complex and bleeding-edge part of our engine. You'll be working on making AI models run faster and more efficiently than anyone thought possible.

You are a good fit if this describes you:

  • You geek out about model inference, torch optimizations, and memory management

  • You've written production PyTorch code that pushes performance boundaries

  • You love diving deep into how models actually work under the hood

  • You get excited about making insanely optimized code that just works

  • You think the current state of ML deployment could be way better

What you'll do:

  • Build and optimize the core inference engine that powers ComfyUI

  • Make massive models run faster and use less memory than anyone else

  • Work directly with our core team on architecting new features

  • Tackle the hardest technical problems in the visual AI space

  • Help shape where we take this technology next

Bonus: If you've worked with diffusion/LLM models before or built custom nodes for ComfyUI, that's awesome

747,000+ hidden jobs like this

Comfy and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.