Back to all jobs
T

Senior/Staff Applied GenAI Researcher – Enterprise Outcome Team

TrueFoundry

San Mateo4mo ago
Seniority
Staff

About the role

<p><strong>About TrueFoundry</strong><br><br>Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure. A way to route between models. A way to manage tools and integrate them securely. A way to orchestrate agents and enforce governance. A unified compute layer to run it all.</p> <p><strong>That infrastructure layer is being built right now.</strong></p> <p>We're TrueFoundry, and we're building it. We're looking for a Senior/Staff Applied GenAI Researcher – Enterprise Outcome Team to join the team.</p> <h2><strong>The Problem We're Solving</strong></h2> <p>Companies are moving beyond simple chatbots to production agentic systems. These systems route between OpenAI, Anthropic, Google, and self-hosted models. They integrate dozens of tools via protocols like MCP. They orchestrate multi-agent workflows where agents coordinate with other agents.</p> <p>The infrastructure to support this doesn't exist yet. You can't just duct-tape together a few API calls and call it production-ready.</p> <p>You need a control plane that handles:</p> <ul> <li>Intelligent routing with observability, cost policies, and fallback logic</li> <li>Centralized tool and MCP server management with security and lifecycle controls</li> <li>Agent orchestration with governance and guardrails</li> <li>A unified compute layer to run self-hosted models, custom tools, and agents</li> </ul> <p>We've built two products to solve this:</p> <p><strong>AI Gateway</strong> is the control plane, five composable components (Prompts, LLM Gateway, MCP Gateway, Guardrails, Agent Gateway) that handle routing, orchestration, and governance.</p> <p><strong>AI Deploy</strong> is the compute layer, a Kubernetes-based platform that abstracts ML workloads as standard software primitives, so everything runs on unified infrastructure.</p> <p>We're Series A, backed by Intel Capital and Sequoia. Companies like CVS, Mastercard, Siemens, Paytm, Synopsys, and Zscaler run production AI workloads on our platform.</p> <h3><strong>What You’ll Do:</strong></h3> <ul> <li>Build and productionize <strong>LLM-based</strong> and <strong>ML-based</strong> solutions, utilizing both open-source and proprietary models</li> <li>Integrate TrueFoundry’s platform seamlessly into customer environments and leverage it to expedite the time to value of developing these applications&nbsp;</li> <li>Build agents, write prompts, eval sets, optimize inference time and response quality for applications&nbsp;&nbsp;</li> <li>Write <strong>maintainable production-quality high-performance code</strong> frequently in Python</li> <li>Build and optimize <strong>REST APIs</strong>, <strong>gRPC services</strong>, and <strong>data pipelines</strong><strong><br></strong></li> <li>Drive <strong>rapid feedback loops</strong> from customer deployments into continuous improvements for product and platform</li> <li>Participate in <strong>solution architecture design</strong>, <strong>code reviews</strong>, and engineering best practices adoption</li> </ul> <h3><strong>Who You Are:</strong></h3> <ul> <li>4+ years experience building and deploying ML applications in production.&nbsp;</li> <li>4+ years experience writing production code in python&nbsp;</li> <li>2+ years working in deep learning and Natural language processing</li> <li>1+ year experience building Agentic applications and GenAI Apps</li> <li>Experience building <strong>REST APIs</strong>, working with <strong>Docker</strong>, and setting up <strong>CI/CD pipelines</strong><strong><br></strong></li> </ul> <h3><strong>Deep familiarity with Pytorch, HuggingFace libraries&nbsp;</strong></h3> <ul> <li><strong>Working knowledge of model servers like vLLM, Triton, TensorRT is preferred</strong></li> <li>Understanding of <strong>Kubernetes</strong>, <strong>distributed systems architecture</strong>, and <strong>cloud-native technologies is preferred&nbsp;</strong></li> <li>Strong system design abilities, with a focus on <strong>modular, reliable, and scalable architecture</strong></li> <li>Passionate about applying AI to solve <strong>real-world, cross-industry problems</strong></li> <li>Familiarity with <strong>LLM fine-tuning</strong>, <strong>RAG (Retrieval-Augmented Generation)</strong>, <strong>prompt engineering</strong>, or <strong>evaluation frameworks</strong><strong><br></strong></li> </ul> <h3><strong>&nbsp;Why Join TrueFoundry</strong></h3> <ul> <li>Build foundational Applied GenAI solutions alongside <strong>world-class engineers</strong> (ex-Facebook Infrastructure leaders)</li> <li>Work on <strong>real-world, high-impact problems</strong> across multiple industries</li> <li>Collaborate directly with <strong>founders and early leadership</strong> on shaping company and product direction</li> <li>Enjoy a <strong>flexible, ownership-driven work environment</strong> with rapid career growth</li> <li>Weekly learning sessions, team-building activities, and startup mentorship opportunities</li> <li><strong>Learning credits</strong> and resources to help you grow your technical and professional skills</li> </ul>

755,000+ hidden jobs like this

TrueFoundry and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.