Back to all jobs
T
Senior/Staff Applied GenAI Researcher – Enterprise Outcome Team
TrueFoundry
San Mateo4mo ago
- Seniority
- Staff
About the role
<p><strong>About TrueFoundry</strong><br><br>Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure. A way to route between models. A way to manage tools and integrate them securely. A way to orchestrate agents and enforce governance. A unified compute layer to run it all.</p>
<p><strong>That infrastructure layer is being built right now.</strong></p>
<p>We're TrueFoundry, and we're building it. We're looking for a Senior/Staff Applied GenAI Researcher – Enterprise Outcome Team to join the team.</p>
<h2><strong>The Problem We're Solving</strong></h2>
<p>Companies are moving beyond simple chatbots to production agentic systems. These systems route between OpenAI, Anthropic, Google, and self-hosted models. They integrate dozens of tools via protocols like MCP. They orchestrate multi-agent workflows where agents coordinate with other agents.</p>
<p>The infrastructure to support this doesn't exist yet. You can't just duct-tape together a few API calls and call it production-ready.</p>
<p>You need a control plane that handles:</p>
<ul>
<li>Intelligent routing with observability, cost policies, and fallback logic</li>
<li>Centralized tool and MCP server management with security and lifecycle controls</li>
<li>Agent orchestration with governance and guardrails</li>
<li>A unified compute layer to run self-hosted models, custom tools, and agents</li>
</ul>
<p>We've built two products to solve this:</p>
<p><strong>AI Gateway</strong> is the control plane, five composable components (Prompts, LLM Gateway, MCP Gateway, Guardrails, Agent Gateway) that handle routing, orchestration, and governance.</p>
<p><strong>AI Deploy</strong> is the compute layer, a Kubernetes-based platform that abstracts ML workloads as standard software primitives, so everything runs on unified infrastructure.</p>
<p>We're Series A, backed by Intel Capital and Sequoia. Companies like CVS, Mastercard, Siemens, Paytm, Synopsys, and Zscaler run production AI workloads on our platform.</p>
<h3><strong>What You’ll Do:</strong></h3>
<ul>
<li>Build and productionize <strong>LLM-based</strong> and <strong>ML-based</strong> solutions, utilizing both open-source and proprietary models</li>
<li>Integrate TrueFoundry’s platform seamlessly into customer environments and leverage it to expedite the time to value of developing these applications </li>
<li>Build agents, write prompts, eval sets, optimize inference time and response quality for applications </li>
<li>Write <strong>maintainable production-quality high-performance code</strong> frequently in Python</li>
<li>Build and optimize <strong>REST APIs</strong>, <strong>gRPC services</strong>, and <strong>data pipelines</strong><strong><br></strong></li>
<li>Drive <strong>rapid feedback loops</strong> from customer deployments into continuous improvements for product and platform</li>
<li>Participate in <strong>solution architecture design</strong>, <strong>code reviews</strong>, and engineering best practices adoption</li>
</ul>
<h3><strong>Who You Are:</strong></h3>
<ul>
<li>4+ years experience building and deploying ML applications in production. </li>
<li>4+ years experience writing production code in python </li>
<li>2+ years working in deep learning and Natural language processing</li>
<li>1+ year experience building Agentic applications and GenAI Apps</li>
<li>Experience building <strong>REST APIs</strong>, working with <strong>Docker</strong>, and setting up <strong>CI/CD pipelines</strong><strong><br></strong></li>
</ul>
<h3><strong>Deep familiarity with Pytorch, HuggingFace libraries </strong></h3>
<ul>
<li><strong>Working knowledge of model servers like vLLM, Triton, TensorRT is preferred</strong></li>
<li>Understanding of <strong>Kubernetes</strong>, <strong>distributed systems architecture</strong>, and <strong>cloud-native technologies is preferred </strong></li>
<li>Strong system design abilities, with a focus on <strong>modular, reliable, and scalable architecture</strong></li>
<li>Passionate about applying AI to solve <strong>real-world, cross-industry problems</strong></li>
<li>Familiarity with <strong>LLM fine-tuning</strong>, <strong>RAG (Retrieval-Augmented Generation)</strong>, <strong>prompt engineering</strong>, or <strong>evaluation frameworks</strong><strong><br></strong></li>
</ul>
<h3><strong> Why Join TrueFoundry</strong></h3>
<ul>
<li>Build foundational Applied GenAI solutions alongside <strong>world-class engineers</strong> (ex-Facebook Infrastructure leaders)</li>
<li>Work on <strong>real-world, high-impact problems</strong> across multiple industries</li>
<li>Collaborate directly with <strong>founders and early leadership</strong> on shaping company and product direction</li>
<li>Enjoy a <strong>flexible, ownership-driven work environment</strong> with rapid career growth</li>
<li>Weekly learning sessions, team-building activities, and startup mentorship opportunities</li>
<li><strong>Learning credits</strong> and resources to help you grow your technical and professional skills</li>
</ul>
755,000+ hidden jobs like this
TrueFoundry and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites