AI/ML Engineer

San Jose · 12h ago

About the role

<div class="content-intro"><p><span data-teams="true">Astera Labs (NASDAQ: ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions. By collaborating with hyperscalers and ecosystem partners, Astera Labs enables organizations to unlock the full potential of modern AI. Astera Labs’ Intelligent Connectivity Platform integrates CXL®, Ethernet, NVLink, PCIe®, and UALink™ semiconductor-based technologies with the company’s COSMOS software suite to unify diverse components into cohesive, flexible systems that deliver end-to-end scale-up and scale-out connectivity. The company’s custom connectivity solutions business complements its standards-based portfolio, enabling customers to deploy tailored architectures to meet their unique infrastructure requirements. Discover more at <a id="menurhut" class="fui-Link ___1q1shib f2hkw1w f3rmtva f1ewtqcl fyind8e f1k6fduh f1w7gpdv fk6fouc fjoy568 figsok6 f1s184ao f1mk8lai fnbmjn9 f1o700av f13mvf36 f1cmlufx f9n3di6 f1ids18y f1tx3yz7 f1deo86v f1eh06m1 f1iescvh fhgqx19 f1olyrje f1p93eir f1nev41a f1h8hb77 f1lqvz6u f10aw75t fsle3fq f17ae5zn" href="http://www.asteralabs.com/" target="_blank">www.asteralabs.com</a>.</span></p></div><p></p> <h2>AI/ML Engineer</h2> <p><strong>Location:</strong> San Jose, CA<br><strong>Experience:</strong> 1–5 years<br><strong>Team:</strong> Applied AI</p> <p> </p> <h3>The role</h3> <p>We’re hiring an AI/ML Engineer to build production AI systems for technical users. 
This is an applied engineering role for someone who can take modern model capabilities and turn them into reliable systems that people actually use.</p> <p>The core problems in this role are the same ones that matter in modern applied AI: getting the right context into the system, making tool use reliable, designing useful abstractions around skills and workflows, building evals that reflect real tasks, and iterating until the system is good enough to become part of a team’s daily workflow.</p> <p>In practice, you might work on coding agents in terminal and IDE environments, verification and debug assistants, log-analysis systems tied to real product diagnostics, documentation and spec-comparison agents, or internal assistants that operate over company knowledge and engineering data. You will be expected to think end-to-end: prompt and context design, retrieval quality, tool interfaces, evals, failure modes, deployment, and ongoing improvement.</p> <p> </p> <h3>What you’ll do</h3> <ul> <li>Build AI applications and agentic workflows for engineering productivity, diagnostics, search, documentation, and workflow automation.</li> <li>Design systems that combine LLMs with retrieval, tool use, structured outputs, and evaluation loops.</li> <li>Integrate models with internal tools, APIs, CLIs, MCP interfaces, and operational workflows so they can do useful work in real environments.</li> <li>Improve system quality through eval design, prompt and context iteration, model selection, failure analysis, and human feedback.</li> <li>Build reusable skills, workflows, and abstractions so useful capabilities can be shared across agents and teams instead of rebuilt from scratch.</li> <li>Work closely with infrastructure and domain teams to deploy, monitor, and continuously improve AI systems in production.</li> </ul> <h3>What we’re looking for</h3> <ul> <li>1–5 years of experience in software engineering, applied AI, ML engineering, or related backend/platform roles.</li> 
<li>Strong Python skills and strong production engineering fundamentals.</li> <li>Hands-on experience building AI/LLM applications, agents, retrieval-backed systems, or workflow automation.</li> <li>Comfort working with tool-using systems where correctness depends on context quality, tool integration, and careful failure handling.</li> <li>Experience with AWS or GCP and the realities of deploying and debugging production AI services.</li> <li>Good judgment around evals, failure modes, latency/cost tradeoffs, and safe rollout of non-deterministic systems.</li> <li>Clear communication and the ability to turn ambiguous technical workflows into robust product behavior.</li> </ul> <h3>What strong candidates often look like</h3> <p>They have built more than demos. They have worked on systems where retrieval quality matters, where tool use can fail in subtle ways, where evaluation changes engineering decisions, and where product usefulness depends as much on system design as on model choice. They usually care about the details that separate a clever prototype from a dependable system.</p> <p> </p> <h3>Why this role is interesting</h3> <p>The team’s direction is very concrete: enterprise search, coding agents, workspace automation, customized skills, and agentic applications for specific engineering problems, all measured against real usage and outcomes. This role sits directly in that path. 
If you want to build applied AI systems that are ambitious but grounded in real workflows, technical users, and fast feedback loops, this is that job.</p> <p><span data-teams="true"> </span></p> <p>The base pay range for this position is $140,000–$165,000.</p><div class="content-conclusion"><p>We know that creativity and innovation happen more often when teams include diverse ideas, backgrounds, and experiences, and we actively encourage everyone with relevant experience to apply, including people of color, LGBTQ+ and non-binary people, veterans, parents, and individuals with disabilities.</p></div>
