Back to all jobs
Cerebras Systems logo

AI Models, Product Manager

Cerebras Systems
Sunnyvale2w ago

About the role

<div class="content-intro"><p><span data-contrast="none">Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.&nbsp;</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;201341983&quot;:0,&quot;335559685&quot;:0,&quot;335559737&quot;:240,&quot;335559738&quot;:240,&quot;335559739&quot;:240,&quot;335559740&quot;:279}">&nbsp;</span></p> <p>Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups.&nbsp;<a href="https://openai.com/index/cerebras-partnership/">OpenAI recently announced a multi-year partnership with Cerebras</a>, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.&nbsp;</p> <p>Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.</p></div><h4 id="Own-the-Future-of-AI-Inference" data-local-id="27e2ad7d-fed5-4399-8636-9d7785e16b0a" data-renderer-start-pos="568">Own the Future of AI Inference</h4> <p data-renderer-start-pos="600" data-local-id="d08d13f3-579c-4c92-a359-e166de311ab9">Cerebras powers the world's fastest AI inference. As the Product Manager for AI Models, you'll lead the strategic model portfolio that defines our product — deciding which models ship, how they perform, and how the world discovers them.</p> <p data-renderer-start-pos="838" data-local-id="4c18318d-cea4-42d1-83d7-1ecaf41c2c13">You'll partner directly with leading AI labs, drive launches that shape the industry, and ensure every model on our platform delivers exceptional quality at unprecedented speed.</p> <h4 id="What-You'll-Own" data-local-id="eeeca7a0-9050-41be-8319-446a655c863f" data-renderer-start-pos="1017">What You'll Own</h4> <h5 data-renderer-start-pos="1034" data-local-id="17edc850-f094-4f73-bac4-dfeb01e88274"><strong data-renderer-mark="true">Strategic Model Portfolio</strong></h5> <ul> <li data-renderer-start-pos="1063" data-local-id="d0add9b4-eed3-4ebd-b8ed-a5ec55d2771d">Own the models roadmap: decide which frontier and open-source models we support based on market demand, research trends, and strategic fit</li> <li>Establish partnerships with top model labs, for day0 launches</li> <li data-renderer-start-pos="1337" data-local-id="95183413-f570-4c16-aa98-01fe08d20b3e">Build relationships with open-source maintainers to accelerate community model adoption</li> </ul> <h5 data-renderer-start-pos="1428" data-local-id="e826256c-0d1e-4dca-856e-c215a14811f1"><strong data-renderer-mark="true">Product Quality &amp; Customer Success</strong></h5> <ul> <li data-renderer-start-pos="1466" data-local-id="588f966a-b91b-47e1-ad23-d6dd1ccc7aff">Define and enforce quality standards across our model catalog through systematic evaluation frameworks</li> <li data-renderer-start-pos="1572" data-local-id="4eac9d2c-357c-4600-b3a5-f2010e31a38b">Design benchmarks and evaluations that prove our models deliver production-grade performance</li> <li data-renderer-start-pos="1668" data-local-id="8499755a-f89b-4bdd-837c-b4cfe9529509">Own the feedback loop: gather customer insights, identify model weaknesses, and drive improvements with engineering</li> <li data-renderer-start-pos="1787" data-local-id="6d6c027f-6fa9-4566-bcba-d211a4c7e48c">Enable strategic customers to integrate our inference into their products—removing blockers and optimizing for their specific use cases</li> </ul> <h5 data-renderer-start-pos="1926" data-local-id="1f173345-61b0-4a9d-a1a7-a2f25777b965"><strong data-renderer-mark="true">Go-to-Market Excellence</strong></h5> <ul> <li data-renderer-start-pos="1953" data-local-id="97f8725a-d295-448a-933f-887961129253">Lead high-impact model launches that generate buzz and adoption</li> <li data-renderer-start-pos="2020" data-local-id="590f3a84-e495-446f-a067-7f561fa6bda1">Create compelling product marketing: demos, benchmarks, tutorials, and documentation that showcase what's possible on Cerebras</li> <li data-renderer-start-pos="2150" data-local-id="5959aee9-0d04-4141-a689-60a5627721b9">Craft technical content that resonates with developers and decision-makers alike</li> </ul> <h5 data-renderer-start-pos="2234" data-local-id="1380e20b-942a-4943-bf94-7ba94ad16ef7"><strong data-renderer-mark="true">Technical Decision-Making</strong></h5> <ul> <li data-renderer-start-pos="2263" data-local-id="8f72f63f-9009-4b7d-bfd7-85959b9f9356">Select and prioritize performance optimizations (quantization, speculative decoding, etc.) based on customer needs and hardware capabilities</li> <li data-renderer-start-pos="2407" data-local-id="23c1369d-9596-43a5-9dcc-917d707b4995">Collaborate with optimization engineers to implement techniques that maximize our speed advantage</li> <li data-renderer-start-pos="2508" data-local-id="9941a635-03b0-4e4e-9e0c-58166283ab6e">Balance tradeoffs between quality, latency, throughput, and cost</li> </ul> <h5 data-renderer-start-pos="2576" data-local-id="a14e7356-3ad2-41ff-a1f2-97ebd6ff9099"><strong data-renderer-mark="true">Cross-Functional Leadership</strong></h5> <ul> <li data-renderer-start-pos="2607" data-local-id="3d5647fe-0a96-4a21-954b-83f67f9d1beb">Orchestrate launches across model enablement, optimization engineering, deployment, sales, and marketing</li> <li data-renderer-start-pos="2715" data-local-id="a04def27-419a-4888-95d2-03689bd72e43">Drive alignment in a fast-moving environment where priorities shift based on model releases and customer needs</li> <li data-renderer-start-pos="2829" data-local-id="f9d57696-aa75-4225-87b8-326aaf2197b8">Be the voice of the customer to engineering and the voice of product to customers</li> </ul> <h4><span data-contrast="none"><span data-ccp-parastyle="heading 3">Skills &amp; </span><span data-ccp-parastyle="heading 3">Q</span><span data-ccp-parastyle="heading 3">ualifications </span></span><span data-ccp-props="{&quot;134245418&quot;:true,&quot;134245529&quot;:true,&quot;335559738&quot;:160,&quot;335559739&quot;:80}">&nbsp;</span></h4> <p data-renderer-start-pos="1822" data-local-id="624bebb2-0271-4f7d-a98c-5fc9affc8d4f"><strong data-renderer-mark="true">What we need to see: </strong></p> <ol> <li data-renderer-start-pos="1847" data-local-id="45617d6f-ef29-41e9-80f5-d074a04e6842">5+ years of experience as a product manager, currently at or above the level of Senior PM.</li> <li data-renderer-start-pos="1941" data-local-id="d51fcb21-99e4-4b45-9f50-7ceffaa4eafe">5+ years of total technical work experience (e.g. SWE, ML researcher, solution engineer).</li> <li data-renderer-start-pos="2034" data-local-id="875a46b2-7896-4226-aa1e-b6a37009e9a4">Ability to thrive in a fast-paced, dynamic environment. With an entrepreneurial sense of ownership and ability to lead projects.</li> <li data-renderer-start-pos="2166" data-local-id="175f896f-2ef2-47ba-bc51-b0da641da0f4">Knowledge and passion for the worlds of open-source models and generative AI research.</li> <li data-renderer-start-pos="2256" data-local-id="77b72ccf-a333-413e-8b6b-786125c80757">Knowledge of the community model ecosystem, including: PyTorch, Hugging Face, vLLM, and SGLang.</li> <li data-renderer-start-pos="2355" data-local-id="550d7bd6-45a8-4351-a03d-1ba918701868">Highly motivated, independent, organized, and an effective communicator.</li> <li data-renderer-start-pos="2431" data-local-id="83403608-1303-43b1-9930-93766de89439">Comfortable using Python with the chat completions API, for basic model testing.</li> </ol> <p><strong><span data-contrast="none">Preferred requirements</span></strong><span data-contrast="none"> </span><span data-ccp-props="{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335557856&quot;:16777215,&quot;335559740&quot;:240}">&nbsp;</span></p> <p data-renderer-start-pos="2539" data-local-id="7f477a7a-fb9d-4d3b-9cc0-968cef82c1c2"><strong data-renderer-mark="true">How to stand out: </strong></p> <ol> <li data-renderer-start-pos="2561" data-local-id="cb263827-9bab-4c3a-ba3e-d31af4ee8b14">Product manager experience at a model training lab or a company that implements open-source models.</li> <li data-renderer-start-pos="2664" data-local-id="cb263827-9bab-4c3a-ba3e-d31af4ee8b14">Experience working with customers in a solution engineering role.</li> <li data-renderer-start-pos="2733" data-local-id="8b1c7bff-2da6-40d7-8b4e-6a960fa282b8">Experience writing technical marketing assets and social media, with a growing portfolio.</li> <li data-renderer-start-pos="2826" data-local-id="54917830-9aa8-4bfc-9461-091adbc87688">Experience working in a cross-functional organization, and leading projects across multiple teams.</li> <li data-renderer-start-pos="2929" data-local-id="cda6b135-8c89-474c-b876-df140b9e809e">Experience writing model quality evaluations and system prompt harnesses.</li> <li data-renderer-start-pos="3006" data-local-id="6cf3729f-e823-4e82-9cbd-6a2cc928f8ca">Experience writing application code in use cases such as code generation or deep research search application.</li> <li data-renderer-start-pos="3119" data-local-id="99856fb9-adf7-4b0a-8333-f828e33c78c7">Expertise on agentic flows and current LLM model family architectures.</li> <li data-renderer-start-pos="3193" data-local-id="5b6a3c3d-4159-4d2b-b5e5-7511a65d277c">Understanding of model compilers and optimization.</li> <li data-renderer-start-pos="3247" data-local-id="afac169a-9514-4194-bcc3-86b7b08ab9ab">Contributor to communities like vLLM, SGLang, PyTorch, or Hugging Face transformers.</li> <li data-renderer-start-pos="3335" data-local-id="0ae05fb0-64ed-4710-9a57-7be9a5b1949b">Experience with model optimization or compression methods like quantization.</li> </ol> <p><strong><span data-contrast="none">Location </span></strong><span data-contrast="none"> </span><span data-ccp-props="{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335557856&quot;:16777215,&quot;335559740&quot;:240}">&nbsp;</span></p> <ul> <li data-renderer-start-pos="3429" data-local-id="194fdff3-9085-48e9-891b-b24272b27346">Hybrid at our Sunnyvale, California or Toronto, Canada office.</li> <li><span data-contrast="none">Remote possible for candidates willing to travel 1-2x per quarter.  </span><span data-ccp-props="{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335557856&quot;:16777215,&quot;335559740&quot;:240}">&nbsp;</span></li> </ul><div class="content-conclusion"><h4><strong>Why Join Cerebras</strong></h4> <p>People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection&nbsp; point in our business. Members of our team tell us there are five main reasons they joined Cerebras:</p> <ol> <li>Build a breakthrough AI platform beyond the constraints of the GPU.</li> <li>Publish and open source their cutting-edge AI research.</li> <li>Work on one of the fastest AI supercomputers in the world.</li> <li>Enjoy job stability with startup vitality.</li> <li>Our simple, non-corporate work culture that respects individual beliefs.</li> </ol> <p>Read our blog:&nbsp;<a href="https://www.cerebras.net/blog/5-reasons-to-join-cerebras" target="_blank" data-auth="NotApplicable" data-linkindex="0">Five Reasons to Join Cerebras in 2026.</a></p> <h4>Apply today and become part of the forefront of groundbreaking advancements in AI!</h4> <hr> <p><em>Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer.&nbsp;</em><em>We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. </em><em>We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.</em></p> <hr> <p><em>This website or its third-party tools process personal data. For more details, click <a href="https://www.cerebras.net/privacy/" target="_blank">here</a> to review our CCPA disclosure notice.</em></p></div>

753,000+ hidden jobs like this

Cerebras Systems and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.