Back to all jobs
W

Director of Product Management, W&B Weave (AI Agents & Evaluation Platform)- W&B

weights_and_biases

San Fransisco2d ago
Seniority
Lead

About the role

<div class="content-intro"><div id="message-list_1758129707.765969" class="c-virtual_list__item" data-qa="virtual-list-item" data-item-key="1758129707.765969"> <div class="c-message_kit__background c-message_kit__background--hovered p-message_pane_message__message c-message_kit__message" data-qa="message_container" data-qa-unprocessed="false" data-qa-placeholder="false"> <div class="c-message_kit__hover c-message_kit__hover--hovered" data-qa-hover="true"> <div class="c-message_kit__actions c-message_kit__actions--default"> <div class="c-message_kit__gutter"> <div class="c-message_kit__gutter__right" data-qa="message_content"> <div class="c-message_kit__blocks c-message_kit__blocks--rich_text"> <div class="c-message__message_blocks c-message__message_blocks--rich_text" data-qa="message-text"> <div class="p-block_kit_renderer" data-qa="block-kit-renderer"> <div class="p-block_kit_renderer__block_wrapper p-block_kit_renderer__block_wrapper--first"> <div class="p-rich_text_block"> <div class="p-rich_text_section">CoreWeave, the AI Hyperscaler™, acquired Weights &amp; Biases to create the most powerful end-to-end platform to develop, deploy, and iterate AI faster. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe, and was ranked as one of the TIME100 most influential companies of 2024. By bringing together CoreWeave’s industry-leading cloud infrastructure with the best-in-class tools AI practitioners know and love from Weights &amp; Biases, we’re setting a new standard for how AI is built, trained, and scaled.</div> <div class="p-rich_text_section"><br>The integration of our teams and technologies is accelerating our shared mission: to empower developers with the tools and infrastructure they need to push the boundaries of what AI can do. From experiment tracking and model optimization to high-performance training clusters, agent building, and inference at scale, we’re combining forces to serve the full AI lifecycle — all in one seamless platform.</div> <div class="p-rich_text_section"><br>Weights &amp; Biases has long been trusted by over 1,500 organizations — including AstraZeneca, Canva, Cohere, OpenAI, Meta, Snowflake, Square,Toyota, and Wayve — to build better models, AI agents and applications. Now, as part of CoreWeave, that impact is amplified across a broader ecosystem of AI innovators, researchers, and enterprises.</div> <div class="p-rich_text_section"><br>As we unite under one vision, we’re looking for bold thinkers and agile builders who are excited to shape the future of AI alongside us. If you're passionate about solving complex problems at the intersection of software, hardware, and AI, there's never been a more exciting time to join our team.</div> </div> </div> </div> </div> </div> </div> </div> </div> </div> </div> </div></div><h3><span style="font-size: 12pt;"><strong>What You’ll Do:</strong></span></h3> <p><span style="font-size: 12pt;">As Director of Product Management for Weights &amp; Biases Weave, you will define and scale the platform that AI developers rely on to build, evaluate, and operate AI agents in production.</span></p> <p><span style="font-size: 12pt;">You will own the vision, roadmap, and execution for Weave—focusing on agent tracing, evaluation workflows, and production monitoring—ensuring developers can confidently ship reliable, high-performing AI systems.</span></p> <p><span style="font-size: 12pt;">This role sits at the intersection of LLMs, developer tooling, and production infrastructure, requiring both deep technical fluency and strong product intuition.</span></p> <h3><span style="font-size: 12pt;"><strong>About the role</strong></span></h3> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Own the Weave product vision and roadmap, focused on enabling developers to build, evaluate, and monitor AI agents end-to-end</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Define how developers trace and debug agent behavior, including multi-step workflows, tool use, and reasoning chains</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Lead the development of evaluation systems (evals) that allow teams to measure agent quality, correctness, and performance over time</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Drive innovation in production monitoring and observability for AI systems, including logging, metrics, feedback loops, and drift detection</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Build workflows that enable rapid iteration—from experimentation to production—closing the loop between evaluation and deployment</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Partner closely with engineering to design systems for high-scale data ingestion, real-time analysis, and developer-facing APIs/SDKs</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Lead cross-functional initiatives across product, engineering, design, GTM, and customer teams to deliver cohesive, developer-first experiences</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Engage directly with customers building cutting-edge AI agents to deeply understand their workflows, pain points, and emerging needs</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Define success metrics and ensure Weave delivers measurable improvements in developer velocity, agent quality, and production reliability</span></li> </ul> <h2><span style="font-size: 12pt;"><strong>Location</strong></span></h2> <p><span style="font-size: 12pt;">This role is based in San Francisco, CA and requires in-office presence at least 3 days per week to support close collaboration with engineering, design, and go-to-market teams.</span></p> <p><span style="font-size: 12pt;"><strong>Who You Are:&nbsp;</strong></span></p> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;">7+ years of product management experience, with a strong focus on developer platforms, AI/ML tools, or data/observability systems</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Experience building products for AI developers, particularly around LLMs, agents, or ML workflows</span></li> </ul> <h3><span style="font-size: 12pt;"><strong>AI Agents &amp; Evaluation Expertise</strong></span></h3> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Deep understanding of LLM-powered applications and agent architectures (tool use, RAG, orchestration frameworks, etc.)</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Experience with or strong intuition for evaluation systems (evals), including benchmarking, human-in-the-loop feedback, or automated scoring</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Familiarity with the challenges of measuring quality in non-deterministic systems</span></li> </ul> <h3><span style="font-size: 12pt;"><strong>Observability &amp; Production Systems</strong></span></h3> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Experience building or working with observability, logging, monitoring, or debugging tools</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Understanding of production challenges for AI systems, including latency, cost, reliability, drift, and failure modes</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Ability to reason about data pipelines, telemetry, and real-time systems</span></li> </ul> <h3><span style="font-size: 12pt;"><strong>Technical Fluency</strong></span></h3> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Comfortable working closely with engineers on APIs, SDKs, system design, and data models</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Able to discuss trade-offs in scalability, performance, cost, and developer experience</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Familiarity with modern AI tooling ecosystems (e.g., LangChain, OpenAI APIs, vector DBs, etc.) is a plus</span></li> </ul> <h3><span style="font-size: 12pt;"><strong>Customer &amp; Product Mindset</strong></span></h3> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Strong empathy for developers building and operating AI systems</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Passion for creating intuitive, high-leverage tools that abstract complexity without limiting flexibility</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Ability to translate ambiguous, emerging workflows into clear product direction</span></li> </ul> <h3><span style="font-size: 12pt;"><strong>Execution &amp; Leadership</strong></span></h3> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Proven ability to lead complex, cross-functional initiatives from concept to launch</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Comfortable operating in fast-moving, ambiguous environments with evolving technology landscapes</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Strong prioritization skills, balancing innovation with reliability and usability</span></li> </ul> <h2><span style="font-size: 12pt;"><strong>Preferred</strong></span></h2> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Experience building products for AI agent development, evaluation, or observability</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Background in ML experimentation, model evaluation, or data-centric AI workflows</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Familiarity with W&amp;B, Weave, or similar MLOps / LLMOps tools</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Experience with developer-first products (APIs, SDKs, CLI tools)</span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;">Exposure to data infrastructure, analytics platforms, or event-driven systems</span></li> </ul> <h2><span style="font-size: 12pt;"><strong>Why This Role Matters</strong></span></h2> <ul> <li><span style="font-size: 12pt;">AI is rapidly shifting from static models to dynamic, agent-based systems. These systems are harder to understand, harder to evaluate, and harder to trust in production.</span></li> <li><span style="font-size: 12pt;">Weave is building the platform that makes them observable, measurable, and reliable.</span></li> <li><span style="font-size: 12pt;">You’ll play a key role in defining how the next generation of AI applications is built and operated.</span></li> </ul> <h3><span style="font-size: 12pt;"><strong>Why Us?</strong></span></h3> <p><span style="font-size: 12pt;">We work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:</span></p> <ul> <li style="font-size: 12pt;"><span style="font-size: 12pt;"><strong>Be Curious at Your Core</strong></span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;"><strong>Act Like an Owner</strong></span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;"><strong>Empower Employees</strong></span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;"><strong>Deliver Best-in-Class Client Experiences</strong></span></li> <li style="font-size: 12pt;"><span style="font-size: 12pt;"><strong>Achieve More Together</strong></span></li> </ul> <p><span style="font-size: 12pt;">We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for takeoff, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!<br><br>The base salary range for this role is $206,000 to $303,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).&nbsp;<br></span></p><div class="content-conclusion"><p><strong>What We Offer</strong></p> <p>The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.</p> <p>In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other locations, benefits vary and are shared during the hiring process. These include:</p> <ul> <li>Medical, dental, and vision insurance - 100% paid for by CoreWeave</li> <li>Company-paid Life Insurance&nbsp;</li> <li>Voluntary supplemental life insurance&nbsp;</li> <li>Short and long-term disability insurance&nbsp;</li> <li>Flexible Spending Account</li> <li>Health Savings Account</li> <li>Tuition Reimbursement&nbsp;</li> <li>Ability to Participate in Employee Stock Purchase Program (ESPP)</li> <li>Mental Wellness Benefits through Spring Health&nbsp;</li> <li>Family-Forming support provided by Carrot</li> <li>Paid Parental Leave&nbsp;</li> <li>Flexible, full-service childcare support with Kinside</li> <li>401(k) with a generous employer match</li> <li>Flexible PTO</li> <li>Catered lunch each day in our office and data center locations</li> <li>A casual work environment</li> <li>A work culture focused on innovative disruption</li> </ul> <p><strong>California Applicants</strong></p> <p><a href="https://drive.google.com/file/d/1gPBRBhUNAMBmj7Yn4_M-hCugm7ZJD4hr/view?usp=sharing">California Consumer Privacy Act&nbsp;</a></p> <p><strong>Equal Opportunity &amp; Accommodations</strong></p> <p><em>CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.</em></p> <p><em>As part of this commitment and consistent with the </em><a href="https://www.eeoc.gov/laws/guidance/fact-sheet-disability-discrimination"><em>Americans with Disabilities Act (ADA)</em></a><em>, CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: </em><a href="mailto:careers@coreweave.com"><em>careers@coreweave.com</em></a><em>.&nbsp;</em></p> <p><strong>Export Control Compliance</strong></p> <p>This position requires access to export controlled information.&nbsp; To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency.&nbsp; CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.</p></div>

Perks & benefits

  • 401k
  • Vision Insurance
  • Unlimited Vacation
  • Paid Time Off
  • Pension Matching
  • Mental Wellness Budget
  • Equity Compensation

758,000+ hidden jobs like this

weights_and_biases and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.