Technical Program Manager
RadixArk
Palo Alto (posted 3w ago)
<div data-page-id="X3SxdPrWNohrkPxaMzwjvqYopUe" data-lark-html-role="root" data-docx-has-block-data="false">
<h3 class="heading-3 ace-line old-record-id-R6RTdvC28o2EVoxHXRRjBzz9pdb">About the Role</h3>
<div class="ace-line ace-line old-record-id-QbRkdsoFhorPqFxhMmvjX3p9p3d">As a Technical Program Manager at RadixArk, you'll drive the execution of complex, cross-functional programs across our inference and training infrastructure. You'll partner closely with Product Management, Research, and Engineering to turn ambitious technical roadmaps into shipped reality, coordinating across kernel teams, distributed systems engineers, and external partners to deliver infrastructure that serves billions of tokens daily and coordinates training runs across 10,000+ GPUs.</div>
<div class="ace-line ace-line old-record-id-NZJbdfd1Rok4X9xDvIGjS7mzpnh">This role is for someone who thrives at the intersection of deep technical understanding and rigorous program execution. You'll own the "how" and "when" of our most critical initiatives.</div>
<h3 class="heading-3 ace-line old-record-id-RWRadq4RUoUJTZxXkzCj1ELwphb">Key Responsibilities</h3>
<h4 class="heading-4 ace-line old-record-id-L2NkdrbOaoTMs8xWjoDjKcLbpYP">Program Execution & Delivery</h4>
<ul class="list-bullet1">
<li class="ace-line ace-line old-record-id-DNL0dkJCooruuYxixnujwclApBg" data-list="bullet">
<div>Drive end-to-end execution of large-scale, cross-functional programs spanning inference engines (e.g., SGLang), training frameworks (e.g., Miles), and hardware integration efforts.</div>
</li>
<li class="ace-line ace-line old-record-id-LgUNdshuEoyLOCx90wsjvqcypKc" data-list="bullet">
<div>Define program structure, including milestones, dependencies, critical paths, risks, and success criteria. Maintain a clear source of truth for status across all stakeholders.</div>
</li>
<li class="ace-line ace-line old-record-id-FSPUdOhnyojvpcxAvK1jVmSRpCg" data-list="bullet">
<div>Run design reviews, sprint planning, release readiness reviews, and post-mortems. Ensure decisions are documented and follow-ups are closed out.</div>
</li>
<li class="ace-line ace-line old-record-id-TkbddicUpoK4N4xoy4vj2GvNp3e" data-list="bullet">
<div>Identify and unblock cross-team dependencies across kernel, runtime, scheduler, networking, and model teams before they become release blockers.</div>
</li>
<li class="ace-line ace-line old-record-id-BXjgd8KFpoeV5BxhHrljxvnCpvc" data-list="bullet">
<div>Drive release management for major versions, including changelog ownership, compatibility validation, partner rollout sequencing, and rollback planning.</div>
</li>
</ul>
<h4 class="heading-4 ace-line old-record-id-Xi1wdhDk6ouWqFxuEDNjCh6fpLd">Technical Coordination</h4>
<ul class="list-bullet1">
<li class="ace-line ace-line old-record-id-LFDAdhQSjomKC8xWL1NjZrP0pac" data-list="bullet">
<div>Partner with Product Management to translate roadmap priorities into executable program plans, with clear scope, staffing, and timelines.</div>
</li>
<li class="ace-line ace-line old-record-id-NbE8dk4RWodRrWxW8Arjkb8wp3c" data-list="bullet">
<div>Work shoulder-to-shoulder with engineering leads on technical trade-off decisions; understand the architecture deeply enough to ask the right questions and surface hidden risks.</div>
</li>
<li class="ace-line ace-line old-record-id-BueCdCMfJoZRS0xsMCHj7W4NpRg" data-list="bullet">
<div>Coordinate hardware enablement programs with partners like Nvidia, Google, and AWS, including new accelerator bring-up, kernel co-development, and benchmark validation.</div>
</li>
<li class="ace-line ace-line old-record-id-EqoAda7Mio3urixoDBmjSzd9pgb" data-list="bullet">
<div>Manage integration programs with frontier AI labs and early adopters, ensuring technical requirements, SLAs, and feedback loops are well-defined.</div>
</li>
</ul>
<h4 class="heading-4 ace-line old-record-id-Yv2Udbb3doA8HnxVgnGjdnK1p0d">Operational Excellence</h4>
<ul class="list-bullet1">
<li class="ace-line ace-line old-record-id-Y6aEdufcpoXuoIxuRK1jb6ULpdb" data-list="bullet">
<div>Build and improve the engineering operating cadence, including standups, planning rituals, OKR tracking, dashboards, and reporting to leadership.</div>
</li>
<li class="ace-line ace-line old-record-id-MT6ad2S4HoMtNdxGOoTjQjpXpcg" data-list="bullet">
<div>Establish metrics and instrumentation for program health such as velocity, defect rates, benchmark regressions, and customer-reported issues, and drive accountability against them.</div>
</li>
<li class="ace-line ace-line old-record-id-ZGzjd9dfuo4TkdxwUsIjdQ7Apng" data-list="bullet">
<div>Lead incident response coordination for production issues affecting partners; own root-cause review and corrective-action tracking.</div>
</li>
<li class="ace-line ace-line old-record-id-LG6odfS2ForDeVx04LmjEvHbpUe" data-list="bullet">
<div>Improve developer productivity by identifying and removing systemic friction in our build, test, and release pipelines.</div>
</li>
</ul>
<h4 class="heading-4 ace-line old-record-id-XQwAdiLqao21VxxQD9KjegGeprh">Stakeholder Communication</h4>
<ul class="list-bullet1">
<li class="ace-line ace-line old-record-id-J4s2dwZHGoxAhrxmPQUjDYmQpoc" data-list="bullet">
<div>Serve as the connective tissue between engineering, product, GTM, and external partners, ensuring everyone has the right information at the right altitude.</div>
</li>
<li class="ace-line ace-line old-record-id-S49ZdMe3Lo2Yn8xicpTjFwMMpob" data-list="bullet">
<div>Produce clear, concise written updates for leadership and partners. Translate engineering progress into business-relevant signals.</div>
</li>
<li class="ace-line ace-line old-record-id-Fa1BdtwYsoq1NOxHKiyj5Q0Ypug" data-list="bullet">
<div>Represent program status honestly, including risks and slips, with concrete mitigation plans.</div>
</li>
</ul>
<h3 class="heading-3 ace-line old-record-id-IZX7dheIXoHGO7xUrRQjeoXNpeb">Qualifications</h3>
<h4 class="heading-4 ace-line old-record-id-GLi8dvo0YorQA2xLam8jjAUspVM">Minimum Requirements</h4>
<ul class="list-bullet1">
<li class="ace-line ace-line old-record-id-HLWMdBZdAohkkAxSToljNaMJpdd" data-list="bullet">
<div>Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.</div>
</li>
<li class="ace-line ace-line old-record-id-QQr8d0beFo75Bax9Nbvj0eUfpSd" data-list="bullet">
<div>4+ years of direct experience in Technical Program Management, Engineering Management, or a senior engineering role with significant program ownership, in a software or infrastructure company.</div>
</li>
<li class="ace-line ace-line old-record-id-YL97djrHSoB9XqxK0MgjKjPwpPe" data-list="bullet">
<div>Strong technical fluency in systems software, distributed systems, or AI/ML infrastructure; able to read code, follow architecture discussions, and challenge technical assumptions productively.</div>
</li>
<li class="ace-line ace-line old-record-id-S4XFdIbTVofBayxFA7XjmLx3pwe" data-list="bullet">
<div>Demonstrated track record shipping complex, multi-team programs on time, including managing dependencies, risks, and scope changes.</div>
</li>
<li class="ace-line ace-line old-record-id-WywndDtelobfIVxr2yDjFuCRpwg" data-list="bullet">
<div>Excellent written and verbal communication skills; able to drive alignment across engineers, executives, and external partners.</div>
</li>
</ul>
<h4 class="heading-4 ace-line old-record-id-B97addbg6oebwVxHLsjjqBIVpKd">Preferred (Bonus) Qualifications</h4>
<ul class="list-bullet1">
<li class="ace-line ace-line old-record-id-SZlCdkXdnohSFEx3HoIjMBB4pvc" data-list="bullet">
<div>Direct experience shipping AI/ML infrastructure such as inference engines, training frameworks, GPU kernels, distributed schedulers, or model serving platforms.</div>
</li>
<li class="ace-line ace-line old-record-id-UvUKdejfvogavKxI1RyjVbdIpBe" data-list="bullet">
<div>Hands-on coding background (Python, C++, CUDA) and comfort working in engineering codebases, including reading PRs, running benchmarks, and reproducing issues.</div>
</li>
<li class="ace-line ace-line old-record-id-Fy4adCyifonSBVxi1vZjLnZepWb" data-list="bullet">
<div>Experience coordinating with hardware vendors (Nvidia, AMD, Google TPU, AWS Trainium/Inferentia) on enablement or co-engineering programs.</div>
</li>
<li class="ace-line ace-line old-record-id-YfAddJ0NzoRjfVxHZqSjDNmIppd" data-list="bullet">
<div>Experience driving open-source release programs or working in OSS communities, including issue triage, RFC processes, and contributor coordination.</div>
</li>
<li class="ace-line ace-line old-record-id-GPKbdBbEEo7GlcxmS8EjY7krpKd" data-list="bullet">
<div>Familiarity with release engineering, CI/CD systems, and observability tooling for large-scale distributed systems.</div>
</li>
<li class="ace-line ace-line old-record-id-JI12dSdJAoc6e5x541JjUH8ople" data-list="bullet">
<div>Experience supporting B2B or developer-facing products with enterprise SLAs.</div>
</li>
</ul>
<div class="framer-1vu7ugc">
<div class="framer-173q4gz" data-framer-name="Job title" data-framer-component-type="RichTextContainer">
<h3 class="framer-text framer-styles-preset-gjn94e" data-styles-preset="KbAUpl97U">About RadixArk</h3>
</div>
<div class="framer-1ndb85j" data-framer-name="About RadixArk" data-framer-component-type="RichTextContainer">
<p class="framer-text framer-styles-preset-1g5o4kb">RadixArk is an infrastructure-first company built by engineers who've shipped production AI systems, created SGLang (20K+ GitHub stars, the fastest open LLM serving engine), and developed Miles (our large-scale RL framework).<br>We're on a mission to democratize frontier-level AI infrastructure by building world-class open systems for inference and training.<br>Our team has optimized kernels serving billions of tokens daily, designed distributed training systems coordinating 10,000+ GPUs, and contributed to infrastructure that powers leading AI companies and research labs.<br>We're backed by well-known infrastructure investors and partner with NVIDIA, Google, AWS, and frontier AI labs.<br>Join us in building infrastructure that gives real leverage back to the AI community.</p>
</div>
</div>
<div class="framer-1xo2adc">
<div class="framer-1ouiht" data-framer-name="Job title" data-framer-component-type="RichTextContainer">
<h3 class="framer-text framer-styles-preset-gjn94e" data-styles-preset="KbAUpl97U">Compensation</h3>
</div>
<div class="framer-3pynz3" data-framer-name="Compensation" data-framer-component-type="RichTextContainer">
<p class="framer-text framer-styles-preset-1g5o4kb">We offer competitive compensation for this 1-year residency program, with health benefits and potential for conversion to a full-time role. Compensation is determined by location and prior experience. Strong residents may receive offers to join RadixArk full-time with equity after program completion.</p>
</div>
</div>
<div class="framer-uwejmz">
<div class="framer-1oedabd" data-framer-name="Job title" data-framer-component-type="RichTextContainer">
<h3 class="framer-text framer-styles-preset-gjn94e" data-styles-preset="KbAUpl97U">Equal Opportunity</h3>
</div>
<div class="framer-1896wv9" data-framer-name="Equal Opportunity" data-framer-component-type="RichTextContainer">
<p class="framer-text framer-styles-preset-1g5o4kb">RadixArk is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.</p>
</div>
</div>
</div>
Perks & benefits
- Equity Compensation