Senior Machine Learning Engineer – Fine-Tuning and On-device AI

HP IQ
Palo Alto
7mo ago
Seniority
Senior

About the role

<div class="content-intro"><p><strong>Who We Are</strong></p> <p>HP IQ is HP’s new AI innovation lab. Combining startup agility with HP’s global scale, we’re building intelligent technologies that redefine how the world works, creates, and collaborates.</p> <p>We’re assembling a diverse, world-class team—engineers, designers, researchers, and product minds—focused on creating an intelligent ecosystem across HP’s portfolio. Together, we’re developing intuitive, adaptive solutions that spark creativity, boost productivity, and make collaboration seamless.</p> <p>We create breakthrough solutions that make complex tasks feel effortless, teamwork more natural, and ideas more impactful—always with a human-centric mindset.</p> <p>By embedding AI advancements into every HP product and service, we’re expanding what’s possible for individuals, organisations, and the future of work.</p> <p>Join us as we reinvent work, so people everywhere can do their best work.</p></div><p><strong><span data-contrast="none"><span data-ccp-parastyle="heading 3">About the Role</span></span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;134245418&quot;:true,&quot;134245529&quot;:true,&quot;335559738&quot;:281,&quot;335559739&quot;:281}">&nbsp;</span></strong></p> <p><span data-contrast="auto">We are seeking a&nbsp;</span><span data-contrast="auto">Senior Machine Learning Engineer</span><span data-contrast="auto">&nbsp;to lead the fine-tuning, optimization, and deployment of AI models for diverse tasks, with a strong emphasis on&nbsp;</span><span data-contrast="auto">on-device inference</span><span data-contrast="auto">. 
You will work on&nbsp;cutting-edge&nbsp;applications such as&nbsp;</span><span data-contrast="auto">orchestration, planning, multi-agent coordination</span><span data-contrast="auto">, and other intelligent decision-making systems.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></p> <p><span data-contrast="auto">You will&nbsp;be responsible for&nbsp;adapting foundation models (LLMs, multimodal models) to specialized domains, making them&nbsp;</span><span data-contrast="auto">fast,&nbsp;accurate, and efficient</span><span data-contrast="auto">&nbsp;for resource-constrained environments—while ensuring robustness and safety.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></p> <p><strong>What You Might Do</strong></p> <ul> <li><span data-contrast="auto">Model Fine-Tuning &amp; Adaptation</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Fine-tune large language models, multimodal models, and task-specific models for orchestration, planning, and&nbsp;any other&nbsp;workflows&nbsp;as defined.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Design and run experiments to improve task accuracy, robustness, and generalization.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Explore and apply methods like&nbsp;full fine-tuning,&nbsp;LoRA,&nbsp;QLoRA&nbsp;and other&nbsp;types of&nbsp;parameter-efficient fine-tuning.</span><span 
data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Employ&nbsp;advanced techniques such as QAT, DPO, and GRPO to further improve model quality.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">On-Device Optimization</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Prune, quantize, and compress models (e.g., INT8, INT4,&nbsp;mixed-precision) for CPU, GPU,&nbsp;NPU,&nbsp;and edge accelerators.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Optimize&nbsp;models for&nbsp;low-latency&nbsp;inference using frameworks like&nbsp;OpenVINO, ONNX Runtime, and QNN.</span></li> <li><span data-contrast="auto">Data&nbsp;Pipeline &amp; Deployment</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Build robust data pipelines for domain-specific datasets, including synthetic data generation and annotation.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Define evaluation metrics. 
Perform&nbsp;evaluations&nbsp;and analyze results.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Establish best practices for versioning, reproducibility, and continuous improvement of model performance.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">AI Orchestration &amp; Planning</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Develop and refine models to support multi-step reasoning, tool orchestration, and decision planning.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Work with stakeholders on orchestrator architecture.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Collaborate with product and research teams to design intelligent, context-aware assistant capabilities.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> </ul> <p><strong>Essential Qualifications</strong></p> <ul> <li><span data-contrast="auto">7+ years of experience in applied machine learning, including at least&nbsp;3&nbsp;years in&nbsp;LLM&nbsp;fine-tuning.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Proficiency&nbsp;in Python and ML 
frameworks&nbsp;(Hugging&nbsp;Face,&nbsp;PyTorch).</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Strong understanding of transformer architectures, attention mechanisms, and PEFT techniques.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Experience with on-device inference optimization (OpenVINO, ONNX, QNN).</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Familiarity with orchestration/planning&nbsp;architectures and&nbsp;techniques for AI assistants.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;201341983&quot;:0,&quot;335551550&quot;:1,&quot;335551620&quot;:1,&quot;335559685&quot;:720,&quot;335559737&quot;:0,&quot;335559738&quot;:240,&quot;335559739&quot;:240,&quot;335559740&quot;:279,&quot;335559991&quot;:360}">&nbsp;</span></li> <li><span data-contrast="auto">Track record&nbsp;of delivering production-ready ML solutions in latency-sensitive environments.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> </ul> <p><strong><span data-contrast="auto">Preferred Qualifications</span></strong></p> <ul> <li><span data-contrast="auto">Experience with multi-agent systems or AI assistant orchestration.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Familiarity with advanced inference optimization techniques such as&nbsp;KV 
cache&nbsp;paging and&nbsp;flash attention.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> <li><span data-contrast="auto">Knowledge of common inference engines, including but not limited to llama.cpp and&nbsp;vLLM.</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> </ul> <p><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;335551550&quot;:1,&quot;335551620&quot;:1,&quot;335559738&quot;:220,&quot;335559739&quot;:220}">Salary Range:&nbsp;$120,000 - $215,000</span></p><div class="content-conclusion"><p><strong>Compensation &amp; Benefits (Full-Time Employees)<br></strong></p> <p>The salary range for this role is listed above. The final salary offered is based on multiple factors, including individual job-related qualifications, education, experience, knowledge, and skills.</p> <p>At HP IQ, we offer a competitive and comprehensive benefits package, including:</p> <ul> <li>Health insurance</li> <li>Dental insurance</li> <li>Vision insurance</li> <li>Long-term/short-term disability insurance</li> <li>Employee assistance program</li> <li>Flexible spending account</li> <li>Life insurance</li> <li>Generous time-off policies, including: <ul> <li>4-12 weeks fully paid parental leave based on tenure</li> <li>11 paid holidays</li> <li>Additional flexible paid vacation and sick leave (<a href="https://www8.hp.com/h20195/v2/GetDocument.aspx?docname=c07065756">US benefits overview</a>)</li> </ul> </li> </ul> <p><strong>Why HP IQ?</strong></p> <p>HP IQ is HP’s new AI innovation lab, building the intelligence to empower humanity and reimagining how we work, create, and connect to shape the future of work.</p> <ul> <li><strong>Innovative Work<br></strong>Help shape the future of intelligent computing and workplace 
transformation.</li> <li><strong>Autonomy and Agility</strong><strong><br></strong> Work with the speed and focus of a startup, backed by HP’s scale.</li> <li><strong>Meaningful Impact</strong><strong><br></strong> Build AI-powered solutions that help people and organisations thrive.</li> <li><strong>Flexible Work Environment</strong><strong><br></strong> Freedom and flexibility to do your best work.</li> <li><strong>Forward-Thinking Culture<br></strong>We learn fast, stay future-focused, and imagine what comes next—together.</li> </ul> <p><strong>Equal Opportunity Employer (EEO) Statement</strong></p> <p>HP, Inc. provides equal employment opportunity to all employees and prospective employees, without regard to race, color, religion, sex, national origin, ancestry, citizenship, sexual orientation, age, disability, or status as a protected veteran, marital status, familial status, physical or mental disability, medical condition, pregnancy, genetic predisposition or carrier status, uniformed service status, political affiliation or any other characteristic protected by applicable national, federal, state, and local law(s).</p> <p>Please be assured that you will not be subject to any adverse treatment if you choose to disclose the information requested. This information is provided voluntarily. The information obtained will be kept in strict confidence.</p> <p>If you’d like more information about HP’s <a href="https://www8.hp.com/h20195/v2/GetDocument.aspx?docname=c08129225">EEO Policy</a> or your EEO rights as an applicant under the law, please click here: <a href="http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf">Equal Employment Opportunity is the Law</a> <a href="https://www.dol.gov/ofccp/regs/compliance/posters/pdf/OFCCP_EEO_Supplement_Final_JRF_QA_508c.pdf">Equal Employment Opportunity is the Law – Supplement</a></p> <div id="application-start"></div></div>

Perks & benefits

  • Vision Insurance
  • Dental Insurance
  • Medical Insurance
  • Paid Time Off
