Back to all jobs
N

Senior Cloud Native Platform Engineer

nscaleoperationsukltd

US2d ago
Seniority
Senior

About the role

<h2>About Nscale</h2> <p>Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.</p> <p>We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future.</p> <h2>About the Role</h2> <p>We’re hiring a <strong><strong class="textBold">Senior Cloud Native Platform Engineer</strong></strong>&nbsp;to build, operate, and improve the cloud-native platform foundations that support AI applications and services at scale.</p> <p>In this hands-on platform engineering role, you’ll work on shared <strong><strong class="textBold">Kubernetes-based platforms</strong></strong>, deployment patterns, observability foundations, infrastructure automation, and operational tooling that help internal teams run services safely and efficiently on <strong><strong class="textBold">GPU-backed infrastructure</strong></strong>. You’ll partner closely with software engineering, infrastructure, and SRE teams to ensure platform capabilities meet real developer and operational needs.</p> <p>This role is important to the reliability, scalability, and usability of Nscale’s platform. You’ll take ownership of significant platform components, deliver complex technical work independently, and raise the quality of operations and engineering through practical improvements, sound technical judgement, and mentoring.</p> <h2>What you'll be doing</h2> <h2>Platform Operations &amp; Engineering</h2> <ul> <li value="1"><strong><strong class="textBold">Build</strong></strong> and improve shared cloud-native platform capabilities used by internal engineering teams to run AI applications and services.</li> <li value="2"><strong><strong class="textBold">Own</strong></strong> significant parts of the platform area, including Kubernetes cluster operations, workload runtime configuration, deployment workflows, observability foundations, or environment automation.</li> <li value="3"><strong><strong class="textBold">Improve</strong></strong> the reliability, scalability, and supportability of platform services through practical engineering and operational enhancements.</li> <li value="4"><strong><strong class="textBold">Develop</strong></strong> automation, tooling, and configuration that reduce manual effort, improve consistency, and make the platform easier to use and operate.</li> <li value="5"><strong><strong class="textBold">Apply</strong></strong> software engineering where it creates leverage, including scripts, services, CI/CD automation, operational tooling, and platform integrations.</li> </ul> <h2>Reliability, Operability &amp; Automation</h2> <ul> <li value="1"><strong><strong class="textBold">Improve</strong></strong> incident prevention, detection, response, and recovery across the platform areas you support.</li> <li value="2"><strong><strong class="textBold">Build</strong></strong> and refine observability for platform services, including metrics, logs, tracing, dashboards, alerts, and other useful operational signals.</li> <li value="3"><strong><strong class="textBold">Strengthen</strong></strong> rollout safety, capacity awareness, failure handling, and recovery procedures for production environments.</li> <li value="4"><strong><strong class="textBold">Debug</strong></strong> and resolve complex issues spanning Kubernetes, Linux, networking, storage, workload runtime behaviour, and cloud or datacentre infrastructure dependencies.</li> <li value="5"><strong><strong class="textBold">Enhance</strong></strong> operational playbooks, runbooks, and engineering practices to reduce toil and increase service resilience.</li> </ul> <h2>Team Technical Contribution</h2> <ul> <li value="1"><strong><strong class="textBold">Contribute</strong></strong> to design discussions, code reviews, and operational standards within the platform engineering team.</li> <li value="2"><strong><strong class="textBold">Collaborate</strong></strong> with software engineering, infrastructure, and SRE teams to deliver platform capabilities that are practical, supportable, and aligned to operational needs.</li> <li value="3"><strong><strong class="textBold">Define</strong></strong> sensible defaults, paved roads, and supportable patterns for service deployment and runtime operations.</li> <li value="4"><strong><strong class="textBold">Mentor</strong></strong> less experienced engineers in platform engineering fundamentals, operational judgement, and good automation practices.</li> </ul> <h2>KPIs</h2> <ul> <li value="1"><strong><strong class="textBold">Platform reliability and service resilience</strong></strong></li> <li value="2"><strong><strong class="textBold">Reduction in manual operational toil</strong></strong></li> <li value="3"><strong><strong class="textBold">Incident detection, response, and recovery effectiveness</strong></strong></li> <li value="4"><strong><strong class="textBold">Observability and operational readiness of platform services</strong></strong></li> </ul> <h2>About You</h2> <ul> <li value="1"><strong><strong class="textBold">Strong hands-on experience</strong></strong> operating and improving Kubernetes-based platforms in production.</li> <li value="2"><strong><strong class="textBold">Solid experience</strong></strong> with infrastructure automation, CI/CD, configuration management, or GitOps-style workflows.</li> <li value="3"><strong><strong class="textBold">Strong understanding</strong></strong> of reliability engineering principles, including observability, incident response, failure analysis, and operational readiness.</li> <li value="4"><strong><strong class="textBold">Experience writing</strong></strong> production-quality automation, tooling, or backend code in Go, Python, Bash, or similar languages.</li> <li value="5"><strong><strong class="textBold">Good Linux fundamentals</strong></strong>, including processes, filesystems, cgroups, service behaviour, and system debugging.</li> <li value="6"><strong><strong class="textBold">Good networking fundamentals</strong></strong>, including TCP/IP, DNS, routing, load balancing, and container or overlay networking concepts.</li> <li value="7"><strong><strong class="textBold">Experience debugging</strong></strong> complex production issues across multiple system layers.</li> <li value="8"><strong><strong class="textBold">Ability to work independently</strong></strong> on substantial technical problems while collaborating effectively with adjacent teams.</li> <li value="9"><strong><strong class="textBold">Experience mentoring</strong></strong> or supporting less experienced engineers through practical technical guidance.</li> </ul> <h2>What we can offer you</h2> <p>At Nscale, you'll find a collaborative, supportive, and innovative environment where your contributions spark real impact. We're building something extraordinary, and we want you at the core.</p> <ul> <li>Highly competitive US compensation package (base + bonus + equity), with performance reviews every 12 months. 🚀</li> <li>Join one of the fastest-growing AI infrastructure companies — your chance to directly shape how global AI capacity is planned and deployed. ✨</li> <li>Expect a dynamic progression plan tailored to your ambitions. Grow by leading critical cross-functional initiatives and shaping capital strategy — always with our full support.</li> <li>Human-First Flexibility: We treat you as humans first. 🫶🏽 Our flexible workplace trusts Nscalers to deliver, giving you the autonomy to shape your day around life's moments.</li> </ul> <h2>Equal Opportunities Statement</h2> <p>We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds.</p> <p>If there’s anything we can do to accommodate your specific situation, please let us know.</p> <p>The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.</p> <p>For information on how Nscale handles candidate personal data, please see our Employee &amp; Candidate Privacy Notice: Here.</p> <h2>Salary Range</h2> <p>The range below reflects the base salary for the position. Actual compensation may vary based on job-related factors such as skill set, experience, education, and location. In addition to base salary, this role may be eligible for bonus, equity, and/or commission programs. Nscale may offer a competitive benefits package including medical, dental, vision, flexible paid time off, parental leave, and retirement plan part</p><div class="content-pay-transparency"><div class="pay-input"><div class="description"><p>The range below reflects the base salary for the position. Actual compensation may vary based on job-related factors such as skill set, experience, education, and location. In addition to base salary, this role may be eligible for bonus, equity, and/or commission programs. Nscale may offer a competitive benefits package including medical, dental, vision, flexible paid time off, parental leave, and retirement plan participation.</p></div><div class="title">Salary Range</div><div class="pay-range"><span>$200,000</span><span class="divider">&mdash;</span><span>$225,000 USD</span></div></div></div><div class="content-conclusion"><p><em>For information on how Nscale handles candidate personal data, please see our Employee &amp; Candidate Privacy Notice:&nbsp;<a href="https://drive.google.com/file/d/1QK5Yg04WHD9K9IAtJgQWubJZC9oLvatK/view?usp=sharing" target="_blank" data-saferedirecturl="https://www.google.com/url?q=https://drive.google.com/file/d/1QK5Yg04WHD9K9IAtJgQWubJZC9oLvatK/view?usp%3Dsharing&amp;source=gmail&amp;ust=1765375172804000&amp;usg=AOvVaw2Ncte4rmlGl8OKuFuDgDtx">Here.</a></em></p></div>

Perks & benefits

  • Paid Time Off
  • Equity Compensation

731,000+ hidden jobs like this

nscaleoperationsukltd and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.