Staff Software Engineer - AI Traffic & Inference Infrastructure

coupanginternal
Bengaluru · 3d ago
Seniority
Staff

About the role

<p><strong><span data-contrast="auto"> </span></strong> <strong><span data-contrast="auto">Please complete the attached<span class="Apple-converted-space">&nbsp;</span></span></strong><a href="https://coupang.service-now.com/sp?id=kb_article&amp;sysparm_article=KB0010204"><strong><span data-contrast="none"><span data-ccp-charstyle="Hyperlink">Internal Transfer Request Form</span></span></strong></a><strong><span data-contrast="auto"> and submit it. </span></strong><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false}">&nbsp;</span></p> <p><strong><span data-contrast="auto">Please make sure to<span class="Apple-converted-space">&nbsp;</span></span></strong><span style="text-decoration: underline;"><strong><span data-contrast="auto">apply with your Coupang e-mail address</span></strong></span><strong><span data-contrast="auto">. </span></strong><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false}">&nbsp;</span></p> <hr> <p><strong><span data-contrast="auto"> </span></strong><strong><span data-contrast="auto">Company Introduction</span></strong><span data-contrast="auto"> </span><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false}">&nbsp;</span></p> <p><span data-contrast="auto">We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We are one of the fastest-growing e-commerce companies and have established an unparalleled reputation as a dominant and reliable force in South Korean commerce. 
</span><span class="Apple-converted-space">&nbsp;</span><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false}">&nbsp;</span></p> <p><span data-contrast="none">We are proud to have the best of both worlds: a startup culture with the resources of a large global public company. This fuels us to continue our growth and launch new services at the pace we have kept since our inception. We are all entrepreneurs surrounded by opportunities to drive new initiatives and innovations. At our core, we are bold and ambitious people who like to get our hands dirty and make a hands-on impact. At Coupang, you will see yourself, your colleagues, your team, and the company grow every day.</span><span class="Apple-converted-space">&nbsp;</span><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false}">&nbsp;</span></p> <p><span data-contrast="auto">Our mission to build the future of commerce is real. We push the boundaries of what’s possible to solve problems and break traditional tradeoffs. Join Coupang now to create an epic experience in this always-on, high-tech, and hyper-connected world. </span><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false}"> <br></span><span data-contrast="auto"> </span><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false}">&nbsp;</span></p> <p><strong><span data-contrast="auto">Role Overview</span></strong></p> <p>As a Staff Engineer on the Coupang Intelligent Cloud (CIC) Infrastructure team, you will design and scale the intelligent nervous system of our CIC Cloud AI platform. You won’t just be moving packets; you’ll be building the orchestration and routing layers that keep our LLMs and foundation models highly available, low-latency, and cost-efficient. 
You will own the end-to-end lifecycle of traffic management, from global load balancing to hardware-aware request routing across thousands of accelerators.</p> <p><strong><span data-contrast="auto">What You Will Do</span></strong></p> <ul data-pm-slice="3 3 []"> <li> <p><u>Intelligent Routing</u><strong>:</strong>&nbsp;Design and implement sophisticated load-balancing algorithms tailored for AI workloads&nbsp;(training and inference), optimizing request distribution based on model availability and accelerator health.&nbsp;</p> </li> </ul> <ul> <li> <p><u>Inference Orchestration</u><strong>:</strong>&nbsp;Architect and evolve our inference infrastructure to support seamless model deployment,&nbsp;auto-scaling, and multi-AZ&nbsp;failover.&nbsp;</p> </li> </ul> <ul> <li> <p><u>Performance Engineering</u><strong>:</strong>&nbsp;Drive initiatives to minimize tail latency (P95/P99) and maximize throughput using advanced batching, caching, and streaming token delivery techniques.&nbsp;</p> </li> </ul> <ul> <li> <p><u>Fleet Automation</u><strong>:</strong>&nbsp;Build robust infrastructure-as-code and CI/CD pipelines to manage dynamic compute fleets, ensuring they automatically scale to meet production and research demands.&nbsp;</p> </li> </ul> <ul> <li> <p><u>Observability &amp; Optimization</u><strong>:</strong>&nbsp;Leverage deep telemetry data to tune system performance and hardware-agnostic scheduling across diverse GPU/TPU environments.&nbsp;</p> </li> </ul> <ul> <li> <p><u>Technical Leadership</u><strong>:</strong> Lead cross-functional initiatives across infrastructure, software, and ML teams, providing mentorship and&nbsp;setting&nbsp;the long-term technical roadmap for traffic management.&nbsp;</p> </li> </ul> <p>&nbsp;</p> <p><strong><span data-contrast="auto">Basic </span></strong><strong><span data-contrast="auto">Qualifications</span></strong></p> <ul> <li><strong>Education:</strong>&nbsp;Bachelor’s or&nbsp;Master’s degree in Computer Science, 
Engineering, or&nbsp;a related&nbsp;technical field.&nbsp;</li> </ul> <ul data-pm-slice="3 3 []"> <li> <p><strong>Experience:</strong>&nbsp;8–12 years of progressive software engineering experience, with a heavy emphasis on distributed systems, cloud-native architectures, or platform operations.&nbsp;</p> </li> </ul> <ul> <li> <p><strong>Programming:</strong>&nbsp;Strong&nbsp;proficiency&nbsp;in&nbsp;<strong>Go</strong>&nbsp;or&nbsp;<strong>Python</strong>, with a deep understanding of networked systems and performance optimization.&nbsp;</p> </li> </ul> <ul> <li> <p><strong>Orchestration:</strong>&nbsp;Expert-level knowledge of&nbsp;<strong>Kubernetes</strong>&nbsp;internals (scheduling, controllers) and containerization ecosystems.&nbsp;</p> </li> </ul> <ul> <li> <p><strong>Traffic Management:</strong>&nbsp;Proven experience with load balancing, service mesh, and request routing at scale.&nbsp;</p> </li> <li> <p><strong>Operational Excellence:</strong>&nbsp;A strong "ownership" mindset with&nbsp;a track record&nbsp;of&nbsp;maintaining&nbsp;mission-critical, high-availability systems in production.&nbsp;</p> </li> </ul> <p>&nbsp;</p> <p><strong><span data-contrast="auto">Preferred Qualifications</span></strong></p> <ul data-pm-slice="3 3 []"> <li> <p><strong>AI/ML Domain Knowledge:</strong>&nbsp;Prior experience building infrastructure specifically for LLM inference or large-scale training clusters.&nbsp;</p> </li> </ul> <ul> <li> <p><strong>Low-Level Optimization:</strong>&nbsp;Familiarity&nbsp;with&nbsp;inference optimization, including mixed precision,&nbsp;kernel tuning, or custom hardware accelerators.&nbsp;</p> </li> </ul> <ul> <li> <p><strong>Public/Private Cloud:</strong>&nbsp;Experience managing hybrid-cloud or multi-AZ&nbsp;deployments across AWS, Azure, or GCP.&nbsp;</p> </li> </ul> <ul> <li> <p><strong>Compliance:</strong>&nbsp;Experience&nbsp;operating&nbsp;in regulated environments with strict security and compliance requirements.&nbsp;</p> </li> </ul> <p><span 
data-ccp-props="{&quot;134245417&quot;:true,&quot;201341983&quot;:0,&quot;335559740&quot;:259}">&nbsp;<br></span></p> <p><strong><span data-contrast="auto">Type of work model</span></strong><span data-ccp-props="{&quot;134245417&quot;:false}">&nbsp;</span></p> <ul> <li data-leveltext="" data-font="Symbol" data-listid="38" data-list-defn-props="{&quot;335552541&quot;:1,&quot;335559685&quot;:880,&quot;335559991&quot;:440,&quot;469769226&quot;:&quot;Symbol&quot;,&quot;469769242&quot;:[8226],&quot;469777803&quot;:&quot;left&quot;,&quot;469777804&quot;:&quot;&quot;,&quot;469777815&quot;:&quot;hybridMultilevel&quot;}" data-aria-posinset="1" data-aria-level="1"><span data-contrast="auto">Hybrid</span></li> </ul> <p><strong><span data-contrast="auto">Details to consider</span></strong><span data-ccp-props="{&quot;134245417&quot;:false}">&nbsp;</span></p> <ul> <li data-leveltext="" data-font="Symbol" data-listid="38" data-list-defn-props="{&quot;335552541&quot;:1,&quot;335559685&quot;:880,&quot;335559991&quot;:440,&quot;469769226&quot;:&quot;Symbol&quot;,&quot;469769242&quot;:[8226],&quot;469777803&quot;:&quot;left&quot;,&quot;469777804&quot;:&quot;&quot;,&quot;469777815&quot;:&quot;hybridMultilevel&quot;}" data-aria-posinset="2" data-aria-level="1"><span data-contrast="auto">Those eligible for employment protection (recipients of veteran’s benefits, the disabled, etc.) 
may receive preferential treatment for employment in accordance with applicable laws.&nbsp;<br></span><span data-ccp-props="{&quot;134245417&quot;:false}">&nbsp;</span></li> </ul> <p><strong><span data-contrast="none">Privacy Notice</span></strong><strong><span data-contrast="none">&nbsp;</span></strong><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></p> <ul> <li data-leveltext="" data-font="Symbol" data-listid="35" data-list-defn-props="{&quot;335552541&quot;:1,&quot;335559685&quot;:720,&quot;335559991&quot;:360,&quot;469769226&quot;:&quot;Symbol&quot;,&quot;469769242&quot;:[8226],&quot;469777803&quot;:&quot;left&quot;,&quot;469777804&quot;:&quot;&quot;,&quot;469777815&quot;:&quot;multilevel&quot;}" data-aria-posinset="1" data-aria-level="1"><span data-contrast="none">Your personal information will be collected and managed by Coupang as stated in the Application Privacy Notice located below. </span><a href="https://privacy.coupang.com/en/land/jobs/"><span data-contrast="none"><span data-ccp-charstyle="Hyperlink">https://privacy.coupang.com/en/land/jobs/</span></span></a><span data-ccp-props="{&quot;134233279&quot;:true,&quot;134245417&quot;:false,&quot;335559738&quot;:240,&quot;335559739&quot;:240}">&nbsp;</span></li> </ul>
