Sequen - Staff Software Engineer – Infrastructure

Fabric

Worldwide · Remote · 6mo ago

Employment: Full-time
Seniority: Staff

About Us

About the Role

Core Infrastructure: The systems team is responsible for supporting clusters used to train, research, and ultimately serve AI models. Your work will be crucial in ensuring Sequen is able to continue to reliably train and serve frontier ranking models.
Observability: We build and maintain the infrastructure that monitors the health, performance, and efficiency of our AI systems. You'll work across teams to implement monitoring solutions using tools like Prometheus and Datadog, while developing automated approaches for dashboards and alerts. Your work will create reliable, low-maintenance systems that enable proactive monitoring and operational excellence.

Responsibilities

Consult with different stakeholders to deeply understand infrastructure, data and compute needs, identifying potential solutions to support frontier research and product development
Set technical strategy and oversee development of high-scale, reliable infrastructure systems
Design processes (e.g. postmortem review, incident response, on-call rotations) that help the team operate effectively and never fail the same way twice

About You

Have 10+ years of relevant industry experience, including 3+ years leading large-scale, complex projects or teams as an engineer or tech lead
Possess deep knowledge of modern cloud infrastructure including Kubernetes, Infrastructure as Code, AWS, and GCP
Are obsessed with distributed systems at scale, infrastructure reliability, scalability, security, and continuous improvement
Have strong proficiency in at least one programming language (e.g., Python, Go, Java)
Have strong problem-solving skills and the ability to work independently
Have a passion for supporting internal partners, such as research, and understanding their needs
Have excellent communication skills to build consensus with stakeholders, both internally and externally

Security and privacy best practice expertise
Hands-on experience with data pipelines and processing large-scale datasets
Experience with machine learning infrastructure such as GPUs
Technical expertise: quickly understanding system design tradeoffs and keeping track of rapidly evolving software systems

What We Offer

A senior role with massive impact on infrastructure and engineering velocity
Ownership of mission-critical systems that power ML and data workflows
Annual salary: $250,000–$350,000 USD + equity
Health insurance, unlimited time off, awesome team

Perks & benefits

Medical Insurance
Unlimited Vacation
Equity Compensation
