Back to all jobs
F
Sequen - Staff Software Engineer – Infrastructure
Fabric
WorldwideRemote6mo ago
- Employment
- Full-time
- Seniority
- Staff
About the role
About Us
About the Role
- Core Infrastructure: The systems team is responsible for supporting clusters used to train, research, and ultimately serve AI models. Your work will be crucial in ensuring Sequen is able to continue to reliably train and serve frontier ranking models.
- Observability: We build and maintain the infrastructure that monitors the health, performance, and efficiency of our AI systems. You'll work across teams to implement monitoring solutions using tools like Prometheus,, and Datadog, while developing automated approaches for dashboards and alerts. Your work will create reliable, low-maintenance systems that enable proactive monitoring and operational excellence.
Responsibilities
- Consult with different stakeholders to deeply understand infrastructure, data and compute needs, identifying potential solutions to support frontier research and product development
- Set technical strategy and oversee development of high scale, reliable infrastructure systems.
- Design processes (e.g. postmortem review, incident response, on-call rotations) that help the team operate effectively and never fail the same way twice
About You
- Have 10+ years of relevant industry experience, 3+ years leading large scale, complex projects or teams as an engineer or tech lead
- Possess deep knowledge of modern cloud infrastructure including Kubernetes, Infrastructure as Code, AWS, and GCP
- Are obsessed with distributed systems at scale, infrastructure reliability, scalability, security, and continuous improvement
- Strong proficiency in at least one programming language (e.g., Python, Go, Java)
- Strong problem-solving skills and ability to work independently
- Have a passion for supporting internal partners like research to understand their needs
- Have excellent communication skills to build consensus with stakeholders, both internally and externally
- Security and privacy best practice expertise
- Hands-on experience with data pipelines and processing large-scale datasets
- Experience with machine learning infrastructure like GPUs,
- Technical expertise: Quickly understanding systems design tradeoffs, keeping track of rapidly evolving software systems
What We Offer
- An senior role with massive impact on infrastructure and engineering velocity
- Ownership of mission-critical systems that power ML and data workflows
- Annual Salary : $250,000—$350,000 USD + Equity
- Health insurance, unlimited time off, awesome team
Perks & benefits
- Medical Insurance
- Unlimited Vacation
- Equity Compensation
764,000+ hidden jobs like this
Fabric and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites