(Sr.) SRE Engineer (Backend Focused)
17live
- Employment: Full-time
- Seniority: Senior
About the role
Job Overview
As a Backend Engineer at 17LIVE, your primary responsibility is to develop and deliver high-quality, scalable backend features. Simultaneously, you will be deeply involved in Site Reliability Engineering (SRE) initiatives, maintaining system stability, optimizing infrastructure, and managing technical incidents. We are looking for a partner who can write exceptional code while applying an SRE mindset to ensure our systems remain rock-solid under massive global traffic.
Responsibilities
- System Development & Architecture: Design large-scale, fault-tolerant cloud services and backend APIs.
- Performance Optimization: Analyze and improve the efficiency, scalability, and stability of various subsystems.
- Rapid Iteration: Frequently deploy new features to support rapid business growth and system expansion.
- Reliability Practices: Collaborate with the SRE team to integrate monitoring, alerting, and automation workflows into the development lifecycle.
Requirements
- Development Experience: 1+ years (3+ years for Senior) of software development experience with strong analytical and coding skills.
- Core Technology: Proficiency in Go or other backend languages; solid understanding of algorithms, system architecture, databases, and distributed systems.
- Systems Thinking: Ability to operate and troubleshoot effectively within UNIX/Linux environments.
- Personal Attributes: Open-minded with a creative approach to problem-solving.
- Senior Position: Ability to work independently with minimal guidance and demonstrate strong system design capabilities.
Nice to Have (Backend Skills)
- Advanced knowledge of algorithms, cloud computing, and networking.
- Experience with file systems, concurrency, multithreading, and server architectures.
- Background in large-scale system design and distributed systems.
Nice to Have (SRE Skills)
- Containerization: Foundational knowledge of Docker and Kubernetes (K8s).
- CI/CD: Experience writing or maintaining pipelines (e.g., CircleCI, Jenkins, ArgoCD, Helm).
- Observability: Experience maintaining or using monitoring systems (e.g., Prometheus, Grafana, ELK Stack).
- Infrastructure as Code (IaC): Familiarity with Terraform for managing cloud resources.
- Reliability Design: Knowledge of High Availability (HA) and Disaster Recovery (DR) architectures.
- Automation: Basic proficiency in Shell Scripting.
What We Look for in You
- Agility: Ability to quickly grasp problems, identify the root cause, and take decisive action.
- Prudence: A habit of performing small-scale testing (Canary/Staging) and validation before deployment.
- Curiosity: A passion for learning and staying updated on the latest trends in both Backend and SRE fields.
- Collaboration: Strong communication skills and a team-player attitude when working toward shared goals with cross-functional teams.