Lead DevOps Engineer
CloudSmiths
- Employment
- Permanent Employee
- Seniority
- Lead
About the role
CloudSmiths is looking for a proactive, deeply technical Lead DevOps Engineer (GCP) to join our platform engineering practice.
In this role, you won't just maintain environments; you will architect, scale, and lead the charge in ensuring the reliability, performance, and security of production ecosystems for our diverse range of enterprise clients. This includes designing and deploying secure, scalable GCP Landing Zones that serve as the foundational blueprint for our clients' cloud journeys. You will bridge the gap between development and operations by championing advanced automation, robust monitoring, and cutting-edge DevOps strategies specifically within Google Cloud Platform.
Key Responsibilities:
- Act as the ultimate technical authority for DevOps and SRE practices on GCP, architecting high-availability systems to ensure maximum uptime and performance across complex multi-tenant environments.
- Standardize and champion enterprise-grade IaC principles across the organization using tools like Terraform, Ansible, or Deployment Manager to ensure repeatable, secure deployments.
- Drive next-generation monitoring and observability initiatives using tools such as Grafana, Prometheus, and Google Cloud Observability (formerly Stackdriver) to ensure deep, proactive visibility into system health.
- Design, optimise, and govern scalable CI/CD pipelines using GCP-native tools and industry standards to enable seamless, secure, and frequent code deployments.
- Lead the response for complex, high-priority production incidents, conduct thorough root cause analysis, and cultivate a proactive, blameless post-mortem culture focused on continuous improvement.
- Effectively manage complex workloads, mentor intermediate and junior engineers, and maintain clear, strategic communication with internal leadership and external enterprise stakeholders regarding project milestones and technical roadmaps.
Requirements & Qualifications:
- Experience: 7+ years of hands-on experience in a DevOps, Cloud Engineering, or Site Reliability role, with at least 2 years in a senior or lead capacity.
- Deep, production-proven experience architecting and managing infrastructure, security frameworks, and cost-optimization strategies natively within the Google Cloud Platform.
- Experience designing and deploying secure, scalable GCP Landing Zones.
- Expert-level experience with Kubernetes (GKE), Docker, and managing complex container orchestration at scale.
Technical Skills:
- Advanced UNIX/Linux system administration and performance tuning.
- Expert scripting capabilities in Python, Go, or Bash/Shell for automation.
- Strong mastery of configuration management tools like Ansible, Chef, or Puppet.
- Education: A Degree or Diploma in IT, Computer Science, or equivalent practical industry experience.
- Certifications: Google Cloud Professional certifications (Professional Cloud DevOps Engineer or Professional Cloud Architect) are highly advantageous.
Why Join Us?
We are 100% remote. Enjoy the flexibility of working from anywhere.
You’ll drive monitoring, observability, and CI/CD initiatives using the latest GCP-native tools.
We value innovative thinkers who aren't afraid to challenge the status quo to drive excellence.