Back to all jobs

About the role
<p><strong>Job Description</strong>:<strong> Technical Operations Manager </strong></p>
<p><strong>Role Overview </strong></p>
<p>We are seeking an experienced Cloud Infrastructure Shift Supervisor to lead our 24x7 operations team. You will ensure maximum uptime, rapid incident resolution, and seamless shift handovers. This role balances real-time technical incident oversight with people management. </p>
<p> </p>
<p><strong>Responsibilities</strong> - </p>
<p>* Shift Leadership: Manage a rotating team of cloud engineers to ensure continuous 24x7 coverage. </p>
<p>* Incident Management: Act as Incident Resolution Owner for Severity-1 and Severity-2 infrastructure outages. </p>
<p>* Operations Oversight: Monitor system health dashboards (Datadog, New Relic) and prioritize the ticketing queue. </p>
<p>* SLA Compliance: Ensure the team meets or exceeds Service Level Agreements (SLAs) for response and resolution times. </p>
<p>* Shift Handovers: Execute rigorous, documented handovers to incoming shifts to prevent data gaps. </p>
<p>* Team Development: Mentor junior engineers, conduct performance reviews, and manage shift scheduling to prevent burnout. </p>
<p> </p>
<p><strong>Required Technical Skills </strong>- </p>
<p>* Cloud Platforms: Proven experience managing infrastructure on AWS, Azure, or GCP.</p>
<p>* Monitoring & Logging: Proficiency with tools like Splunk, Datadog, Prometheus, or ELK stack. </p>
<p>* Ticketing Systems: Advanced knowledge of ITSM platforms such as Jira Service Desk or ServiceNow. </p>
<p>* OS & Scripting: Familiarity with Linux/Windows administration and basic scripting (Bash, Python). </p>
<p>* Strong understanding of deployment, maintenance and monitoring of Data Pipelines </p>
<p> </p>
<p><strong>Required Soft Skills </strong>- </p>
<p>* Crisis Management: Ability to remain calm and direct technical teams during high pressure outages. </p>
<p>* Communication: Clear written and verbal communication for page updates and stakeholder alerts. </p>
<p>* Problem Solving: Strong analytical skills to spot patterns in recurring infrastructure alerts. </p>
<p>* Flexibility: Absolute willingness to work a rotating schedule, including nights, weekends, and holidays. </p>
<p> </p>
<p><strong>Experience & Qualifications </strong>- </p>
<p>* Bachelor’s degree in Computer Science, IT, or equivalent practical experience.</p>
<p>* 7+ years of experience in a Cloud Support, NOC, or Site Reliability Engineering (SRE) environment. </p>
<p>* 3+ years of experience in a team lead, supervisor, or senior engineer capacity.</p>
<p>* Relevant certifications (e.g., AWS SysOps Administrator, Azure Administrator, ITIL Foundation) are a plus.</p><div class="content-conclusion"><p><strong>Note:</strong></p>
<blockquote class="gmail_quote"><em>By submitting your application, you consent to being contacted by our Talent Acquisition team via phone call, email, SMS, WhatsApp, or other communication channels regarding your application and relevant career opportunities.</em></blockquote>
<p></p></div>
741,000+ hidden jobs like this
Sigmoid and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites