Back to all jobs
X
Senior SRE Engineer - future opening
xebiacee
Bulgaria; Poland1d ago
- Seniority
- Senior
About the role
<p> </p>
<p><strong>Who We Are</strong></p>
<p>While Xebia is a global tech company, our journey in CEE started with two Polish companies – PGS Software, known for world-class cloud and software solutions, and GetInData, a pioneer in Big Data. Today, we’re a team of 1,000+ experts delivering top-notch work across cloud, data, and software. And we’re just getting started.</p>
<p><strong>What We Do</strong></p>
<p>We work on projects that matter – and that make a difference. From fintech and e-commerce to aviation, logistics, media, and fashion, we help our clients build scalable platforms, data and AI solutions, and cutting-edge applications to shape the future of tech. Our clients include McLaren, Aviva, Deloitte, Spotify, Disney, ING, UPS, Tesco, Truecaller, AllSaints, Volotea, Schmitz Cargobull, Allegro, InPost, and many, many more.</p>
<p>We value smart tech, real ownership, and continuous growth. We use modern, open-source stacks, and we’re proud to be trusted partners of Databricks, dbt, Snowflake, Azure, GCP, and AWS. Fun fact: we were the first AWS Premier Partner in Poland!</p>
<p><strong>Beyond Projects</strong></p>
<p>What makes Xebia special? Our community. We support tech communities, organize meetups (Software Talks, Data Tech Talks), and have a culture that actively support your growth via Guilds, Labs, and personal development budgets — for both tech and soft skills. It’s not just a job. It’s a place to grow.</p>
<p><strong>What sets us apart? </strong></p>
<p><strong>Our mindset. Our vibe. Our people. And while that’s hard to capture in text – come visit us and see for yourself.</strong></p>
<p> </p>
<h2><strong>You will be:</strong></h2>
<ul>
<li>designing and implementing SRE practices, including SLI/SLO frameworks, error budgets, toil budgets, and reliability reviews,</li>
<li>leading the maturity progression from Level 1 (Reactive) through Level 5 (Autonomous),</li>
<li>driving toil elimination by identifying, measuring, and automating repetitive operational work,</li>
<li>designing and executing chaos engineering experiments to proactively identify reliability weaknesses,</li>
<li>establishing production readiness review processes for new application onboarding,</li>
<li>collaborating with engineering teams on joint RCA backlogs and incident reduction initiatives,</li>
<li>defining and tracking SRE KPIs, including MTTD, MTTR, error budget consumption, toil ratio, and automation coverage,</li>
<li>mentoring L2 engineers in SRE practices and engineering-led problem solving,</li>
<li>contributing to capacity planning, performance engineering, and reliability architecture reviews,</li>
<li>championing a blameless post-incident culture and continuous improvement.</li>
</ul>
<p> </p>
<h2 data-pm-slice="1 1 []"><strong>Your profile:</strong></h2>
<ul>
<li>5 - 8 years of experience in SRE, DevOps, or platform engineering,</li>
<li>practical experience using AI-powered assistants (e.g. Claude Code, GitHub Copilot, Cursor) to improve productivity, quality, or decision-making in software delivery,</li>
<li>deep understanding of SRE principles (Google SRE book concepts), including SLIs, SLOs, error budgets, and toil elimination,</li>
<li>strong programming skills in Python, Go, or similar languages,</li>
<li>extensive experience with cloud platforms such as AWS, Azure, or GCP, as well as Kubernetes,</li>
<li>proficiency with observability tools, including Datadog, Splunk, Prometheus, and Grafana,</li>
<li>experience with Infrastructure as Code (Terraform, Ansible) and CI/CD pipelines,</li>
<li>proven track record of driving reliability improvements in production environments,</li>
<li>experience with chaos engineering tools such as Gremlin, Chaos Monkey, or Litmus,</li>
<li>strong analytical, problem-solving, and English communication skills (at least B2 level).</li>
</ul>
<p><strong>Work from the European Union region and a work permit are required.</strong></p>
<h2><strong>Nice to have:</strong></h2>
<ul>
<li>
<p><span data-olk-copy-source="MessageBody">experience applying GenAI in a more structured way within the SDLC, including defined workflows, prompt patterns, or tool integrations embedded into daily work,</span></p>
</li>
<li>
<p>experience in managed services or consulting environments,</p>
</li>
<li>
<p>familiarity with AIOps and ML-driven operations,</p>
</li>
<li>
<p>contributions to the SRE community through talks, articles, or open-source projects,</p>
</li>
<li>
<p>experience working with large-scale distributed systems (1,000+ services),</p>
</li>
<li>
<p>SRE or cloud architect certifications,</p>
</li>
<li><span data-olk-copy-source="MessageBody">interest in and familiarity with emerging AI-driven practices (e.g. agent-based workflows, automation patterns, AI-augmented development), with a willingness to explore and experiment beyond standard approaches.</span></li>
</ul>
<p> </p>
<p> </p>
<h2><strong>Recruitment Process:</strong></h2>
<p><strong>CV</strong> review –<strong> HR</strong> call – <strong>Technical Interview</strong> – <strong>Client </strong>Interview – <strong>Decision</strong></p>
<p> </p>
731,000+ hidden jobs like this
xebiacee and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites