Site Reliability Engineer Jobs

Engineers who ensure cloud services are reliable, scalable, and efficient. A critical role in maintaining uptime and performance in cloud environments.

Open roles: 13
Salary range: £35k – £105k
Hiring companies: 8

Site Reliability Engineers (SREs) are the backbone of cloud operations, ensuring that applications and services run smoothly and efficiently. They work closely with development teams to build and maintain robust, scalable systems that can handle high traffic and complex workloads. SREs are in high demand across a range of industries, from tech startups to large enterprises, and their role is crucial in maintaining the reliability and performance of cloud-based applications.

What the role does

Inside the role of a Site Reliability Engineer

A typical week for an SRE is a mix of proactive system maintenance, incident response, and collaboration with development teams.

  1. Monitor system performance and health metrics.
  2. Respond to and resolve incidents and outages.
  3. Implement and optimise automation scripts and tools.
  4. Collaborate with developers on system design and improvements.
  5. Conduct post-incident reviews and document findings.
  6. Participate in on-call rotations and provide 24/7 support when needed.

Salary on the board

£35k – £105k

Based on advertised midpoints across the 25 priced listings posted in the last 12 months. Base salary only.
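
As a rough illustration of what "advertised midpoints" means, the sketch below takes the four annual base salaries visible in the live listings on this page and computes the midpoint of each range. It is not the board's actual calculation: the function names are hypothetical, the sample is only the listings shown here rather than the full priced set, and leaving out the day-rate contract is an assumption.

```python
from statistics import median

# Advertised annual base salaries (GBP) copied from listings on this page.
# A single advertised figure is stored as (value, value).
# Assumption: the day-rate contract (£500 – £550 pd) is left out because
# it is not an annual base salary.
advertised = [
    (90_000, 90_000),
    (40_000, 65_000),
    (45_000, 65_000),
    (50_000, 60_000),
]


def midpoint(low: int, high: int) -> float:
    """Return the midpoint of an advertised salary range."""
    return (low + high) / 2


def to_k(value: float) -> str:
    """Format a salary in thousands of pounds, e.g. 52500 -> '£52.5k'."""
    return f"£{value / 1000:g}k"


midpoints = [midpoint(low, high) for low, high in advertised]

print([to_k(m) for m in midpoints])
print("median midpoint:", to_k(median(midpoints)))
```

A range such as £40,000 – £65,000 therefore counts as a single £52.5k data point, so a headline range built from midpoints can be narrower than the lowest and highest figures actually advertised.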

By seniority (£k base)

Junior: £35k – £55k (4 jobs)
Mid: £40k – £105k (13 jobs)
Senior: £40k – £95k (4 jobs)
Lead: £55k – £90k (2 jobs)

Skills & tools

What hiring managers ask for

% of 24 listings posted in the last 12 months that mention each skill, extracted from job descriptions.

CI/CD: 71%
Terraform: 63%
Python: 54%
Linux: 46%
AWS: 42%
Azure: 38%
Monitoring: 38%
Grafana: 38%
DevOps: 33%
Docker: 33%
Automation: 33%
Prometheus: 33%

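
To put the percentages above in context, the sketch below converts each share back into an approximate number of listings, using the 24-listing base stated in this section. The helper name is hypothetical, and the counts are estimates recovered from rounded percentages, not figures published by the board.

```python
# Number of listings the percentages are based on, as stated above.
TOTAL_LISTINGS = 24

# Share of listings mentioning each skill (%), copied from this section.
skill_share_pct = {
    "CI/CD": 71,
    "Terraform": 63,
    "Python": 54,
    "Linux": 46,
    "AWS": 42,
    "Azure": 38,
    "Monitoring": 38,
    "Grafana": 38,
    "DevOps": 33,
    "Docker": 33,
    "Automation": 33,
    "Prometheus": 33,
}


def approx_mentions(pct: int, total: int = TOTAL_LISTINGS) -> int:
    """Estimate how many listings a rounded percentage corresponds to."""
    return round(pct * total / 100)


counts = {skill: approx_mentions(pct) for skill, pct in skill_share_pct.items()}

for skill, count in counts.items():
    print(f"{skill}: about {count} of {TOTAL_LISTINGS} listings")
```

With a base this small, a single listing moves a skill's share by roughly four percentage points, so small gaps between skills should not be read as meaningful.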
Career ladder

From Junior to Principal

A typical UK progression for site reliability engineers. Years are guidance — strong people move faster, and many senior folks sidestep into research, product or management.

  1. Level 1

    Junior Site Reliability Engineer

    0–2 yrs

    Assist in monitoring and maintaining cloud infrastructure, with a focus on learning and supporting more experienced team members.

  2. Level 2

    Site Reliability Engineer

    2–5 yrs

    Own the reliability and performance of specific systems, implementing automation and optimisation strategies.

  3. Level 3

    Senior Site Reliability Engineer

    5–8 yrs

    Lead the design and implementation of complex cloud architectures, mentor junior engineers, and drive reliability initiatives.

  4. Level 4

    Principal Site Reliability Engineer

    8+ yrs

    Strategise and oversee the reliability and scalability of the entire cloud infrastructure, influencing company-wide practices and policies.

Pathway

How to become a Site Reliability Engineer

There's no single route, but most people follow some version of these steps.

  1. Learn the Basics

    Start with foundational knowledge in cloud platforms, scripting, and system administration.

  2. Gain Practical Experience

    Work on real-world projects, contributing to monitoring, automation, and incident response.

  3. Specialise in SRE Practices

    Deepen your expertise in reliability engineering, including capacity planning and disaster recovery.

  4. Lead Projects and Teams

    Take on leadership roles, managing teams and driving large-scale reliability initiatives.

  5. Influence Company Strategy

    Shape the company's cloud strategy and best practices, contributing to long-term reliability and efficiency.

Live jobs

13 live roles

Site Reliability Engineer

This role involves driving reliability, automation, and performance across cloud-hosted platforms in a regulated financial services environment. You'll implement SRE best practices, enhance observability, reduce operational toil through automation, and support incident management. The position blends infrastructure engineering with software principles within a high-performing platform team.

Huxley Associates London, City And County Of the City Of London, United Kingdom £90,000 pa
Hybrid Permanent Clearance Required

Site Reliability Engineer

This role involves ensuring the reliability and resilience of cloud-based production systems through automation, incident management, and continuous improvement. The engineer will lead major incident response, develop monitoring and alerting strategies using tools like Prometheus and Datadog, and build self-healing systems on AWS with Kubernetes and Docker. Collaboration with engineering and product teams is key to improving platform stability and operational efficiency.

Spectrum IT Recruitment Basingstoke, United Kingdom
Remote Permanent

Site Reliability Engineer, Region Services

Would you like to help implement innovative cloud computing solutions and solve the most complex technical problems? Are you excited by the prospect of building and running the world's largest cloud computing infrastructure to provide a better world for future...

Amazon London, United Kingdom
Permanent

Site Reliability Engineer, Cryptography, Access and Identity Services

This role involves building and operating large-scale, high-availability Cryptography, Access, and Identity services on AWS. Engineers focus on automation, root cause analysis, and operational excellence, using Linux, networking, and AI tooling to reduce manual toil. The position emphasizes ownership across the full service lifecycle in a cloud production environment.

Amazon London, United Kingdom
Hybrid Permanent

Site Reliability Engineer - SC Cleared

This role involves working on high-impact, mission-critical projects for Defence and National Security clients. You will be responsible for ensuring the reliability and performance of complex systems, using your expertise in software development, cloud infrastructure, and system monitoring.

Searchability NS&D Gloucestershire, United Kingdom £40,000 – £65,000 pa
Hybrid Permanent Clearance Required

Site Reliability / Software Engineer - SC Cleared

This role involves developing and maintaining secure, scalable software systems while also ensuring high reliability through SRE and DevOps practices. You'll work across the full software lifecycle, contributing to backend services in Java and Python, frontend components in JavaScript/TypeScript, and automating CI/CD and operational workflows. The position supports mission-critical systems in a regulated environment, with a strong focus on monitoring, observability, and performance optimization.

Searchability NS&D Gloucestershire, United Kingdom £45,000 – £65,000 pa
Hybrid Permanent Clearance Required

Azure Site Reliability Engineer

This role involves designing, building, and automating enterprise-grade Azure Landing Zones and cloud governance frameworks. You will collaborate with a global SRE team to ensure secure, scalable, and policy-driven Azure environments, focusing on IaC, CI/CD, and DevOps practices.

GCS Knutsford, Cheshire, United Kingdom £50,000 – £60,000 pa
Hybrid Permanent

Azure Site Reliability Engineer

Azure Site Reliability Engineer | £500 – £550 Outside IR35 | 6-month initial contract (project running until August 2027) | 2 days onsite in Central London. This is for my client's brownfield project; with new funding they are looking to level up their...

Opus Recruitment Solutions London, United Kingdom £500 – £550 pd

FAQs

Common questions

  • Essential skills include strong knowledge of cloud platforms, scripting, system administration, and automation tools. Familiarity with monitoring and incident response is also crucial.

  • SREs collaborate closely with developers to ensure that applications are designed for reliability and scalability. They provide feedback on system design and help implement automation and monitoring solutions.

  • SREs often work in fast-paced, collaborative environments. They may be part of on-call rotations and need to be available to respond to incidents at any time.

  • Advancement involves gaining experience, specialising in advanced SRE practices, and taking on leadership roles. Continuous learning and staying updated with the latest cloud technologies are also important.

  • Salary ranges can vary widely based on experience, location, and company size. For more detailed information, please refer to the salary section on this page.
