Site Reliability Engineer (Mid / Senior)

Reed
Sw113Ra, SW11 3RA, United Kingdom
Yesterday
Job Type
Permanent
Work Pattern
Full-time
Work Location
Hybrid
Seniority
Senior
Education
Degree
Visa Sponsorship
Available
Posted
30 Jun 2026 (Yesterday)

Benefits

On-call rota 25 days holiday + bank holidays Pension scheme Private healthcare Remote working allowance Professional development budget

Site Reliability Engineer (Mid / Senior)

South West London (Hybrid – 1–2 days onsite)
Salary: Competitive + Benefits

We are looking for aSite Reliability Engineer to join a well-established small infrastructure team supporting a highly available, production environment. This is an exciting opportunity to work across amodern, self-hosted platform spanning Kubernetes, physical infrastructure and automation, with a strong focus onUbuntu-based systems.

The Role

As an SRE, you will play a key role in ensuring theavailability, performance, security and resilience of production systems. Working in a small, collaborative team, you’ll take ownership of day-to-day platform operations, incident response and continuous improvement, while partnering closely with development teams to deliver reliable and scalable services.

Key Responsibilities

  • Administer and maintainLinux (Ubuntu) server environments
  • Manageself-hosted Kubernetes clusters and supporting infrastructure
  • Supporton-premise infrastructure including physical servers and virtualisation platforms
  • Administer storage solutions includingNFS, iSCSI and object storage
  • Build and maintainautomation using Ansible or similar IaC tools
  • Develop operational tooling usingBash and Python
  • Monitor system health using tools such asPrometheus, Grafana, Zabbix or Nagios
  • Investigate and resolve production incidents (on-call rota involved)
  • Implementsecurity hardening and infrastructure best practices
  • Managebackup and disaster recovery processes and regular testing
  • Support and improveCI/CD pipelines and deployment processes
  • Collaborate with engineering teams to improve reliability and performance

Essential Skills

  • StrongLinux systems administration (Ubuntu preferred)
  • Experience runningproduction Kubernetes environments
  • Solid understanding ofnetworking (TCP/IP, DNS, routing, firewalls)
  • Experience withphysical servers and virtualisation platforms
  • Hands-on experience withAnsible or other IaC tools
  • Scripting skills inBash and Python
  • Experience withmonitoring and alerting platforms
  • Knowledge ofLinux storage technologies (NFS, iSCSI)
  • Experience withbackup & disaster recovery
  • Exposure toActive Directory / Entra ID / endpoint management
  • Strong troubleshooting and problem-solving skills

Desirable Experience

  • Object storage, MariaDB or database administration
  • CI/CD tools such as Jenkins
  • AWS (S3, Lambda, CloudFront) exposure
  • Terraform or additional IaC tooling
  • Experience with Harvester or similar platforms
  • Knowledge of security, compliance or GDPR

Why Apply?

  • Work oncomplex, real-world infrastructure (not just cloud-native)
  • High ownership in asmall, collaborative team
  • Exposure to abroad modern tech stack across infra, Kubernetes and automation
  • Hybrid working with a competitive salary package

Related Jobs

View all jobs

Site Reliability Engineer

Huxley Associates London, City And County Of the City Of London, United Kingdom
£90,000 pa Hybrid Clearance Required

Site Reliability Engineer

Spectrum IT Recruitment Basingstoke, United Kingdom
Remote

Site Reliability Engineer, Region Services

Amazon London, United Kingdom
Permanent

Site Reliability Engineer , Cryptography, Access and Identity Services

Amazon London, United Kingdom
Hybrid

Site Reliability Engineer - SC Cleared

Searchability NS&D Gloucestershire, United Kingdom
£40,000 – £65,000 pa Hybrid Clearance Required

Site Reliability Engineer (Mid / Senior)

Reed Sw113Ra, SW11 3RA, United Kingdom
Hybrid

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.