Senior Site Reliability Engineer

London
1 week ago
Applications closed


Senior DevOps Engineer / Senior Site Reliability Engineer

Fully Remote working for candidates based in the UK – Salary £80k to £100k (depending on experience) + Benefits

We are looking for a Senior DevOps Engineer who combines strong C# coding knowledge with strong knowledge of DevOps tools like Kubernetes (EKS or ideally AKS) and the Azure cloud platform. You will also bring experience of monitoring tools like DataDog, Grafana and Prometheus, and will join a growing global Cloud Infrastructure team supporting SaaS products.

Our client, a global digital SaaS software company, has a fantastic fully remote opportunity for an experienced Senior DevOps Engineer to join its UK Cloud Infrastructure team.

Site Reliability Engineers at this company are responsible for keeping the SaaS products running properly. Using concepts from software and systems engineering, they work to improve the reliability of all cloud systems while keeping levels of manual work low. DevOps Engineers are expected to be experienced in software engineering principles, operational discipline, and automation.

The Cloud and DevOps team works on a fully remote basis, in conjunction with the company's US and Australian teams. The company is a market leader in student community management software, and its unique SaaS platform is an essential part of the lives of millions of university students across the globe.

In this role, you will apply your software engineering experience to enhance system performance and reliability, and to build internal systems and capabilities that eliminate manual work through automation. You'll be joining our Platforms teams, working with globally dispersed Site Reliability and Platform Engineers in a "follow the sun" model to operate our products on a multi-region cloud platform.

Role Responsibilities:

  • Provide technical leadership and mentoring within the team through knowledge sharing sessions, pair programming, code reviews and solution design

  • Identify and implement technical solutions to improve platform reliability, including the creation of mitigation strategies and operational playbooks.

  • Implement and maintain monitoring/alerting/logging systems to identify and respond to incidents

  • Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth

  • Conduct performance tests to identify and remediate bottlenecks

  • Develop and maintain platform solutions, and automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code.

  • Monitor, review and tune databases to ensure high availability and performance

  • Collaborate with product engineering teams to design/build fit-for-purpose and observable software

Required Skills and Experience:

  • Proven experience in a Senior DevOps / Site Reliability Engineering role, with strong code development experience in C# or a similar OO development language.

  • Experience of supporting .NET applications as a DevOps Engineer is a big bonus in this role.

  • Production experience operating containerization technologies, ideally Kubernetes and/or Docker. AKS or EKS experience is strongly preferred.

  • Proficiency with one or more public cloud providers such as Azure, AWS or GCP

  • Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation.

  • Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar.

  • Proven track record of maintaining highly-available and performant production environments.

  • Ability to identify and implement effective mitigation strategies and operational playbooks.

Useful / Bonus Skills to have:

  • Experience in CI/CD tooling: Azure DevOps/GitHub Actions, Octopus Deploy

  • Relevant certifications in cloud platforms (e.g., Microsoft Certified: Azure Solutions Architect) and DevOps practices (e.g., Certified Kubernetes Administrator) are a plus

  • Experience in database management/performance tuning, particularly MSSQL.

Employee benefits:

  • Opportunity to be a part of a well-established, high-performance SaaS company with a 30+ year history.

  • Excellent Company Pension scheme and Life Insurance.

  • Excellent holiday allowance.

  • A supportive team environment with emphasis on learning and development opportunities

  • Working with a team of caring, high-performing, and passionate people who have fun supporting our vision, innovation, and continuous improvement.

This Senior Site Reliability Engineer role is with a market-leading global software company, and the job is part of a large programme of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, this would be a tremendous career opportunity to consider.

Please apply with your CV to find out more.

