Site Reliability Engineer (SRE)

Mount Pleasant, Greater London
5 days ago
Create job alert

Site Reliability Engineer – (SRE, Site Reliability Engineer, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote

Charles Simon Associates are currently recruiting for an SRE Engineer on a permanent basis. This role is for a global business with a HQ in the City of London.

Candidates will need to be British Citizens due to Security Clearance requirements.

Location: Remote, with some travel to London

Salary: Up to £125,000 per annum

Skills/Requirements for the Site Reliability Engineer:

  • Extensive SRE experience within previous roles

  • Strong Terraform skills

  • Proven Kubernetes and AKS experience

  • Experience in creating and modifying terraform deployment on live environments

  • Experience with Monitoring solutions ideally Datadog, however Azure Application Insight, Log Analytics or Grafana

  • Scripting skills for automation within; PowerShell, Python or Bash

  • Experience with web based applications

    Desirable Skills:

  • Knowledge or commercial experience of Microservices Architecture

  • Kanban

  • Any prior experience of working with Puppet and Chef would be advantageous

    Start date is ASAP for the Site Reliability Engineer

    The Site Reliability Engineer will be responsible for:

  • Designing and enforcing service-level objectives (SLOs), SLIs, and SLAs to ensure reliability targets are measurable and aligned with business expectations

  • Implementing incident response frameworks, including runbooks, postmortems, and blameless RCA processes to drive continuous improvement

  • Integrating observability tooling (e.g. Prometheus, Grafana, Datadog, OpenTelemetry) to enable proactive detection and resolution of system anomalies

  • Managing infrastructure as code (IaC) using tools like Terraform, Pulumi, or CloudFormation to ensure repeatable, auditable deployments

  • Optimizing cost and resource utilization across cloud environments through rightsizing, autoscaling, and lifecycle policies

  • Driving chaos engineering initiatives to test system resilience under failure conditions and validate recovery strategies

  • Championing security best practices within infrastructure—e.g. secrets management, IAM policies, and vulnerability scanning

  • Collaborating with DevOps and platform teams to build paved-road deployment patterns and internal developer portals

  • Leading capacity planning and load testing efforts to anticipate scaling needs and prevent bottlenecks

  • Contributing to architectural decisions that impact reliability, latency, and fault domains across distributed systems

    Please send an up-to-date copy of your CV to be considered for the Site Reliability Engineer

    Site Reliability Engineer – (SRE, Site Reliability Engineer, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote

Related Jobs

View all jobs

Site Reliability Engineer Consultant

Site Reliability Engineer (SRE) - Defence

Site Reliability Engineer (SRE) - SC Cleared

Site Reliability Engineer (SRE)

Lead Site Reliability Engineer

MongoDB-Site Reliability

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Maths for Cloud Jobs: The Only Topics You Actually Need (& How to Learn Them)

If you are applying for cloud computing jobs in the UK you might have noticed something frustrating: job descriptions rarely ask for “maths” directly yet interviews often drift into capacity, performance, reliability, cost or security trade-offs that are maths in practice. The good news is you do not need degree-level theory to be job-ready. For most roles like Cloud Engineer, DevOps Engineer, Platform Engineer, SRE, Cloud Architect, FinOps Analyst or Cloud Security Engineer you keep coming back to a small set of practical skills: Units, rates & back-of-the-envelope estimation (requests per second, throughput, latency, storage growth) Statistics for reliability & observability (percentiles, error rates, SLOs, error budgets) Capacity planning & queueing intuition (utilisation, saturation, Little’s Law) Cost modelling & optimisation (right-sizing, break-even thinking, cost per transaction) Trade-off reasoning under constraints (performance vs cost vs reliability) This guide explains exactly what to learn plus a 6-week plan & portfolio projects you can publish to prove it.

Neurodiversity in Cloud Computing Careers: Turning Different Thinking into a Superpower

Cloud computing sits at the heart of modern tech. Almost every digital product runs on someone’s cloud platform – from banking apps & streaming services to AI tools & online shops. Behind those platforms are teams of cloud engineers, architects, SREs, security specialists & more. These roles demand problem-solvers who can think in systems, spot patterns, stay calm under pressure & imagine better ways to build & run infrastructure. That makes cloud computing a natural fit for many neurodivergent people – including those with ADHD, autism & dyslexia. If you are neurodivergent & considering a cloud career, you might have heard messages like “you’re too distracted for engineering”, “too literal for stakeholder work” or “too disorganised for operations”. In reality, many traits that come with ADHD, autism & dyslexia are exactly what cloud teams need. This guide is written for cloud computing job seekers in the UK. We will cover: What neurodiversity means in a cloud context How ADHD, autism & dyslexia strengths map to cloud roles Practical workplace adjustments you can ask for under UK law How to talk about neurodivergence in applications & interviews By the end, you should have a clearer sense of where you might thrive in cloud computing – & how to turn “different thinking” into a professional superpower.

Cloud Computing Hiring Trends 2026: What to Watch Out For (For Job Seekers & Recruiters)

As we move into 2026, the cloud computing jobs market in the UK is shifting again. The era of “lift & shift everything to the cloud” is giving way to a more mature, cost-conscious & security-focused phase. Many organisations are tightening budgets, some are rationalising cloud spend, yet demand for strong cloud talent remains high – especially around multi-cloud, FinOps, cloud security, data platforms & AI on cloud. Vendors are racing to integrate generative AI into their offerings, enterprises are modernising legacy estates, & regulators are asking tougher questions about resilience, sovereignty & risk. At the same time, some roles are being automated or commoditised, & the bar for cloud roles keeps rising. Whether you are a cloud job seeker planning your next move, or a recruiter building cloud teams, understanding the key cloud computing hiring trends for 2026 will help you stay ahead.