Principal Engineer - Site Reliability / SRE (we have office locations in Cambridge, Leeds & London)

London
1 week ago
Create job alert

Company Description

Genomics England partners with the NHS to provide whole genome sequencing diagnostics. We also equip researchers to find the causes of disease and develop new treatments – with patients and participants at the heart of it all.

Our mission is to continue refining, scaling, and evolving our ability to enable others to deliver genomic healthcare and conduct genomic research.

We are accelerating our impact and working with patients, doctors, scientists, government and industry to improve genomic testing, and help researchers access the health data and technology they need to make new medical discoveries and create more effective, targeted medicines for everybody.

Job Description

Are you passionate about helping to mature technical practices and empowering teams to run resilient and reliable services? At Genomics England we are looking for a Principal Site Reliability Engineer to help lead and refocus our small SRE capability, ultimately growing it to become thought-leaders in system reliability across the organisation.

About the Role

As the Principal Site Reliability Engineer, you will be a hands-on contributor with exemplary platform engineering skills, who also has an ability to lead and think strategically to identify and prioritise areas with the greatest impact. You may have previously worked in a variety or roles and job titles within engineering, and you may have worked in a variety of organisational contexts. Whatever your past experience, it will have given you a deep understanding of SRE principles and practices and how these are used to build and operate reliable services that exceed customer expectations.

You will be a problem-solver who identifies risks, issues, gaps, and dependencies and brings people together to find solutions. You will do this through your supportive, empathetic and collaborative behaviours - acting as coach, mentor, guide or constructive questioner as the situation demands. You are pragmatic but also mindful of the big picture - consciously balancing the immediate goals of teams against the long-term direction for their products. You will be a great communicator, comfortable not just leading your own team, but also engaging across the engineering community and with non-technical stakeholders.

About the Tech Stack

The SRE team will support squads that run a variety of services: most of these are either user-facing web applications (React), backend APIs (Python), bioinformatics pipelines (NextFlow), or data ETL workflows (Prefect, Dremio). These services increasingly run in AWS, though there is still a significant on-premise presence, and they run in a mixture of compute environments, from ECS/Fargate to HPC clusters to (occasionally) Kubernetes.

Within the SDLC we have a standard toolchain which includes Terraform for infrastructure-as-code, GitLab for source code and CI/CD, Artifactory for software artefacts, and DataDog for observability. We are working to become interoperable with the wider NHS via open standards like FHIR and GA4GH APIs and increasingly aiming to integrate with their own API Management platform.

Job Description

Assess the current state of human and technical processes and practices from a site reliability perspective, and identify the areas of greatest concern
Develop a roadmap of initiatives aimed at raising the maturity of live services through SRE practices (SLOs, Critical User Journeys, monitoring with Golden Signals, release-engineering etc.)
Grow and lead a small team of Site Reliability Engineers to roll out those initiatives across the organisation through partnership with product squads
Work with key stakeholders to establish and embed key SRE principles (e.g. Error Budget Policies)
Maintain close contact with Engineering leadership and other "enabling" teams (Test Enablement, Developer Platform) to ensure that SRE work is fully aligned with our direction of travel
Stay abreast of emerging technologies and industry trends, and incorporate them into our software development practices
Contribute to the wider conversation at Genomics England and help mature our technical practicesQualifications

While we recognise the value of relevant qualifications or certifications, we are primarily interested in your real-world experience.

Essential Skills and Experience

Comprehensive knowledge of SRE principles and practices with significant experience of applying these to real-world situations
Excellent software engineering skills especially in the context of release automation and other toil-eliminating activities (Python preferred, polyglot ideal)
Strong understanding of how architecture and other factors contribute to the overall resilience of systems
Extensive experience of platform engineering across CI/CD, Infrastructure as Code, operational monitoring and alerting, backup and recovery etc.
Experience in at least one major public cloud (AWS preferred but not essential)
Demonstrable ability to lead teams, manage, direct, mentor and plan work 
Strong interpersonal skills with a temperament that builds trust and connection within and across squads through open, honest communication
Comfortable engaging responsively with teams both remotely and in person when required
Ability to navigate rapidly to effective solutions through engaged and inclusive listening, clarity of thought, clear documentation, and succinct presentationDesirable Experience

These skills are not essential but if you have either of them they may prove to be useful:

Background in healthcare or bioinformatics
Experience in regulated environmentsIf you’re an experienced Site Reliability Engineer leader, who thrives on working collaboratively to mature engineering practices, we’d love to hear from you. Join us at Genomics England and make a meaningful impact in the world of genomics.

Qualifications

While we recognise the value of relevant qualifications or certifications, we are primarily interested in your real-world experience.

Additional Information

Salary: £103,500

Being an integral part of such a meaningful mission is extremely rewarding in itself, but in order to support our people, we’re continually improving our benefits package. We pride ourselves on investing in our people and supporting them to achieve their career goals, as well as offering a benefits package including: 

Generous Leave: 30 days’ holiday plus bank holidays, additional leave for long service, and the option to apply for up to 30 days of remote working abroad annually (approval required).
Family-Friendly: Blended working arrangements, flexible working, enhanced maternity, paternity and shared parental leave benefits.
Pension & Financial: Defined contribution pension (Genomics England double-matches up to 10%, however you can contribute more if you wish), Life Assurance (3x salary), and a Give As You Earn scheme.
Learning & Development: Individual learning budgets, support for training and certifications, and reimbursement for one annual professional subscription (approval required).
Recognition & Rewards: Employee recognition programme and referral scheme.
Health & Wellbeing: Subsidised gym membership, a free Headspace account, and access to an Employee Assistance Programme, eye tests, flu jabs.Equal opportunities and our commitment to a diverse and inclusive workplace 

Genomics England is actively committed to providing and supporting an inclusive environment that promotes equity, diversity and inclusion best practice both within our community and in any other area where we have influence. We are proud of our diverse community where everyone can come to work and feel welcomed and treated with respect regardless of any disability, ethnicity, gender, gender identity, religion, sexual orientation, or social background. 

Genomics England’s policies of non-discrimination and equity and will be applied fairly to all people, regardless of age, disability, gender identity or reassignment, marital or civil partnership status, being pregnant or recently becoming a parent, race, religion or beliefs, sex or sexual orientation, length of service, whether full or part-time or employed under a permanent or a fixed-term contract or any other relevant factor.  

Genomics England does not tolerate any form of discrimination, harassment, victimisation or bullying at work. Such behaviour is contrary to

Related Jobs

View all jobs

Principal Software Engineer

Principal DevOps Engineer

Platform Engineer

Lead / Principal Software Developer – Platform Development

Principal Data Scientist - Remote

CHECK Team Lead

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Veterans in Cloud Computing: A Military‑to‑Civilian Pathway into Cloud Careers

Introduction The United Kingdom’s public‑cloud market will exceed £45 billion by 2026 (Gartner forecast), fuelled by government digital‑service mandates, fintech scale‑ups, and the MOD’s own Cloud Strategic Roadmap 2025. From hybrid defence networks to NHS electronic health records, almost every mission‑critical workload is moving to AWS, Microsoft Azure, or Google Cloud Platform (GCP). That explosive demand collides with a nationwide skills shortage—over 135,000 unfilled cloud roles in 2024, according to TechUK. Veterans already possess the security clearances, network know‑how, and operational discipline that cloud employers crave. This guide shows you how to translate your military experience into cloud‑native language, leverage Ministry of Defence (MoD) transition funding, and secure a rewarding second career (with salaries starting at £40k and rising above £100k for cleared DevSecOps leads). Quick Win: Bookmark our live board for Cloud Solutions Architect roles to see who’s hiring today.

Rural-Remote Cloud Computing Jobs: Pioneering a New Work-Life Balance

A Fresh Outlook for Cloud Professionals The field of cloud computing has rapidly become one of the most crucial pillars of the modern tech landscape. From multinational corporations to innovative start-ups, organisations rely on remote servers and virtualised infrastructure to store data, deploy applications, and deliver services globally. Conventional wisdom once held that this high-tech world was exclusively tied to large urban hubs—particularly London. However, a growing number of roles in this domain are now just as easily fulfilled from the rural countryside, where rolling hills, seaside towns, and smaller communities provide a more laid-back and affordable lifestyle. At CloudComputingJobs.co.uk, we’re seeing a surge in opportunities that emphasise remote and flexible work. Many professionals are on the hunt for “tech jobs by the sea” or “cloud computing remote countryside” roles, eager to escape the hustle of city life without sacrificing access to cutting-edge technologies. This shift is fuelled by factors like improved rural internet connectivity, evolving corporate cultures, and the inherent nature of cloud work, which primarily revolves around online platforms and virtual collaboration tools. In this article, we’ll explore the reasons behind this migration from traditional city offices to rural and coastal areas, the advantages (and challenges) of such moves, and practical advice on securing a cloud computing job that’s location-agnostic. Whether you dream of swapping your daily Tube commute for a stroll along a coastal path or a walk through a tranquil village lane, this might be the perfect time to redefine your work-life balance—and still shape the future of cloud tech.

Quantum-Enhanced AI in the Cloud: A New Dawn for Distributed Computing

Cloud computing has reshaped the digital landscape, freeing organisations from on-premises hardware constraints and paving the way for services that scale on demand. From hosting business-critical databases to enabling global content delivery networks, the cloud has become the backbone of modern IT. Parallel to this evolution, Artificial Intelligence (AI) has rapidly matured, offering transformative capabilities in data analytics, recommendation systems, and automation. Yet for all its power, conventional computing infrastructure sometimes struggles to handle the complex, high-dimensional problems AI can generate—especially in areas like drug discovery, advanced cryptography, or intricate optimisation. Quantum computing has burst onto the scene as a new paradigm that might address these computational bottlenecks. By leveraging quantum bits (qubits) and phenomena like superposition and entanglement, quantum computers could, in principle, explore vast solution spaces at unprecedented speeds. What happens when these three forces—cloud computing, AI, and quantum technology—converge? We enter a new era of quantum-enhanced AI in the cloud, where distributed systems seamlessly integrate classical and quantum resources to deliver powerful, scalable, and efficient solutions. This article will explore: How cloud computing has set the stage for next-generation AI. The basics of quantum computing and why it’s poised to disrupt traditional workloads. Real-world applications of quantum-enhanced AI that leverage cloud infrastructure. Emerging job roles and skill requirements in this fast-evolving landscape. Ethical, technical, and logistical considerations that must guide future developments. Whether you’re a seasoned cloud professional looking to stay ahead of the curve, a data scientist curious about the promise of quantum, or a tech enthusiast eager to see what’s next, read on. The future of computing may be closer than you think.