Staff Software Engineer - Grafana Cloud k6 | UK

Grafana Labs
United Kingdom
Job Type: Permanent
Work Pattern: Full-time
Work Location: Remote
Seniority: Senior
Education: Degree
Posted: 18 May 2026 (Last week)

Benefits

  • Remote allowance
  • Flexible working hours

Grafana Labs, the company behind the open observability cloud, is founded on the principles of open source, open standards, open ecosystems, and open culture. Grafana Cloud, our fully managed observability platform, is flexible and built for scale. With Grafana Cloud's actually useful AI, organizations can see, understand, and act on all their disparate data to move at the speed of their ambitions. Today, more than 35 million users and 7,000+ customers – including Anthropic, Bloomberg, NVIDIA, Microsoft, and Salesforce – trust Grafana Labs to ensure reliability of their applications and systems, resolve incidents quickly, and optimize their telemetry to reduce noise and cost. We are a 100% remote company with 1,600+ team members across 40+ countries, and we’re backed by leading investors including Lightspeed Venture Partners, Sequoia Capital, GIC, Coatue, J.P. Morgan, CapitalG, and Lead Edge Capital. Learn more at grafana.com and follow us on LinkedIn and X.

We’re scaling fast and staying true to what makes us different: an open-source legacy, a global collaborative culture, and a passion for meaningful work. Our team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything we do.

You may not meet every requirement, and that’s okay. If this role excites you, we’d love you to raise your hand for what could be a truly career-defining opportunity.

This is a remote opportunity, and we are interested in applicants based in UK time zones.

Staff Software Engineer - Grafana Cloud k6

The Opportunity

We are the team behind Grafana k6, Grafana Cloud k6, and Grafana Cloud Synthetics, used by teams globally to ensure resilient, high-performing systems. This opportunity is with the Grafana Cloud k6 squad, who build and operate our performance testing product. Grafana Cloud k6 is built around the OSS k6 and targeted at users looking to run performance tests at scale. Our enterprise and SaaS offerings allow customers to load test their systems by running distributed tests from 15+ regions worldwide, using hundreds of thousands of virtual users sending millions of requests per second. We ingest huge volumes of data generated by k6, which can be used to view, correlate and analyze metrics from each test.

k6 is a product used by other engineers, and as such, we are looking for people enthusiastic about building high-quality tools they would want to use themselves. Due to our small teams and fast development pace, you will have a substantial and immediate impact on how the end product is architected and developed, and on how the engineering team operates.

Your role will focus on establishing and scaling a cross-team culture of engineering excellence by setting standards and guiding adoption of strong engineering practices that improve reliability and operational ownership. As this foundation matures, the role is expected to expand into broader application and product development leadership, contributing architectural and technical depth beyond operational excellence.

What will you be doing?

  • Contribute hands-on to the codebase by designing and implementing production-quality software.
  • Guide teams in the design, development, evolution, and operation of large-scale, distributed cloud systems.
  • Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.
  • Help mature SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
  • Establish reliability frameworks such as SLIs/SLOs and error budgets, and use them to guide prioritization and engineering trade-offs.
  • Provide visibility into system health through clear operational metrics and reliability reporting.
  • Participate in the on-call rotation as a primary escalation point and contribute to incident resolution.
  • Influence product and system direction through design reviews, architectural discussions, and cross-team collaboration.
  • Share knowledge through clear, high-quality documentation and technical communication—internally and, where appropriate, externally—to help teams build and operate systems more effectively.
  • As the reliability foundation matures, grow into broader application and product development leadership, contributing architectural and technical depth beyond operations.

We invest heavily in developer productivity. You can use modern AI coding assistants as part of your daily workflow (your choice of tools, within security guidelines), backed by a company-funded usage budget so you can iterate quickly without unnecessary friction.

We encourage pragmatic AI-assisted development: faster prototyping, test generation, refactors, documentation, and incident follow-ups—always paired with strong code review and quality standards.

You’ll also have access to frontier models (e.g., GPT-Codex 5/3, Claude Opus 4.6, Gemini 3 Pro).

Requirements:

  • Strong programming background in a modern language (Python and Go are our primary languages, but prior experience with them is not required)
  • Experience designing, building, and operating large-scale distributed systems
  • Strong experience with SRE practices, including operating and evolving production systems at scale
  • Strong understanding of reliability engineering concepts (e.g. incident management, observability, and failure modes)
  • Strong experience defining or applying SLIs/SLOs, error budgets, or reliability metrics
  • Experience with test automation, including performance and functional testing
  • Ability to influence engineering practices through clear technical communication, reviews, and collaboration
  • Strong interpersonal skills and ability to work effectively across teams
  • Familiarity with modern software engineering processes and delivery practices
  • Self-driven and comfortable operating with a high degree of autonomy and ambiguity
  • Experience participating in blameless incident response and writing high-quality post-incident reviews

Bonus Points for:

  • Experience with containerized and cloud-native systems (Docker, Kubernetes, AWS)
  • Familiarity with observability tooling and platforms (e.g. the Grafana stack)
  • Experience working with Python, Go, JavaScript and/or Jsonnet
  • Experience building or operating event-driven or asynchronous systems
  • Interest in, or experience with, building testing frameworks or developer tooling

Compensation & Rewards:

In the United Kingdom, the base compensation range for this role is GBP 103,958 - GBP 124,750. Actual compensation may vary based on level, experience, and skillset as assessed in the interview process. Benefits include equity, bonus (if applicable) and other benefits listed here.

*Compensation ranges are country specific. If you are applying for this role from a different location than listed above, your recruiter will discuss your specific market’s defined pay range & benefits at the beginning of the process.

Why You’ll Thrive at Grafana Labs:

  • 100% Remote, Global Culture – As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
  • Scaling Organization – Tackle meaningful work in a high-growth, ever-evolving environment.
  • Transparent Communication – Expect open decision-making and regular company-wide updates.
  • Innovation-Driven – Autonomy and support to ship great work and try new things.
  • Open Source Roots – Built on community-driven values that shape how we work.
  • Empowered Teams – High trust, low ego culture that values outcomes over optics.
  • Career Growth Pathways – Defined opportunities to grow and develop your career.
  • Approachable Leadership – Transparent execs who are involved, visible, and human.
  • Passionate People – Join a team of smart, supportive folks who care deeply about what they do.
  • In-Person Onboarding – We want you to thrive from day 1, so you will join your fellow new ‘Grafanistas’ to learn all about what we do and how we do it.
  • Balance is Key – We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect. *We will comply with local legislation where applicable.

Equal Opportunity Employer: We will recruit, train, compensate and promote regardless of race, religion, color, national origin, gender, disability, age, veteran status, and all the other fascinating characteristics that make us different and unique. We believe that equality and diversity build a strong organization and we’re working hard to make sure that’s the foundation of our organization as we grow.

Grafana Labs may utilize AI tools in its recruitment process to assist in matching information provided in CVs to job postings. The recruitment team will continue to review inbound CVs manually to identify alignment with current openings.


For information about how your personal data is used once you’ve applied to a job, check out our privacy policy.
