National AI Awards 2025Discover AI's trailblazers! Join us to celebrate innovation and nominate industry leaders.

Nominate & Attend

Databricks Engineer

London
6 days ago
Create job alert

Data Pipeline Development:

Design and implement end-to-end data pipelines in Azure Databricks, handling ingestion from various data sources, performing complex transformations, and publishing data to Azure Data Lake or other storage services.
Write efficient and standardized Spark SQL and PySpark code for data transformations, ensuring data integrity and accuracy across the pipeline.
Automate pipeline orchestration using Databricks Workflows or integration with external tools (e.g., Apache Airflow, Azure Data Factory).
Data Ingestion & Transformation:

Build scalable data ingestion processes to handle structured, semi-structured, and unstructured data from various sources (APIs, databases, file systems).
Implement data transformation logic using Spark, ensuring data is cleaned, transformed, and enriched according to business requirements.
Leverage Databricks features such as Delta Lake to manage and track changes to data, enabling better versioning and performance for incremental data loads.
Data Publishing & Integration:

Publish clean, transformed data to Azure Data Lake or other cloud storage solutions for consumption by analytics and reporting tools.
Define and document best practices for managing and maintaining robust, scalable data pipelines.
Data Governance & Security:

Implement and maintain data governance policies using Unity Catalog, ensuring proper organization, access control, and metadata management across data assets.
Ensure data security best practices, such as encryption at rest and in transit, and role-based access control (RBAC) within Azure Databricks and Azure services.
Performance Tuning & Optimization:

Optimize Spark jobs for performance by tuning configurations, partitioning data, and caching intermediate results to minimize processing time and resource consumption.
Continuously monitor and improve pipeline performance, addressing bottlenecks and optimizing for cost efficiency in Azure.
Automation & Monitoring:

Automate data pipeline deployment and management using tools like Terraform, ensuring consistency across environments.
Set up monitoring and alerting mechanisms for pipelines using Databricks built-in features and Azure Monitor to detect and resolve issues proactively.
Requirements

Data Pipeline Expertise: Extensive experience in designing and implementing scalable ETL/ELT data pipelines in Azure Databricks, transforming raw data into usable datasets for analysis.
Azure Databricks Proficiency: Strong knowledge of Spark (SQL, PySpark) for data transformation and processing within Databricks, along with experience building workflows and automation using Databricks Workflows.
Azure Data Services: Hands-on experience with Azure services like Azure Data Lake, Azure Blob Storage, and Azure Synapse for data storage, processing, and publication.

Data Governance & Security: Familiarity with managing data governance and security using Databricks Unity Catalog, ensuring data is appropriately organized, secured, and accessible to authorized users.
Optimization & Performance Tuning: Proven experience in optimizing data pipelines for performance, cost-efficiency, and scalability, including partitioning, caching, and tuning Spark jobs.
Cloud Architecture & Automation: Strong understanding of Azure cloud architecture, including best practices for infrastructure-as-code, automation, and monitoring in data environments

Related Jobs

View all jobs

Senior Data Engineer

Senior Data Engineer

SQL Database Developer

National AI Awards 2025

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

LinkedIn Profile Checklist for Cloud Computing Jobs: 10 Tweaks to Skyrocket Recruiter Engagement

The cloud computing industry continues to expand at breakneck speed, with organisations seeking experts in AWS, Azure, Google Cloud and multi‑cloud architectures. Recruiters sift through numerous profiles to find candidates skilled in infrastructure, automation, security and cost optimisation. To differentiate yourself, your LinkedIn profile must be optimised for search visibility and present a compelling narrative of your cloud expertise. This step-by-step LinkedIn for cloud computing jobs checklist reveals 10 targeted tweaks that will skyrocket recruiter engagement. Whether you’re an infrastructure engineer, DevOps specialist or cloud architect, these actionable adjustments will sharpen your profile and attract the right hiring managers.

Part-Time Study Routes That Lead to Cloud Computing Jobs: Evening Courses, Bootcamps & Online Masters

Cloud computing has become the backbone of modern IT infrastructure, powering everything from e‑commerce platforms to machine learning pipelines. As UK businesses increasingly migrate workloads to public clouds like AWS, Azure and Google Cloud, demand for skilled cloud professionals—architects, engineers, DevOps specialists and site reliability engineers—is surging. Forecasts suggest that cloud computing will account for over 30% of all IT spending in the UK by 2027, opening thousands of new roles across sectors from finance to healthcare and government. But many aspiring cloud practitioners cannot afford to pause their careers for full‑time study. Thankfully, an expanding ecosystem of part‑time learning options—Evening Courses, Intensive Bootcamps and Flexible Online Master’s Programmes—empowers you to learn cloud computing while working, balancing professional development with existing commitments. This comprehensive guide walks you through every pathway: foundational CPD, hands‑on bootcamps, accredited online MScs, plus funding avenues and practical planning advice. Whether you’re an on‑premises sysadmin, a software developer or a project manager, you’ll discover how to build cloud expertise on your schedule.

The Ultimate Assessment-Centre Survival Guide for Cloud Computing Jobs in the UK

Assessment centres for cloud computing positions in the UK put candidates through rigorous simulations reflecting real-world demands—designing scalable architectures, integrating services, and collaborating under pressure. These multi-stage events test your technical depth, problem-solving agility and interpersonal finesse. Whether you’re targeting roles in AWS, Azure or Google Cloud environments, this guide offers step-by-step insights to help you stand out at every stage.