Senior Data Engineer

Backend & Data - Remote - Full-time

Department: Backend & Data

Location: Remote

Type: Full-time

Posted: 2026-06-23

Job Description

Our client is a European industrial group building a group-wide manufacturing data platform. Plant and IoT telemetry streams through Kafka into a Databricks lakehouse, where Python pipelines and modeled transformations serve analytics and ML consumers across the group. You join the client's data platform team full-time. You own pipelines end to end, from Kafka ingestion through to the data contracts and quality gates that downstream teams build against. You work under the client's direction, alongside their platform architects, analytics engineers, and the ML team that consumes your tables.

What you'll be doing

Build and operate Kafka ingestion for plant and IoT telemetry from the client's manufacturing sites
Design PySpark pipelines on Databricks that move telemetry from raw ingestion to curated Delta tables
Model transformations in dbt and Spark SQL, turning raw plant data into documented, versioned datasets
Own data contracts between plant-side producers and the analytics and ML teams consuming the platform
Ship schema validation, freshness and volume checks that stop bad data before consumers see it
Tune Spark jobs and Delta tables for cost and latency as plant coverage grows
Run the platform day to day: monitor pipelines, resolve incidents, manage schema migrations and backfills
Review pipeline changes from engineers across the group and harden the shared ingestion libraries

What you'll need

6+ years of data engineering, with 3+ years running Spark pipelines in production
Strong Python engineering skills, with pipelines shipped as tested and reviewed code
Production Kafka experience: topic design, consumer groups, and schema management
Hands-on experience with Databricks or an equivalent Spark lakehouse stack, including Delta Lake or a comparable table format
Solid SQL and data modeling, with dbt or a comparable transformation framework
Experience defining data contracts and quality gates for analytics and ML consumers
Comfort operating what you build: pipeline CI/CD, monitoring, and incident response
Professional working proficiency in English
Based in the EU with working hours overlapping CET

Nice to have

Exposure to manufacturing data sources such as OPC UA, MES, or plant historian systems
Databricks Certified Data Engineer Professional or an equivalent Spark certification
Terraform and Kubernetes experience for pipeline infrastructure
German at B2 or above
Availability for occasional business travel within the EU

Engagement terms

Remote-first. Deliverable-based, no time tracking.
Monthly wellness allowance, scaling with tenure.
Annual learning budget, scaling with tenure.
Home office setup allowance, refreshed every two years.
25 days annual leave plus one additional day per year of tenure.
Birthday off.
Family leave and private healthcare coverage.