Senior Data Engineer, Clinical Data Platform

Senior Data Engineer, Clinical Data Platform

DataArt

Praca zdalna

Warszawa
Lublin
Łódź
Kraków
Wrocław
B2B
Praca stała
data engineering
Databricks
Spark
Delta Lake
data pipelines
data quality
data modeling
CI/CD
analytics reporting
Python

Hexjobs Insights

Senior Data Engineer to develop a clinical data platform on Databricks. Responsibilities include building data pipelines and ensuring data quality. Requires 5+ years experience in data engineering and strong Databricks skills.

Słowa kluczowe

data engineering
Databricks
Spark
Delta Lake
data pipelines
data quality
data modeling
CI/CD
analytics reporting
Python

We are considering only candidates who are located in Poland.Project overviewYou will work on a platform that processes clinical and real-world data (EHRs, labs, registries, trial data) and powers analytics, reporting, and data products for a healthcare / clinical research client.Position overviewWe are looking for a Senior Data Engineer to build and operate a clinical data platform on Databricks, with a strong focus on robust data pipelines, data models, and data quality.Technology stackThe platform is built on Databricks (Spark, Delta Lake) and includes reusable pipelines, a shared data model, and automated data quality checks.ResponsibilitiesDesign, build, and maintain end-to-end Databricks data pipelines (ingestion, transformation, publishing) for production useWork with data models (staging, curated, canonical, or dimensional) and help evolve them together with architects and analystsEmbed data quality and data governance rules into all pipelines (checks, validation, monitoring, alerting)Optimize Databricks jobs for performance and cost (cluster configuration, partitioning, caching, file layout)Collaborate with data architects, analysts, and domain experts to clarify requirements and refine technical solutionsRequirements5+ years of experience in data engineering, DWH, or big data, including production data pipelinesStrong hands-on experience with Databricks: Spark (PySpark/Scala), Delta Lake, Databricks Jobs / WorkflowsProven experience designing and operating end-to-end pipelines on Databricks for batch or near-real-time dataExperience with data pipelines and CI/CD for dataPractical experience with data modeling (layered models, canonical or dimensional models) for analytics and reportingExperience embedding data quality and data governance rules into pipelines (schema checks, business rules, SLOs, monitoring)Good communication skills, upper-intermediate or higher English proficiency, and the ability to work closely with stakeholders in distributed teams and communicate directly with clientsNice to haveExperience designing and delivering PoC solutions on Databricks to quickly validate ideas using real dataExperience with ontologies or a semantic layer (business concepts, metrics, mappings) on top of analytical data

Wyświetlenia: 10
Opublikowana26 dni temu
Wygasaza 28 dni
Rodzaj umowyB2B, Praca stała
Źródło
Logo

Podobne oferty, które mogą Cię zainteresować

Na podstawie "Senior Data Engineer, Clinical Data Platform"

Nie znaleziono ofert, spróbuj zmienić kryteria wyszukiwania.