aKUBE · 1 day ago
Senior Data Engineer - Product Performance Data - 1573
aKUBE is seeking a Senior Data Engineer to work on product performance data. The role involves building and maintaining large-scale Spark pipelines and delivering analytics-ready datasets to support product performance and reporting.
Responsibilities
Build and maintain large-scale Spark pipelines processing clickstream and telemetry data
Develop shared Scala and Python libraries to standardize business logic across pipelines
Own Databricks workloads orchestrated via Airflow with strict uptime SLAs
Deliver analytics-ready datasets supporting product performance and reporting
Define and document standards for pipeline design, partitioning, and data quality
Qualifications
Required
Apache Spark with Scala as the primary production language
Advanced SQL on large datasets (complex joins, window functions, performance tuning)
Hands-on experience with clickstream or user browse event data at scale
Databricks pipeline development with end-to-end ownership
Airflow for orchestration of SLA-driven production pipelines
5+ years of hands-on data engineering experience
Strong production experience with Spark, Scala, SQL, and Databricks
Experience supporting high-volume, event-based data platforms
Bachelor's degree or equivalent experience
Preferred
Experience with media, streaming, or consumer-facing product analytics data
Familiarity with telemetry or quality-of-service datasets
Experience supporting experimentation or A/B testing data