Lead Data Engineer - Computational Discovery jobs in United States
cer-icon
Apply on Employer Site
company-logo

Proclinical Staffing ยท 12 hours ago

Lead Data Engineer - Computational Discovery

Proclinical Staffing is seeking a Lead Data Engineer to play a pivotal role in designing, implementing, and maintaining scalable data pipelines and structures. The role focuses on integrating complex scientific datasets into modern cloud architectures and collaborating with cross-functional teams to ensure robust, scalable, and FAIR-compliant solutions.

Staffing & Recruiting
check
H1B Sponsor Likelynote
Hiring Manager
Anderson Maldonado
linkedin

Responsibilities

Act as a hands-on technical lead, defining architecture and coding scalable ETL pipelines and data structures
Oversee the ingestion of complex datasets (e.g., genomics, proteomics, imaging, lab data) into cloud-based data lakes
Lead data engineering projects, designing integration solutions for diverse scientific data sources
Develop automated procedures to normalize unformatted external vendor data into a structured Common Data Model (CDM)
Collaborate with research and IT teams to align infrastructure with scientific needs
Architect and implement scalable ETL processes, APIs, and visualization tools for data access
Engage stakeholders to gather requirements and incorporate feedback into designs
Lead user acceptance testing (UAT) to ensure high-quality deliverables
Promote FAIR principles and interoperability across translational and clinical programs

Qualification

PythonSQLCloud architecturesData modelingAPI developmentFAIR principlesClinical data standardsBiomarker data formatsCommunication skillsCollaboration skills

Required

Proficiency in Python, including libraries such as Pandas, PySpark, Dask, and SQLAlchemy
Advanced knowledge of SQL and workflow orchestration tools like Airflow, Dagster, or Prefect
Experience with modern cloud architectures (e.g., Azure Fabric, Databricks, Snowflake)
Strong understanding of data modeling, ETL processes, and schema design for complex datasets
Expertise in API development for data access
Familiarity with FAIR principles and metadata standards for scientific data
Excellent communication and collaboration skills to bridge IT and scientific teams

Preferred

Knowledge of clinical data standards (e.g., SDTM, ADaM, CDISC) and biomarker data formats (e.g., NGS, flow cytometry, proteomics)

Company

Proclinical Staffing

twitter
company-logo
At Proclinical Staffing, our life science recruitment services support our partners with permanent and contract vacancies by connecting them with specialist talent across the globe.

H1B Sponsorship

Proclinical Staffing has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2024 (1)
2023 (1)
2022 (1)
2021 (1)

Funding

Current Stage
Growth Stage
Company data provided by crunchbase