
Heluna Health · 1 day ago

Data Scientist - Analytics Engineer (N375)

Heluna Health is focused on improving care for individuals experiencing or at risk of homelessness through data-driven initiatives. The Analytics Engineer will play a vital role in building data models and supporting the integration of data to enhance performance tracking and policy guidance.

Non Profit, Healthcare, Advertising, Health Care, Sponsorship
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Build and maintain semantic data models (silver and gold layers) in Spark/Databricks, primarily through ETL pipelines written in PySpark
Understand and identify entity relationships among large collections of normalized backend tables to design accurate, denormalized, analyst-ready structures
Contribute to schema and catalog design decisions, including naming conventions for static vs. live feeds and ad hoc data use cases. This includes creating and maintaining documentation that clarifies data model logic, table relationships, and mapping assumptions to support downstream users and internal knowledge transfer
Collaborate with program and analytic teams to understand and translate both the business rules used to define data fields and the needs of analytic teams that use the semantic layer to produce reports and dashboards
Collaborate with the Privacy Engineer to ensure analytic datasets align with RBAC policies, de-identification requirements, and data classification standards set for Departmental and Countywide use
Contribute to, update, and maintain centralized code repositories used for data transformations
Participate in Dev/Prod promotion workflows using GitHub, ensuring proper validation and configuration for CI/CD deployment
Apply expectations and version control to standardize, test, and document pipelines
Collaborate with Evaluation and Reporting teams to align models with use cases and downstream needs
Participate in integration of external and internal sources using Azure services and contribute to scalable, secure data pipelines
Support data modeling standards and technical documentation practices
Assist in onboarding and training analysts to use the semantic layer effectively

Qualifications

Databricks, Spark, Python, SQL, Data Engineering, Machine Learning, Predictive Analytics, GitHub, CI/CD Workflows, Azure Data Factory, Power BI, Tableau, Technical Documentation, Collaboration

Required

Two (2) years of experience applying advanced statistical analyses, including predictive analytics or data engineering, to produce actionable recommendations to support data-driven program, policy, and operational decision-making, at a level equivalent to the Los Angeles County class of Predictive Data Analyst
Experience at the level of Predictive Data Analyst is defined as using machine learning techniques or data engineering practices to analyze or support analysis of complex data sets and find statistically significant, meaningful predictive patterns, relevant to program goals, that human intelligence could not identify on its own
A Bachelor's degree from an accredited college in a field of applied research such as Data Science, Machine Learning, Mathematics, Statistics, Business Analytics, Psychology, Computer Science, or Public Health that included 12 semester or 18 quarter units of coursework in data science, data engineering, predictive analytics, quantitative research methods, or statistical analysis
Four (4) years of experience applying data engineering, machine learning, predictive analytics, and data management, to conduct or support hypothesis-driven data analysis to produce actionable recommendations to support data-driven program, policy, and operational decision-making
A Master's or Doctoral degree from an accredited college or university in a field of applied research such as Data Science, Machine Learning, Mathematics, Statistics, Business Analytics, Psychology, Public Health, or similar related fields may substitute for up to two (2) years of experience
A valid California Class C Driver License or the ability to utilize an alternative method of transportation when needed to carry out job-related essential functions
Successful clearance of Live Scan with the County of Los Angeles
4+ years of experience building data transformations and models in Databricks or Spark-based environments
Strong knowledge of Medallion Architecture and curated model development
Skilled in working with normalized datasets and applying entity resolution techniques to build clean, reliable analytic tables joined across systems (e.g., MDM-linked client records)
Experience using declarative syntax to manage and implement data transformations in such tools as Delta Live Tables, dbt, and Spark
Proficient in SQL, Python, GitHub, and CI/CD workflows
Experience developing and maintaining Databricks notebooks used in orchestrated jobs, including environment-based configuration using YAML/JSON
Familiar with Azure Data Factory, Synapse, and Terraform
Understanding of HIPAA, FERPA, and governance in health and social service data
Experience supporting dashboards (Power BI, Tableau) and ensuring downstream data usability
Ability to work across technical and program teams and contribute to shared engineering practices

Company

Heluna Health

Heluna Health offers program services & fiscal sponsorship for public health agencies, academic researchers, public/private consortia.

H1B Sponsorship

Heluna Health has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (2)
2023 (4)
2022 (4)

Funding

Current Stage
Late Stage

Leadership Team

Blayne Cutler
President and CEO
Elizabeth Power Robison, MBA
Chief Advancement Officer
Company data provided by Crunchbase