SideBy Care · 5 months ago
Senior Data Engineer – Healthcare Data & AI Systems
SideBy Care is the first AI-powered virtual care service for GI practices and their patients with Disorders of Gut-Brain Interaction. They are seeking a Senior Data Engineer to build and manage the data backbone of their platform, focusing on complex data systems that power analytics and clinical decision-making in healthcare.
Health Care · Telehealth · Virtual Assistant
Responsibilities
Architect and implement robust data pipelines between EMRs, internal systems, and Snowflake, ensuring scalability, reliability, and data provenance
Lead the design of warehouse schemas for multiple use cases: transactional processing, reporting (BI), and statistical/ML analysis
Define and enforce standards for data semantics, integrity, quality, lineage, and access control
Collaborate with data scientists and ML engineers to enable production-grade ML workflows (e.g., TensorFlow pipelines, model monitoring, A/B testing infrastructure)
Experiment with and support the deployment of LLMs to enable reasoning, summarization, and classification on structured and unstructured data (e.g., clinical notes)
Build monitoring and alerting around pipeline health and data trustworthiness
Integrate and normalize complex healthcare data sources (FHIR/HL7, custom APIs, third-party vendors) into a unified analytics model
Partner with engineering and product teams to deliver data-driven features, dashboards, and insights
Qualifications
Required
5+ years of experience in data engineering or backend systems, with senior or staff-level contributions
Deep Python proficiency, with production experience in ETL, data validation, and orchestration frameworks (e.g., Airflow, Dagster, dbt)
Strong experience with data warehouse design, including star/snowflake schemas, denormalization strategies, and performance optimization
Strong understanding of data privacy and security practices, especially in healthcare (HIPAA, de-identification, audit logging, etc.)
Proven experience managing complex integrations with EMRs or clinical systems
Familiarity with LLM and ML development tools (e.g., TensorFlow, PyTorch, LangChain, transformers, vector DBs)
Experience deploying or supporting predictive models in production environments
Expertise in Snowflake or similar cloud data platforms (e.g., BigQuery, Redshift)
Strong grasp of data modeling, provenance, and semantics for analytical and AI purposes
Experience working with AWS services such as S3, Lambda, Batch, EventBridge, CloudFront, and EC2
Preferred
Experience working with graph-based reasoning engines or healthcare ontologies
Knowledge of analytics frameworks like Superset or Looker
Familiarity with HL7, FHIR, or other clinical interoperability standards
Exposure to real-time or streaming data systems (Kafka, Pulsar)
Benefits
Competitive pay
Flexible remote work culture
Company
SideBy Care
SideBy Care is a healthcare platform that specializes in virtual care services for gut health.
Funding
Current Stage
Early Stage
Company data provided by Crunchbase