SIGN IN
Senior Data Engineer (GenAI & LLM Infrastructure) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Proscia · 1 day ago

Senior Data Engineer (GenAI & LLM Infrastructure)

Proscia is a company focused on transforming pathology through digitization and AI to improve cancer treatment. They are seeking a Senior Data Engineer to build and scale data and AI systems that enhance outcomes for cancer patients and support research in therapies and drug regimens.
BiotechnologyArtificial Intelligence (AI)HealthcareLife ScienceMedical
check
H1B Sponsor Likelynote

Responsibilities

Build and deploy LLM-enabled data products/workflows that turn structured + unstructured inputs into curated, research-ready outputs
Develop and refine data pipelines and warehouse layers (raw → curated → marts) to support both analytics and AI workflows
Implement LLMOps/MLOps foundations: evaluation, versioning, monitoring/observability, and safe release processes for model/prompt changes
Deliver traceable and reproducible outputs (evidence references, run metadata, input/version tracking) so results can be explained and debugged
Identify and implement process improvements—automation, reliability controls, and quality checks—to accelerate delivery and reduce manual effort
Collaborate with core engineering, AI, and RWD stakeholders to align technical strategy and integrate solutions into the broader Proscia platform

Qualification

PythonGenAI/LLM solutionsLLM lifecycle managementSQLData modelingData warehouse patternsCI/CDDockerKubernetesAWSCollaborationAdaptabilityProblem-solving

Required

Strong experience building production systems in Python
Demonstrated experience delivering GenAI/LLM solutions into production (beyond experimentation), such as structured extraction pipelines, retrieval/embedding-based systems, or LLM-powered analytics workflows
Experience owning the LLM lifecycle in production: prompt/model versioning, evaluation/regression testing, monitoring, and controlled releases
Experience shipping an LLM-enabled workflow end-to-end (design → build → deploy → operate)
Experience building systems where outputs are testable, traceable, and reproducible (evidence references, versioning, run logs)
A pragmatic approach to reliability—handling ambiguity, conflicts, and change without breaking downstream analytics
Solid fundamentals in SQL, data modeling, and data warehouse patterns; experience with Snowflake or similar platforms
Software engineering practices: unit/integration testing, CI/CD, and containerization (Docker; Kubernetes)
Experience with cloud platforms (AWS preferred)
Comfort selecting and integrating the right tools to build, evaluate, deploy, and operate LLM workflows in production (we're tooling-agnostic and prioritize end-to-end delivery over specific frameworks)
The ability to work independently, move quickly, and collaborate effectively across teams
Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related field

Preferred

Master's preferred
Experience in life sciences/biopharma is a plus (domain experience is helpful, but not required)

Benefits

Competitive pay
Comprehensive benefits
Flexible schedules
Insurance options

Company

Proscia

twittertwittertwitter
company-logo
Proscia develops digital pathology software designed to help laboratories and life sciences organizations manage and analyze pathology data.

H1B Sponsorship

Proscia has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2024 (1)
2023 (1)
2022 (6)
2021 (1)
2020 (1)

Funding

Current Stage
Late Stage
Total Funding
$129.84M
Key Investors
Alpha Intelligence Capital,Insight Partners,Triangle Peak PartnersTriangle Peak PartnersScale Venture Partners
2025-03-19Series D· $50M
2024-01-11Series C· $9M
2022-06-03Series C· $36.62M

Leadership Team

leader-logo
David West
Co-Founder, CEO
linkedin
leader-logo
Coleman Stavish
Co-founder & CTO
linkedin
Company data provided by crunchbase