Proscia · 1 day ago
Senior Data Engineer (GenAI & LLM Infrastructure)
Proscia is a company focused on transforming pathology through digitization and AI to improve cancer treatment. They are seeking a Senior Data Engineer to build and scale data and AI systems that enhance outcomes for cancer patients and support research in therapies and drug regimens.
BiotechnologyArtificial Intelligence (AI)HealthcareLife ScienceMedical
Responsibilities
Build and deploy LLM-enabled data products/workflows that turn structured + unstructured inputs into curated, research-ready outputs
Develop and refine data pipelines and warehouse layers (raw → curated → marts) to support both analytics and AI workflows
Implement LLMOps/MLOps foundations: evaluation, versioning, monitoring/observability, and safe release processes for model/prompt changes
Deliver traceable and reproducible outputs (evidence references, run metadata, input/version tracking) so results can be explained and debugged
Identify and implement process improvements—automation, reliability controls, and quality checks—to accelerate delivery and reduce manual effort
Collaborate with core engineering, AI, and RWD stakeholders to align technical strategy and integrate solutions into the broader Proscia platform
Qualification
Required
Strong experience building production systems in Python
Demonstrated experience delivering GenAI/LLM solutions into production (beyond experimentation), such as structured extraction pipelines, retrieval/embedding-based systems, or LLM-powered analytics workflows
Experience owning the LLM lifecycle in production: prompt/model versioning, evaluation/regression testing, monitoring, and controlled releases
Experience shipping an LLM-enabled workflow end-to-end (design → build → deploy → operate)
Experience building systems where outputs are testable, traceable, and reproducible (evidence references, versioning, run logs)
A pragmatic approach to reliability—handling ambiguity, conflicts, and change without breaking downstream analytics
Solid fundamentals in SQL, data modeling, and data warehouse patterns; experience with Snowflake or similar platforms
Software engineering practices: unit/integration testing, CI/CD, and containerization (Docker; Kubernetes)
Experience with cloud platforms (AWS preferred)
Comfort selecting and integrating the right tools to build, evaluate, deploy, and operate LLM workflows in production (we're tooling-agnostic and prioritize end-to-end delivery over specific frameworks)
The ability to work independently, move quickly, and collaborate effectively across teams
Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related field
Preferred
Master's preferred
Experience in life sciences/biopharma is a plus (domain experience is helpful, but not required)
Benefits
Competitive pay
Comprehensive benefits
Flexible schedules
Insurance options
Company
Proscia
Proscia develops digital pathology software designed to help laboratories and life sciences organizations manage and analyze pathology data.
H1B Sponsorship
Proscia has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2024 (1)
2023 (1)
2022 (6)
2021 (1)
2020 (1)
Funding
Current Stage
Late StageTotal Funding
$129.84MKey Investors
Alpha Intelligence Capital,Insight Partners,Triangle Peak PartnersTriangle Peak PartnersScale Venture Partners
2025-03-19Series D· $50M
2024-01-11Series C· $9M
2022-06-03Series C· $36.62M
Recent News
2025-12-09
Company data provided by crunchbase