This job has closed.

Capgemini · 3 hours ago

Data Scientist

Capgemini is a global business and technology transformation partner, empowering employees to shape their careers. As an Associate Data Scientist, you will lead the development of advanced data engineering solutions to support Generative AI models and design scalable data architectures.

Consulting · Information Technology · InsurTech · IT Management · Software
H1B Sponsor Likely

Responsibilities

The Machine Learning Engineer will be responsible for architectural design and planning, advanced data pipelines, model integration and optimization, scalability, performance, and research and innovation in support of production generative AI systems
Deliver production-level ML workloads for customers on the Databricks platform, including end-to-end ML pipelines, training/inference optimization, integration with cloud-native services, and MLOps
Build and maintain data engineering solutions on cloud platforms using hyperscaler services
Develop production-grade cloud (AWS/Azure/GCP) infrastructure that supports the deployment of ML applications, including drift monitoring
Design, develop, and maintain data pipelines to efficiently collect, process, and load data from various sources into data storage systems (e.g., data warehouses, data lakes)
Understanding of indexing and vectorization to use with Generative AI prompt engineering
Strong understanding of fundamental data science concepts in NLP, including selection and understanding of embedding models
Use hyperscaler technologies to support data needs for expansion of Machine Learning/Data Science capabilities including generative AI
Design, develop, and implement scalable data pipelines and ETL/ELT processes using Python, PySpark and API integrations

Qualifications

Data engineering solutions · Generative AI models · Cloud platforms · Python · MLOps · NLP concepts · ETL/ELT processes · Collaboration · Problem-solving

Required

Lead the development and implementation of advanced data engineering solutions to support the deployment and optimization of Generative AI models
Leverage extensive experience to design robust, scalable, and innovative data architectures that align with the unique requirements of Generative AI (GenAI) applications
Architectural design and planning, advanced data pipelines, model integration and optimization, scalability, performance and research and innovation supporting production generative AI systems

Benefits

Paid time off based on employee grade (A-F), as defined by policy: vacation (12-25 days, depending on grade), company-paid holidays, personal days, and sick leave
Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
Life and disability insurance
Employee assistance programs
Other benefits as provided by local policy and eligibility

Company

Capgemini

Capgemini is a software company that provides consulting, technology, and digital transformation services.

H1B Sponsorship

Capgemini has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (2856)
2024 (3012)
2023 (3424)
2022 (4392)
2021 (3311)
2020 (5871)

Funding

Current Stage
Public Company
Total Funding
$4.72B
2025-09-18 · Post-IPO Debt · $4.72B
1999-04-01 · IPO

Leadership Team

Aiman Ezzat
CEO, Capgemini Group
Anirban Bose
CEO of Americas SBU | Member of the Group Executive Board
Company data provided by Crunchbase