Data Scientist jobs in United States
cer-icon
Apply on Employer Site
company-logo

Steppingblocks · 5 hours ago

Data Scientist

Steppingblocks is transforming how individuals navigate their career and educational journeys. They are seeking a highly disciplined, impact-driven Senior Data Scientist to lead the development of advanced models and data assets that power analytics and insights.

AnalyticsArtificial Intelligence (AI)Big DataHigher EducationPredictive Analytics
Hiring Manager
Nicolette Kern
linkedin

Responsibilities

Lead the development of machine learning models such as salary predictors, demographic estimators, and classification systems using internal and external data sources
Apply ML, NLP, and LLM based approaches to build data matching, entity recognition, and enrichment capabilities
Fine tune, monitor, and maintain existing models to improve performance, transparency, and scalability
Prototype and productionize new features and attributes for research, commercialization, and product delivery
Partner with Product to identify customer pain points and translate them into impactful data science solutions
Design and implement automated processes that scale complex internal and external data products
Write scalable, maintainable, and well documented Python code following best practices
Use AWS S3, Snowflake, Coiled with Dask, and related tools to build reproducible pipelines and manage large datasets
Evaluate and introduce emerging AI and ML tools to improve data workflows, experimentation speed, and innovation velocity
Adhere to Steppingblocks documentation standards, versioning practices, and model governance protocols
Conduct and document model evaluations, data profiling, and integrity checks
Collaborate with data engineering and QA teams to ensure new attributes meet production readiness standards and privacy expectations
Contribute to and help maintain a growing taxonomy of features, transformations, and business rules
Support custom research initiatives, audits, and prototypes with the Innovation and Business Analytics teams
Mentor junior data scientists or analysts on modeling approaches, experimentation, and best practices
Participate in stakeholder discussions, roadmap planning, and partner support as needed

Qualification

Machine LearningPythonNLPSnowflakeAWS S3Data ManagementClassification TechniquesLarge Scale Data ManipulationCommunication SkillsTeam CollaborationMentoringDocumentation

Required

5 or more years of experience in data science roles building and deploying machine learning models
Advanced proficiency in Python and libraries such as scikit learn, pandas, NumPy, spaCy, transformers, Splink, and related tools
Hands on experience with NLP, classification techniques, and LLM fine tuning or adaptation
Demonstrated ability to build models that solve real business problems and drive adoption
Experience working with AWS services including S3 and EC2, Snowflake, Coiled or Dask, and SQL based data systems
Ability to work comfortably with flexible data formats including text, CSV, JSON, and Parquet
Strong understanding of taxonomy design, matching algorithms, and model governance concepts
Strong interest in AI and emerging applied machine learning techniques
Excellent written and verbal communication skills, with experience presenting complex technical concepts to business leaders and executives
Strong work ethic and a commitment to delivering high quality, production ready results

Preferred

Experience in labor market analytics or demographic modeling using large structured and unstructured datasets
Master's or PhD in Data Science, Statistics, Computer Science, or a related quantitative field
Familiarity with data privacy principles such as CCPA and FERPA and ethical AI practices
Experience with version control, model monitoring frameworks, and technical writing
Experience working with data fabric architectures and AI optimized pipelines

Company

Steppingblocks

twittertwittertwitter
company-logo
Steppingblocks is a data analytics platform that uses big data to help users make intelligent decisions.

Funding

Current Stage
Early Stage
Total Funding
$2.22M
Key Investors
National Science Foundation
2019-04-17Grant· $0.75M
2018-01-19Grant· $0.23M
2017-07-05Series Unknown· $1.25M

Leadership Team

leader-logo
Carlo Martinez
Co-Founder and CIO
linkedin
leader-logo
Cameron Gibson
Partner Account Executive
linkedin
Company data provided by crunchbase