Steppingblocks · 5 hours ago
Data Scientist
Steppingblocks is transforming how individuals navigate their career and educational journeys. They are seeking a highly disciplined, impact-driven Senior Data Scientist to lead the development of advanced models and data assets that power analytics and insights.
Responsibilities
Lead the development of machine learning models such as salary predictors, demographic estimators, and classification systems using internal and external data sources
Apply ML, NLP, and LLM based approaches to build data matching, entity recognition, and enrichment capabilities
Fine tune, monitor, and maintain existing models to improve performance, transparency, and scalability
Prototype and productionize new features and attributes for research, commercialization, and product delivery
Partner with Product to identify customer pain points and translate them into impactful data science solutions
Design and implement automated processes that scale complex internal and external data products
Write scalable, maintainable, and well documented Python code following best practices
Use AWS S3, Snowflake, Coiled with Dask, and related tools to build reproducible pipelines and manage large datasets
Evaluate and introduce emerging AI and ML tools to improve data workflows, experimentation speed, and innovation velocity
Adhere to Steppingblocks documentation standards, versioning practices, and model governance protocols
Conduct and document model evaluations, data profiling, and integrity checks
Collaborate with data engineering and QA teams to ensure new attributes meet production readiness standards and privacy expectations
Contribute to and help maintain a growing taxonomy of features, transformations, and business rules
Support custom research initiatives, audits, and prototypes with the Innovation and Business Analytics teams
Mentor junior data scientists or analysts on modeling approaches, experimentation, and best practices
Participate in stakeholder discussions, roadmap planning, and partner support as needed
Qualification
Required
5 or more years of experience in data science roles building and deploying machine learning models
Advanced proficiency in Python and libraries such as scikit learn, pandas, NumPy, spaCy, transformers, Splink, and related tools
Hands on experience with NLP, classification techniques, and LLM fine tuning or adaptation
Demonstrated ability to build models that solve real business problems and drive adoption
Experience working with AWS services including S3 and EC2, Snowflake, Coiled or Dask, and SQL based data systems
Ability to work comfortably with flexible data formats including text, CSV, JSON, and Parquet
Strong understanding of taxonomy design, matching algorithms, and model governance concepts
Strong interest in AI and emerging applied machine learning techniques
Excellent written and verbal communication skills, with experience presenting complex technical concepts to business leaders and executives
Strong work ethic and a commitment to delivering high quality, production ready results
Preferred
Experience in labor market analytics or demographic modeling using large structured and unstructured datasets
Master's or PhD in Data Science, Statistics, Computer Science, or a related quantitative field
Familiarity with data privacy principles such as CCPA and FERPA and ethical AI practices
Experience with version control, model monitoring frameworks, and technical writing
Experience working with data fabric architectures and AI optimized pipelines
Company
Steppingblocks
Steppingblocks is a data analytics platform that uses big data to help users make intelligent decisions.
Funding
Current Stage
Early StageTotal Funding
$2.22MKey Investors
National Science Foundation
2019-04-17Grant· $0.75M
2018-01-19Grant· $0.23M
2017-07-05Series Unknown· $1.25M
Recent News
Company data provided by crunchbase