ByteDance · 2 weeks ago
Senior Data Scientist, Model Risk & Data Analytics, Internal Audit - AMS
ByteDance is a global technology company known for products including TikTok. It is seeking a Senior Data Scientist to strengthen its Internal Audit function by building data products that support continuous auditing and risk identification through advanced data analytics and machine learning.
Content · Data Mining · Foundational AI · Internet · Social Media
Responsibilities
Model Evaluation & Audit Frameworks: Conduct audits across the model lifecycle, from training through deployment and monitoring, ensuring compliance with quality, performance, fairness, and risk-management standards
Risk Identification & Mitigation: Identify model vulnerabilities, including bias, fairness violations, harmful hallucinations, and security risks, and recommend remediation strategies
Measurement Metrics & Statistical Validation: Define and assess model performance metrics (accuracy, precision/recall, F1, calibration, robustness, fairness metrics); measure hallucination rates in LLMs; quantify bias and fairness; and perform confidence scoring and stability analyses
Communication & Collaboration: Develop and maintain collaborative working relationships with stakeholders, including data partners and owners across different business verticals. Clearly communicate technical findings, risk assessments, and recommendations to technical and non-technical stakeholders
Data Analytics Services: Partner with auditors to provide data support and guidance for audit engagements, including conducting interviews, observing systems and operations, developing queries and testing strategies, deploying data quality checks to ensure the completeness and accuracy of data sets, and deriving insights
Data Warehousing: Develop and maintain data warehouses across different business verticals to efficiently support audit engagements; implement data quality checks for key data assets and continuously collaborate with data partners to maintain the completeness and accuracy of these assets
Automation and Self-Service Analytics: Partner with auditors to identify and analyze key risk indicators; contribute to a continuous auditing data strategy that translates into use cases and data solutions that automate the evaluation of control design and effectiveness; build and maintain ETL data pipelines and dashboards to support these solutions
AI-Driven Automation and Insights: Leverage machine learning and AI to automate business and audit processes, surface insights from unstructured and structured data, and extend the team’s ability to deliver actionable recommendations at scale. Develop, train, and implement proprietary machine learning and AI models to scale up audit testing insights
Professional Development: Continue to develop and expand knowledge in data analytics practices, machine learning, AI, and ByteDance products through continuous education. Provide data training to empower the audit team to derive insights
Qualifications
Required
Bachelor's degree in a quantitative discipline, such as Mathematics, Statistics, Computer Science, Financial Engineering, Operations Research, or Economics
Minimum of 5 years of professional experience in applied data science, machine learning engineering, or AI research, specifically working with LLMs and traditional ML models, and at least 5 years of practical data science or analytics experience in the technology sector, including but not limited to B2C SaaS, media tech, e-commerce, social media platforms, and fintech
Hands-on experience designing, deploying, and monitoring large-scale ML models, with a thorough understanding of lifecycle risks and controls; strong proficiency in SQL and Python (including libraries such as Hugging Face Transformers, TensorFlow, PyTorch, and scikit-learn), data analysis tools, and ML pipeline orchestration platforms
Expertise in defining and assessing model performance metrics (accuracy, precision/recall, F1, calibration, robustness, fairness metrics), measuring hallucination rates in LLMs, quantifying bias and fairness, and performing confidence scoring and stability analyses
Extensive knowledge of transformer-based LLM architectures (e.g., GPT, BERT, T5, PaLM) and classical ML algorithms (e.g., regression, tree-based methods, neural networks)
Working knowledge of classical ML algorithms and LLM architectures, deep technical expertise in LLMs and traditional ML, and a proven track record of supporting or performing AI/ML model audits or evaluations within a corporate, regulatory, or advisory context
High proficiency in Mandarin (speaking, writing, reading, and listening) is required for system use, technical documents, and frequent communication with Chinese stakeholders
Preferred
PhD in a quantitative discipline, such as Mathematics, Statistics, Computer Science, Financial Engineering, Operations Research, or Economics
Proficiency in frameworks for auditing models, including criteria such as robustness, fairness, interpretability, alignment, and compliance. Familiarity with emerging LLM auditing methodologies such as LLMAuditor (probe generation/answering cycles, human-in-the-loop assessments)
Ability to analyze model design, training methods, data pipelines, and inference behaviors
Ability to identify model vulnerabilities, including bias, fairness violations, harmful hallucinations, and security risks, and to recommend remediation strategies
Experience building and maintaining data analytics solutions for continuous audit programs, including automating common analyses and recurring checks; ability to clearly communicate technical findings, risk assessments, and recommendations to technical and non-technical stakeholders
Experience with data integration, ETL processes, and large-scale data processing systems; working knowledge of cloud-based infrastructure such as AWS, GCP, Azure, or Snowflake and of large-scale data processing frameworks such as Hadoop, Flink, and MapReduce; and a good understanding of data warehouse and data modeling principles
Front-end and back-end software development skills
Benefits
Medical, dental, and vision insurance
401(k) savings plan with company match
Paid parental leave
Short-term and long-term disability coverage
Life insurance
Wellbeing benefits
10 paid holidays per year
10 paid sick days per year
17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)
Company
ByteDance
ByteDance is a technology company that develops content creation platforms and services.
H1B Sponsorship
ByteDance has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
[Chart: Distribution of Different Job Fields Receiving Sponsorship, with the job field similar to this job highlighted]
Trends of Total Sponsorships
2025 (1350)
2024 (1123)
2023 (775)
2022 (487)
2021 (417)
2020 (245)
Funding
Current Stage: Late Stage
Total Funding: $9.8B
Key Investors: Capital Today, G42, Tiger Global Management
2025-11-20 · Secondary Market · $300M
2024-07-25 · Secondary Market
2023-03-14 · Secondary Market · $100M
Company data provided by Crunchbase