Scribd, Inc. · 23 hours ago
Engineering Manager, ML/Data Engineering (Content Trust)
Scribd Inc. is dedicated to sparking human curiosity through its suite of products aimed at democratizing the exchange of ideas. The Engineering Manager for ML/Data Engineering will lead a team focused on building scalable ML-driven data pipelines to ensure content safety and trustworthiness, impacting millions of users.
AudiobooksBooksEBooksFile SharingNewsPodcastPublishing
Responsibilities
Lead and grow a high-performing engineering team: Manage, mentor, and recruit a world-class team of data and ML engineers. Foster a culture of technical excellence, operational rigor, and deep empathy for the user safety mission
Architect scalable ML data pipelines: Design and oversee the development of distributed data processing systems capable of handling hundreds of millions of documents. Ensure these pipelines support both batch and real-time inference for content moderation and risk detection
Build the "Trust" scores: Develop and maintain the foundational data layers - including semantic embeddings, metadata extracts, and behavioral signals - that power our Content Trust ML models
Partner on AI/LLM Integration: Work closely with the Search & Discovery and Applied Research teams to integrate ML/LLM-based reasoning into our trust pipelines, enabling more nuanced understanding of complex policy violations
Drive Operational Excellence: Establish SLAs for infrastructure, ensuring our automated enforcement systems are both fast and explainable
Cross-functional Leadership: Collaborate with Product Managers (Content Trust), Legal/Policy teams, and Data Science to translate evolving regulatory requirements (like the DSA) into robust technical architectures
Qualification
Required
8+ years of total engineering experience, with 3+ years specifically in a people management or technical lead role within a Data or ML Engineering organization
Proven track record of building and operating production-grade data pipelines at massive scale (100M+ entities) using technologies like Spark, Flink, Kafka, or Airflow
Deep understanding of the ML lifecycle, including feature engineering, model deployment (MLOps), and vector databases (e.g., Pinecone, Milvus, or Weaviate)
Prior experience building systems for content moderation, fraud detection, spam prevention, or digital rights management
Strong proficiency in Python, Scala, or Go, and experience with cloud-native infrastructure (AWS/GCP, Kubernetes, and Snowflake/BigQuery)
Ability to explain complex architectural trade-offs to non-technical stakeholders in Legal, Policy, and Product
Preferred
Experience building RAG (Retrieval-Augmented Generation) pipelines or managing the data infra for fine-tuning Large Language Models
Background working with large-scale User Generated Content (UGC) ecosystems and the unique challenges of unstructured document data
Familiarity with the technical requirements of global safety regulations such as the Digital Services Act (DSA) or the UK Online Safety Act
Experience building systems that must defend against malicious actors and evolving platform abuse patterns
Benefits
Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
12 weeks paid parental leave
Short-term/long-term disability plans
401k/RSP matching
Onboarding stipend for home office peripherals + accessories
Learning & Development allowance
Learning & Development programs
Quarterly stipend for Wellness, WiFi, etc.
Mental Health support & resources
Free subscription to the Scribd Inc. suite of products
Referral Bonuses
Book Benefit
Sabbaticals
Company-wide events
Team engagement budgets
Vacation & Personal Days
Paid Holidays (+ winter break)
Flexible Sick Time
Volunteer Day
Company-wide Employee Resource Groups and programs that foster an inclusive and diverse workplace.
Access to AI Tools: We provide free access to best-in-class AI tools, empowering you to boost productivity, streamline workflows, and accelerate bold innovation.
Company
Scribd, Inc.
We're on a mission to spark human curiosity.
H1B Sponsorship
Scribd, Inc. has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (5)
2024 (2)
2023 (8)
2022 (3)
2021 (12)
2020 (15)
Funding
Current Stage
Late StageTotal Funding
$106.75MKey Investors
Spectrum EquityKhosla VenturesCRV
2019-11-25Series E· $58M
2015-01-02Series D· $23M
2011-01-18Series C· $12M
Recent News
2026-01-05
TechCrunch
2025-12-11
Company data provided by crunchbase