Respondology · 15 hours ago
Sr. Backend Software Engineer (Data/ETL)
Respondology is the leader in AI-powered social comment moderation, expanding its technology to enhance social marketing for brands. They are seeking a Backend Software Engineer with deep data engineering expertise to build and scale ingestion pipelines and knowledge base infrastructure, focusing on architecting high-throughput data systems that power their AI-driven product suite.
AdvertisingSocial Media ManagementSpam Filtering
Responsibilities
Design and build data ingestion pipelines integrating with dozens of external sources (social platforms, third-party APIs, web scrapers)
Architect ETL workflows using Airflow and Kafka to process high-volume data streams (millions of records daily)
Collaborate with AI/ML engineers to ensure data quality and availability for Agentic RAG pipelines
Build and optimize multi-store data architecture: vector databases (Pinecone), relational databases (Postgres), and search engines (OpenSearch)
Develop and maintain integrations with social media platforms (Meta, LinkedIn, TikTok, X/Twitter) handling webhook ingestion and API polling
Optimize data freshness, throughput, and reliability across distributed systems
Participate in code reviews and contribute to backend services (FastAPI/Python, occasional Ruby on Rails)
Qualification
Required
Bachelor's degree in Computer Science or related degree; or equivalent work experience
Minimum 5 years of professional software engineering experience
Minimum 4 years proven experience building data pipelines and ETL workflows in Python
Minimum 3 years experience with workflow orchestration tools (Airflow, Dagster, Prefect, or similar)
Minimum 3 years working with multiple data storage technologies (relational, vector, search engines)
Experience with message queues and event streaming (Kafka, RabbitMQ, SQS/SNS)
Proven experience building high-throughput, fault-tolerant systems (we process 100s of millions of comments per year)
Strong understanding of API design, rate limiting, and webhook handling
Experience with data quality monitoring and observability
Demonstrated ability to take ownership of projects, prioritize tasks, and deliver high-quality results independently
Preferred
Experience building and/or working with RAG pipelines to provide context to cutting edge AI Agents
Significant experience with AWS data services (Kinesis, S3, RDS, Redshift, SQS/SNS)
Experience with vector databases (Pinecone, Weaviate, Qdrant) or embedding-based retrieval systems
Deep knowledge of OpenSearch/Elasticsearch for large-scale search and analytics
Experience with social media platform APIs and webhook integrations
Familiarity with modern ML/AI workflows and serving infrastructure
Experience with infrastructure as code (Terraform, CloudFormation)
Experience with APM and data pipeline monitoring (we use DataDog)
Background in web scraping at scale with anti-bot mitigation strategies
Experience working at a rapidly-growing tech startup
FastAPI and Pydantic expertise
Benefits
Equity is included for all employees
Twice yearly off-sites to enjoy time together as a team
Flex PTO plan, generous holidays and off-week between Christmas and New Years
Multiple healthcare options, including plans with FSA and HSA
Matching traditional and Roth 401k—immediately vested
Family and paternity leave
Life Insurance
Company
Respondology
Respondology develops a customizable comment moderation tool for businesses or anyone with a large online audience.
H1B Sponsorship
Respondology has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)
Funding
Current Stage
Early StageTotal Funding
$16M2025-04-23Series A· $5M
2023-04-06Series A· $11M
2023-04-06Debt Financing
Leadership Team
Recent News
2025-04-27
2025-04-26
thesaasnews.com
2025-04-26
Company data provided by crunchbase