Sr. Backend Software Engineer (Data/ETL) jobs in United States
cer-icon
Apply on Employer Site
company-logo

Respondology · 15 hours ago

Sr. Backend Software Engineer (Data/ETL)

Respondology is the leader in AI-powered social comment moderation, expanding its technology to enhance social marketing for brands. They are seeking a Backend Software Engineer with deep data engineering expertise to build and scale ingestion pipelines and knowledge base infrastructure, focusing on architecting high-throughput data systems that power their AI-driven product suite.

AdvertisingSocial Media ManagementSpam Filtering
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Design and build data ingestion pipelines integrating with dozens of external sources (social platforms, third-party APIs, web scrapers)
Architect ETL workflows using Airflow and Kafka to process high-volume data streams (millions of records daily)
Collaborate with AI/ML engineers to ensure data quality and availability for Agentic RAG pipelines
Build and optimize multi-store data architecture: vector databases (Pinecone), relational databases (Postgres), and search engines (OpenSearch)
Develop and maintain integrations with social media platforms (Meta, LinkedIn, TikTok, X/Twitter) handling webhook ingestion and API polling
Optimize data freshness, throughput, and reliability across distributed systems
Participate in code reviews and contribute to backend services (FastAPI/Python, occasional Ruby on Rails)

Qualification

PythonETL workflowsData ingestion pipelinesWorkflow orchestrationKafkaData storage technologiesAPI designAWS data servicesFastAPIWeb scrapingData quality monitoringTeam collaboration

Required

Bachelor's degree in Computer Science or related degree; or equivalent work experience
Minimum 5 years of professional software engineering experience
Minimum 4 years proven experience building data pipelines and ETL workflows in Python
Minimum 3 years experience with workflow orchestration tools (Airflow, Dagster, Prefect, or similar)
Minimum 3 years working with multiple data storage technologies (relational, vector, search engines)
Experience with message queues and event streaming (Kafka, RabbitMQ, SQS/SNS)
Proven experience building high-throughput, fault-tolerant systems (we process 100s of millions of comments per year)
Strong understanding of API design, rate limiting, and webhook handling
Experience with data quality monitoring and observability
Demonstrated ability to take ownership of projects, prioritize tasks, and deliver high-quality results independently

Preferred

Experience building and/or working with RAG pipelines to provide context to cutting edge AI Agents
Significant experience with AWS data services (Kinesis, S3, RDS, Redshift, SQS/SNS)
Experience with vector databases (Pinecone, Weaviate, Qdrant) or embedding-based retrieval systems
Deep knowledge of OpenSearch/Elasticsearch for large-scale search and analytics
Experience with social media platform APIs and webhook integrations
Familiarity with modern ML/AI workflows and serving infrastructure
Experience with infrastructure as code (Terraform, CloudFormation)
Experience with APM and data pipeline monitoring (we use DataDog)
Background in web scraping at scale with anti-bot mitigation strategies
Experience working at a rapidly-growing tech startup
FastAPI and Pydantic expertise

Benefits

Equity is included for all employees
Twice yearly off-sites to enjoy time together as a team
Flex PTO plan, generous holidays and off-week between Christmas and New Years
Multiple healthcare options, including plans with FSA and HSA
Matching traditional and Roth 401k—immediately vested
Family and paternity leave
Life Insurance

Company

Respondology

twittertwittertwitter
company-logo
Respondology develops a customizable comment moderation tool for businesses or anyone with a large online audience.

H1B Sponsorship

Respondology has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1)

Funding

Current Stage
Early Stage
Total Funding
$16M
2025-04-23Series A· $5M
2023-04-06Series A· $11M
2023-04-06Debt Financing

Leadership Team

leader-logo
Erik Swain
CEO & Co-Founder
linkedin
leader-logo
Aaron Benor
Co-Founder, Chief Strategic Partnership Officer @Respondology: Comment tools for social media
linkedin
Company data provided by crunchbase