Senior Software Engineer - Reliability - Artificial Intelligence jobs in United States

Bloomberg · 11 hours ago

Senior Software Engineer - Reliability - Artificial Intelligence

Bloomberg is a leading financial technology company that invests in AI to enhance search, discovery, and workflow solutions. They are seeking a Senior Software Engineer for their AI Resilience & Insights team, responsible for improving reliability metrics and incident response for AI-driven products.

Analytics, Business Information Systems, Financial Services, Information Services, News
Comp. & Benefits
H1B Sponsor Likely

Responsibilities

Define how we measure reliability for key AI user experiences, and roll that measurement out with service owners
Instrument the generative AI-powered conversational agent with real user monitoring and client error tracking so we can see failures the way clients do
Improve alert quality so alerts are actionable and tied to client impact
Standardize incident response practices across ENG AI (runbooks, readiness checks, post-incident learning)
Build dashboards that connect user impact to the underlying drivers, giving teams a clear view of what matters
Strengthen resilience around upstream dependencies, including external model providers, using pragmatic controls like timeouts, retries, and fallbacks
Participate in a secondary on-call rotation after ramp, focused on strengthening systems through automation and engineering

Qualifications

Python, Go, Distributed systems, Incident response, Automation, Observability, Judgment, Client telemetry, Real user monitoring, Collaboration skills, Reliability interest

Required

Strong software engineering skills in Python and/or Go, with experience building production systems and automation
Ability to debug distributed systems and improve reliability through instrumentation and engineering
Familiarity with observability, incident response, and building tools that reduce toil
Strong collaboration skills and good judgment to balance 'push standards' vs 'enable teams.'
5+ years of relevant engineering experience

Preferred

Experience with Grafana, OpenTelemetry, Kubernetes, and Infrastructure-as-Code
Experience working with client telemetry or real user monitoring
Exposure to external AI/LLM providers and building resilient integrations
Interest in reliability for agent/tool systems and multi-step AI workflows

Benefits

Merit increases
Incentive compensation (exempt roles only)
Paid holidays
Paid time off
Medical
Dental
Vision
Short and long term disability benefits
401(k) + match
Life insurance
Various wellness programs

Company

Bloomberg

Bloomberg provides news, data, analytics, and communication services for the global business and financial world.

H1B Sponsorship

Bloomberg has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (496)
2024 (382)
2023 (363)
2022 (426)
2021 (442)
2020 (588)

Funding

Current Stage
Late Stage

Leadership Team

David Rosenberg
Head of Machine Learning Strategy, CTO Office
Nabil Bitar
CTO - Head of Network Architecture

Recent News

AI-powered learning ecosystems: A guide to workforce upskilling | CIO
Company data provided by Crunchbase