PTC · 14 hours ago

Principal Site Reliability Engineer

Boston, MA, USA

Full-time

Hybrid

Senior Level, Lead/Staff

$113K/yr - $175K/yr

7+ years exp

PTC is a leading company transforming the physical and digital worlds. It is seeking a Principal Site Reliability Engineer to ensure the long-term reliability, scalability, and operational excellence of its platform by influencing system design and leading reliability initiatives across the organization.

Computer Software

H1B Sponsor Likely

Responsibilities

Lead design, implementation, and evolution of reliability, availability, and resiliency strategies for large‑scale distributed systems written primarily in Java

Apply deep experience operating complex, distributed systems to guide architectural decisions, reliability strategies, and long‑term system evolution

Identify systemic risks in application architecture, data flows, and infrastructure, and drive architectural improvements that measurably improve availability, performance, and scalability

Set and evolve reliability standards, best practices, and operational principles across R&D

Lead efforts to prevent, detect, and mitigate incidents through technical improvements and operational maturity

Serve as a senior coordination point during major incidents, helping manage response and guide long‑term remediation

Champion blameless post-incident reviews and ensure learnings translate into durable system improvements

Apply advanced software engineering practices to eliminate manual work, reduce operational load, and improve system observability

Design and build internal platforms, automation, and tooling that support Java‑based services and their operational needs

Raise the bar on monitoring, alerting, and SLO/SLI adoption across systems

Partner deeply with product engineers, architects, and engineering leadership to ensure reliability and operability are first‑class concerns in system design

Review and influence designs for complex systems involving technologies such as datastores, messaging systems, and coordination services

Serve as a technical mentor and coach for SREs and other engineers, raising overall engineering and operational maturity

Contribute to longer‑term reliability and infrastructure strategy aligned with business growth

Stay current with industry trends in SRE, distributed systems, and the Java ecosystem, turning insights into practical improvements

Help define what “great reliability” looks like for the organization and how we measure it

Qualifications

Java, Distributed systems, Incident management, Observability, Cloud infrastructure, CI/CD pipelines, Performance engineering, Systems thinking, Curiosity, Leadership, Communication

Required

Ability to commute to the Seaport Boston office 2-3 days a week

7+ years of experience in software engineering, site reliability engineering, or systems engineering roles

Extremely strong proficiency with the Java programming language and its ecosystem, including building, debugging, and operating production Java services

Deep experience operating complex, distributed systems in production environments

Strong software engineering background, with a track record of delivering high-quality, maintainable code

Expert understanding of incident management, service reliability, and performance engineering

Strong hands-on experience with observability (metrics, logs, traces), capacity planning, and SLO-driven reliability

Deep familiarity with modern cloud-based infrastructure, CI/CD pipelines, and infrastructure-as-code practices

Ability to reason about failure modes across application, data, and infrastructure layers

Demonstrated ability to lead complex initiatives that span teams and organizational boundaries

Comfortable making high-impact technical decisions in ambiguous environments

Strong communicator who can influence design and operational decisions across a wide range of stakeholders

Systems thinker focused on root-cause analysis and durable fixes

Calm and effective under pressure, especially during high-severity incidents

Curious, data-driven, and committed to continuous improvement

Preferred

Experience operating or supporting systems using technologies such as MongoDB, ZooKeeper, and RabbitMQ

Background in performance tuning and scalability optimization of Java services

Experience setting or influencing engineering standards at the organization level

Prior involvement in evolving SRE or platform practices in a growing engineering organization

Experience designing, operating, or scaling systems in cloud environments, preferably AWS, including familiarity with core services, networking models, and reliability features

Benefits

Performance-based bonus

Employee share purchase program (ESPP)

Medical, dental and vision insurance

Paid time off and sick leave

Tuition reimbursement

401(k) contributions and employer match

Flexible spending accounts

Life insurance

Disability coverage

Generous commuter subsidy

Company

PTC

Glassdoor: 4.2

PTC (NASDAQ: PTC) unleashes industrial innovation with award-winning, market-proven solutions that enable companies to differentiate their products and services, improve operational excellence, and increase workforce productivity.

Founded in 1985

Boston, Massachusetts, US

5001-10000 employees

http://ptc.co/VLED30oHtEh

H1B Sponsorship

PTC has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data source: US Department of Labor)

Trends of Total Sponsorships

2025 (57)

2024 (61)

2023 (75)

2022 (86)

2021 (111)

2020 (72)

Funding

Current Stage

Late Stage

Leadership Team

Marcus Senior, PMP, CSM, MSP, and Lean Six-Sigma

Chief Executive Officer

Danny N. Poisson

TVP, Chief Technology Officer for Federal Aerospace & Defense

Recent News

EIN Presswire

Product Lifecycle Management Market to Surpass USD 56.0 Bn by 2035, Expanding at a CAGR of 7.3% | TMR

2025-10-06

eeNews Europe

PTC expands service lifecycle management AI for ServiceMax and Servigistics

2025-10-03

PR Newswire

PTC Delivers New Service Lifecycle Management AI Solutions to Modernize Field Service and the Service Supply Chain

2025-09-30

Company data provided by Crunchbase