SRE Architect - Hybrid jobs in United States
cer-icon
Apply on Employer Site
company-logo

Perficient · 15 hours ago

SRE Architect - Hybrid

Perficient is a global AI-first consultancy that partners with innovative enterprises to deliver transformative results. The SRE Architect will be responsible for instrumenting mission-critical systems, building dashboards, and ensuring end-to-end telemetry, enabling teams to operate with confidence.

ConsultingInformation TechnologyInternetMarketing
check
H1B Sponsor Likelynote

Responsibilities

Define reference architectures for high availability, disaster recovery (DR), multi‑region/zone deployments, and fault tolerance
Architect end‑to‑end observability: logs, metrics, traces, profiling, and actionable alerting; standardize telemetry schemas and dashboards
Establish SLOs/SLIs and error budget policies across services; align reliability goals to customer experience and business KPIs
Lead reliability roadmaps, standards, and guardrails (operational readiness reviews, production readiness checklists)
Implement runbooks, playbooks, and automated diagnostics; enforce alert hygiene (signal/noise, on‑call ergonomics)
Define incident response patterns (SEV classification, comms, postmortems, learning reviews) and statistical analysis of incident trends
Own instrumentation across services and UIs: back‑ends and Angular front‑end apps, ensuring high‑quality traces, metrics, and logs with context propagation
Instrument gateways and mesh: AWS API Gateway, Kong API Gateway, and Kong Mesh to capture request/response telemetry, service health, and mesh traffic
Integrate identity flows: ForgeRock (Access Management / Identity) telemetry for authn/authz journeys and error paths
Automate pipelines: Implement observability steps in GitHub Actions (build, test, deploy) to validate telemetry in CI/CD and block non‑compliant releases
Tooling: Leverage Dynatrace OneAgent or OpenTelemetry (enterprise production experience required) to collect signals, normalize, and ship to approved backends
Dashboards & golden signals: Create durable dashboards oriented to the four golden signals—latency, traffic, errors, saturation—plus service availability, dependency health, and user experience
Gap analysis & backlog creation: Identify gaps (e.g., missing SLIs, low trace coverage, noisy logs), write actionable project tasks / stories, and partner with dev, SRE, and security to drive closure
Reliability practices: Define/maintain SLIs/SLOs, alert thresholds, runbooks, and error budgets; collaborate on incident reviews and trend analysis
Enablement: Coach product teams on instrumentation patterns, standards, and SDK usage to scale observability autonomously

Qualification

Dynatrace OneAgentOpenTelemetryAWS API GatewayCI/CD with GitHub ActionsAngular front-end applicationsDistributed systemsSLI/SLO designKong APISoft skills

Required

Enterprise production experience with Dynatrace OneAgent or OpenTelemetry (services and user interfaces)
Proven ability to instrument .NET services and Angular front‑end applications (RUM, distributed tracing, log correlation)
Hands‑on with AWS API Gateway, Kong API, and Kong Mesh (metrics, traces, health checks, and policy events)
Experience instrumenting ForgeRock flows and integrating identity telemetry
CI/CD proficiency with GitHub Actions—adding quality gates for telemetry, linting configs, and secrets management
Strong grasp of golden signals, SLI/SLO design, and practical alerting (avoiding alert fatigue)
Ability to work autonomously and drive cross‑functional change through clear documentation and backlog tasks
Solid understanding of distributed systems, HTTP, and cloud networking fundamentals

Preferred

Experience with Kubernetes, Prometheus, Grafana, AWS CloudWatch/X‑Ray, and log stacks (e.g., OpenTelemetry Collector, OTLP)
Knowledge of service mesh telemetry specifics (sidecar proxies, mTLS, traffic policies)
IaC tooling (e.g., Terraform) for observability resources and dashboards as code
Prior work with digital identity and ForgeRock admin/SDKs
Familiarity with Dynatrace (dashboards, Davis AI, topology modeling) or vendor‑neutral patterns to avoid lock‑in
Demonstrated ability to leverage AI tools to enhance productivity, streamline workflows, and support data-informed task execution
Familiarity with AI-enhanced platforms is a plus
A solid understanding of AI capabilities and limitations including ethical considerations is expected

Company

Perficient

company-logo
Perficient is an IT consulting firm that helps companies adapt their systems to third party software products that use internet.

H1B Sponsorship

Perficient has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (154)
2024 (147)
2023 (156)
2022 (196)
2021 (175)
2020 (223)

Funding

Current Stage
Public Company
Total Funding
$65.3M
2024-05-05Acquired
2024-01-17Post Ipo Equity· $4.44M
2022-10-17Post Ipo Equity· $5.18M

Leadership Team

leader-logo
Ed Hofffman
Senior Vice President, North America Business Operations
linkedin
leader-logo
Hari Madamalla
Vice President
linkedin
Company data provided by crunchbase