
Signature IT World Inc · 12 hours ago

SRE Cloud Platform Architect

Signature IT World Inc is seeking a highly skilled SRE Cloud Platform Architect to design, automate, and operate scalable cloud infrastructure across AWS and Azure. The role focuses on reliability, performance, and cost efficiency, building secure delivery pipelines and robust observability for mission-critical services.

Information Technology & Services
H1B Sponsor Likely
Hiring Manager
Priya Chauhan

Responsibilities

Architect, deploy, and manage high‑availability, scalable, and secure cloud infrastructure across AWS, Azure, and hybrid environments
Implement infrastructure-as-code (IaC) using Terraform, Ansible, and similar tools to ensure consistent, version-controlled, and fully automated environment provisioning
Design and manage Kubernetes/OpenShift clusters, including node management, autoscaling, ingress/routing, RBAC, quotas, and security policies
Optimize cloud resources through right‑sizing, workload tuning, and cost‑governance practices
Build, enhance, and maintain CI/CD pipelines using Jenkins, GitLab CI/CD, GitHub Actions, or similar tools to support automated build, test, and deployment workflows
Implement blue‑green, rolling, and canary deployment strategies to ensure zero‑downtime releases for mission‑critical applications
Integrate automated testing frameworks, code-quality gates, and security scans into the pipeline to ensure compliance and reliability
Containerize applications using Docker and deploy them via Kubernetes/Helm/OpenShift for scalable, resilient microservice environments
Improve service reliability through resource tuning, autoscaling (HPA/VPA), service mesh patterns, and optimized workload distribution
Deploy and maintain observability pipelines using Prometheus, Grafana, Splunk, Datadog, ELK/EFK, or similar tools to provide deep visibility into system health
Build dashboards and alerts for proactive issue detection, significantly reducing MTTR through automation and intelligent triage
Conduct root‑cause analysis, capacity planning, and performance optimization across distributed systems
Implement cloud security practices including IAM/RBAC, key management, secrets rotation, network policies, and encryption in transit/at rest
Automate compliance checks for SOC2, HIPAA, PCI‑DSS, or internal governance frameworks through policy-as-code and CI/CD integration
Ensure secure container images, infrastructure baselines, audit trails, and vulnerability scanning across environments
Design and optimize VPC/VNet architectures, load balancers, DNS, ingress/egress routing, firewall rules, and hybrid connectivity
Implement resilient traffic strategies such as multi‑region failover, geo‑redundancy, and fault‑tolerant service routing
Develop automation scripts with Python, Shell, Groovy, or PowerShell to eliminate manual tasks, reduce operational toil, and speed up deployments
Build internal tools, templates, and reusable modules to standardize and accelerate infrastructure provisioning
Collaborate closely with development, QA, and architecture teams to streamline release workflows and improve platform reliability
Implement and maintain SLOs, SLIs, and SLAs, ensuring service reliability and performance targets are consistently met
Drive continuous improvement through chaos engineering, disaster recovery planning, resilience testing, and failover simulations
Conduct periodic system reviews, implement performance enhancements, and proactively mitigate production risks

Qualifications

Kubernetes, Terraform, AWS, Azure, CI/CD, OpenShift, Jenkins, Docker, Prometheus, Grafana, Python, Shell scripting, GitHub, GitLab, GCP, Prompt Engineering

Required

Kubernetes
OpenShift
Terraform
AWS
Azure
CI/CD
Jenkins
GitHub
GitLab
Docker
Prometheus
Grafana
Python
Shell scripting
8–10+ years of hands‑on experience in DevOps, Site Reliability Engineering, Cloud Engineering, or Platform Engineering roles
Strong expertise with Kubernetes or OpenShift platforms, including cluster operations, workload orchestration, security policies, ingress, autoscaling, and production-grade deployments
Proven experience with Infrastructure as Code (IaC) using tools such as Terraform, Ansible, and related automation/configuration technologies
Hands-on proficiency building and maintaining CI/CD pipelines using Jenkins, GitLab CI/CD, GitHub Actions, or similar enterprise pipelines
Strong experience with AWS and/or Azure cloud services, including networking, compute, IAM/RBAC, load balancing, storage, and secrets management
Demonstrated background in containerization using Docker and Kubernetes with knowledge of Helm, YAML, and modern deployment strategies
Solid scripting ability in Python, Shell, Groovy, or PowerShell for automation, tooling, and workflow optimization
Deep understanding of monitoring, logging, and observability using Prometheus, Grafana, Splunk, Datadog, ELK/EFK, or similar stacks
Strong foundation in networking concepts—TCP/IP, DNS, SSL, HTTP, routing, and firewall policies
Experience implementing high availability, resilience, disaster recovery, and failover strategies in distributed systems
Knowledge of cloud security, compliance frameworks, vulnerability scanning, and policy enforcement

Preferred

GCP
Prompt Engineering

Company

Signature IT World Inc

Our mission is to provide our customers with services and solutions that optimize information systems throughout their company, and to provide the best possible service to our employees.

H1B Sponsorship

Signature IT World Inc has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
[Figure: Distribution of Different Job Fields Receiving Sponsorship]
Trends of Total Sponsorships
2025 (11)
2024 (5)
2023 (5)
2022 (21)
2021 (14)

Funding

Current Stage: Growth Stage
Company data provided by Crunchbase