Apply on Employer Site

Harrison Clarke · 23 hours ago

Cloud Engineer

San Francisco Bay Area

Full-time

Onsite

Senior Level

$250K/yr - $350K/yr

Harrison Clarke is seeking a Senior Infrastructure Engineer to design and scale the core systems behind a next-generation AI platform. This hands-on role involves creating the infrastructure layer for large-scale AI workloads while collaborating with applied ML teams to ensure reliable model serving.

ConsultingDevOpsHuman ResourcesInformation TechnologyStaffing Agency

Growth Opportunities

Hiring Manager

Tom Sebaduka

Responsibilities

Architecting and scaling infrastructure for low-latency, high-throughput AI inference

Managing GPU resources and multi-tenant workloads using Kubernetes and cloud-native tooling

Designing and operating core infrastructure components including infrastructure-as-code, container orchestration, monitoring, logging, and networking

Building platform-level capabilities such as authentication, rate limiting, telemetry, alerting, and system health monitoring

Owning infrastructure tradeoffs across performance, availability, and cost as usage scales

Working closely with machine learning engineers to productionize and optimize model serving pipelines

Establishing best practices and patterns for operating AI systems at scale in a fast-moving startup environment

Qualification

KubernetesInfrastructure-as-codeAI model servingCloud securityDistributed systemsPerformance tuningConcurrencyCachingCost optimizationCI/CD workflowsObservability stacksService meshGlobal traffic routingHigh-availability architectures

Required

Strong background in infrastructure, platform, or systems engineering

Deep experience operating Kubernetes-based systems at scale, including GPU scheduling and workload orchestration

Familiarity with service mesh, global traffic routing, and high-availability architectures

Fluency in infrastructure-as-code, CI/CD workflows, and modern observability stacks

Solid systems fundamentals: distributed systems, performance tuning, concurrency, caching, and cost optimization

Hands-on experience with model inference and serving technologies (e.g., Triton, ONNX Runtime, vLLM, TensorRT, or similar)

Good understanding of cloud security and data management in production environments

Comfort working in early-stage settings with high ownership, ambiguity, and rapid iteration

Company

Harrison Clarke

Harrison Clarke is the Leading Staffing & Recruiting Firm in XOps & Cybersecurity.

Founded in 2016

New York, New York, USA

11-50 employees

https://www.harrisonclarke.com/

Funding

Current Stage

Early Stage

Leadership Team

Firas Sozan

Founder & CEO

Company data provided by crunchbase