Senior LLMOps Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

New York Global Consultants Inc. (NYGCI) · 12 hours ago

Senior LLMOps Engineer

New York Global Consultants Inc. (NYGCI) is seeking a Senior Consultant specializing in AI/ML platforms. The role involves deploying, managing, and troubleshooting containerized services at scale, as well as managing MLOps/LLMOps pipelines for production environments.

ConsultingInformation ServicesInformation TechnologyInfrastructureIT InfrastructureOutsourcing
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Experience deploying, managing, operating, and troubleshooting containerized services at scale on Kubernetes for mission-critical applications (OpenShift)
Experience with deploying, configuring, and tuning LLMs using TensorRT-LLM and Triton Inference server
Managing MLOps/LLMOps pipelines, using TensorRT-LLM and Triton Inference server to deploy inference services in production
Setup and operation of AI inference service monitoring for performance and availability
Experience deploying and troubleshooting LLM models on a containerized platform, monitoring, load balancing, etc
Operation and support of MLOps/LLMOps pipelines, using TensorRT-LLM and Triton Inference server to deploy inference services in production
Experience deploying and troubleshooting LLM models on a containerized platform, monitoring, load balancing, etc
Experience with standard processes for operation of a mission critical system – incident management, change management, event management, etc
Managing scalable infrastructure for deploying and managing LLMs
Deploying models in production environments, including containerization, microservices, and API design
Triton Inference Server, including its architecture, configuration, and deployment
Model Optimization techniques using Triton with TRTLLM
Model optimization techniques, including pruning, quantization, and knowledge distillation

Qualification

KubernetesTensorRT-LLMTriton Inference ServerMLOps/LLMOps pipelinesModel Optimization techniquesContainerizationMicroservicesAPI designIncident managementChange managementEvent management

Required

Experience deploying, managing, operating, and troubleshooting containerized services at scale on Kubernetes for mission-critical applications (OpenShift)
Experience with deploying, configuring, and tuning LLMs using TensorRT-LLM and Triton Inference server
Managing MLOps/LLMOps pipelines, using TensorRT-LLM and Triton Inference server to deploy inference services in production
Setup and operation of AI inference service monitoring for performance and availability
Experience deploying and troubleshooting LLM models on a containerized platform, monitoring, load balancing, etc
Operation and support of MLOps/LLMOps pipelines, using TensorRT-LLM and Triton Inference server to deploy inference services in production
Experience deploying and troubleshooting LLM models on a containerized platform, monitoring, load balancing, etc
Experience with standard processes for operation of a mission critical system – incident management, change management, event management, etc
Managing scalable infrastructure for deploying and managing LLMs
Deploying models in production environments, including containerization, microservices, and API design
Triton Inference Server, including its architecture, configuration, and deployment
Model Optimization techniques using Triton with TRTLLM
Model optimization techniques, including pruning, quantization, and knowledge distillation

Company

New York Global Consultants Inc. (NYGCI)

twittertwittertwitter
company-logo
New York Global Consultants Inc. (NYGCI) is an innovative technology services provider.

H1B Sponsorship

New York Global Consultants Inc. (NYGCI) has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (2)
2023 (6)
2022 (6)
2021 (10)
2020 (32)

Funding

Current Stage
Growth Stage

Leadership Team

leader-logo
Mukesh Molugu
CEO
linkedin
Company data provided by crunchbase