Nisum · 2 days ago
ML & GenAI Platform Engineer
Nisum is seeking an ML & GenAI Platform Engineer. The role involves deploying and managing ML and Generative AI systems, developing enterprise-grade applications, and ensuring best practices in observability and governance.
Consulting
Responsibilities
Deploy, scale, and operate ML and Generative AI systems in cloud-based production environments (Azure preferred)
Build and manage enterprise-grade RAG applications using embeddings, vector search, and retrieval pipelines
Implement and operationalize agentic AI workflows with tool use, built on frameworks such as LangChain and LangGraph
Develop reusable infrastructure and orchestration for GenAI systems using Model Context Protocol (MCP) and Agent Development Kit (ADK)
Design and implement model and agent serving architectures including APIs, batch inference, and real-time workflows
Establish best practices for observability, monitoring, evaluation, and governance of GenAI pipelines in production
Integrate AI solutions into business workflows in collaboration with data engineering, application teams, and business stakeholders
Drive adoption of MLOps / LLMOps practices including CI/CD automation, versioning, testing, and lifecycle management
Ensure security, compliance, reliability, and cost optimization of AI services deployed at scale
Qualifications
Required
8–10 years of experience in ML Engineering, AI Platform Engineering, or Cloud AI Deployment roles
Strong proficiency in Python with experience building production-grade AI/ML services
Proven experience deploying and supporting GenAI applications in real-world enterprise environments
Hands-on experience with RAG systems, embeddings, vector search, and retrieval pipelines
Experience with orchestration frameworks including LangChain, LangGraph, and LangSmith
Strong knowledge of model serving, inference pipelines, monitoring, and observability for AI systems
Experience working with cloud AI ecosystems (Azure AI, Azure ML, Databricks preferred)
Familiarity with containerization and deployment tools (Docker, Kubernetes, REST APIs)
Exposure to vector databases such as Pinecone, Weaviate, FAISS, or Azure Cognitive Search
Experience deploying agentic AI systems with tool integrations in production
Strong understanding of CI/CD pipelines and DevOps practices for AI platforms
Familiarity with enterprise governance frameworks for Responsible AI
Bachelor's degree in Computer Science, Engineering, Data Science, or related field (required)
Preferred
Master's degree is a plus
Company
Nisum
Nisum is a leading technology consulting partner based in Silicon Valley that designs and builds custom digital commerce platforms.
H1B Sponsorship
Nisum has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by US Department of Labor)
Trends of Total Sponsorships
2025 (107)
2024 (102)
2023 (162)
2022 (129)
2021 (112)
2020 (279)
Funding
Current Stage
Late Stage
Company data provided by Crunchbase