Apexon · 14 hours ago

Generative AI Engineer

Apexon is a digital-first technology services firm specializing in business transformation and delivering human-centric digital experiences. The Generative AI Engineer will be responsible for launching and implementing GenAI agentic solutions to manage large-scale production environments, addressing runtime challenges by developing AI solutions that improve productivity and support.
Information Technology & Services
H1B Sponsor Likely
Hiring Manager
Mischelle Sharon Martis

Responsibilities

Build agentic AI systems: Design and implement tool-calling agents that combine retrieval, structured reasoning, and secure action execution (function calling, change orchestration, policy enforcement) following the Model Context Protocol (MCP). Engineer robust guardrails for safety, compliance, and least-privilege access
Productionize LLMs: Build an evaluation framework for open-source and foundational LLMs; implement retrieval pipelines, prompt synthesis, response validation, and self-correction loops tailored to production operations
Integrate with runtime ecosystems: Connect agents to observability, incident management, and deployment systems to enable automated diagnostics, runbook execution, remediation, and post-incident summarization with full traceability
Collaborate directly with users: Partner with production engineers and application teams to translate production pain points into agentic AI roadmaps; define objective functions linked to reliability, risk reduction, and cost; and deliver auditable, business-aligned outcomes
Safety, reliability, and governance: Build validator models, adversarial prompts, and policy checks into the stack; enforce deterministic fallbacks, circuit breakers, and rollback strategies; instrument continuous evaluations for usefulness, correctness, and risk
Scale and performance: Optimize cost and latency via prompt engineering, context management, caching, model routing, and distillation; leverage batching, streaming, and parallel tool-calls to meet stringent SLOs under real-world load
Build a RAG pipeline: Curate domain knowledge; build a data-quality validation framework; establish feedback loops and a milestone framework to maintain knowledge freshness
Raise the bar: Drive design reviews, experiment rigor, and high-quality engineering practices; mentor peers on agent architectures, evaluation methodologies, and safe deployment patterns

Qualifications

Python, Large Language Models, Machine Learning Systems, Cloud Infrastructure, Data Processing Pipelines, Analytical Problem-Solving, Collaboration Skills

Required

5+ years of software development in one or more languages (Python, C/C++, Go, Java); strong hands-on experience building and maintaining large-scale Python applications preferred
3+ years designing, architecting, testing, and launching production ML systems, including model deployment/serving, evaluation and monitoring, data processing pipelines, and model fine-tuning workflows
Practical experience with Large Language Models (LLMs): API integration, prompt engineering, fine-tuning/adaptation, and building applications using RAG and tool-using agents (vector retrieval, function calling, secure tool execution)
Understanding of different LLMs, both commercial and open source, and their capabilities (e.g., OpenAI, Gemini, Llama, Qwen, Claude)
Solid grasp of applied statistics, core ML concepts, algorithms, and data structures to deliver efficient and reliable solutions
Strong analytical problem-solving, ownership, and urgency; ability to communicate complex ideas simply and collaborate effectively across global teams with a focus on measurable business impact

Preferred

Proficiency building and operating on cloud infrastructure (ideally AWS), including containerized services (ECS/EKS), serverless (Lambda), data services (S3, DynamoDB, Redshift), orchestration (Step Functions), model serving (SageMaker), and infra-as-code (Terraform/CloudFormation)

Benefits

Health Insurance with Dental & Vision
401K Plan
Life Insurance, STD & LTD
Paid Vacations & Holidays
Paid Parental Leave
FSA Dependent & Limited Purpose care

Company

Apexon is a digital-first technology services firm, accelerating business transformation and delivering human-centric digital experiences.

H1B Sponsorship

Apexon has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Additional information is provided below for reference. (Data powered by the US Department of Labor)
Trends of Total Sponsorships
2025 (117)
2024 (73)
2023 (83)
2022 (106)
2021 (99)
2020 (135)

Funding

Current Stage
Late Stage

Leadership Team

Radha Krishnan
Founder and Board member
Shalin Shah
Chief Business Development Officer
Company data provided by Crunchbase