AI/ML & Analytics Platform Engineer jobs in United States

PERMEVO · 1 day ago

AI/ML & Analytics Platform Engineer

PERMEVO is a company focused on AI and Analytics, seeking an AI/ML & Analytics Platform Engineer to join their rapidly growing team. The role involves building and scaling a core developer platform to support AI/ML initiatives, ensuring high-performance and secure infrastructure for scientific discoveries.

Management Consulting
Hiring Manager
Vaibhav Nashikkar (VB)

Responsibilities

Build and enhance the core AI/ML & Analytics platform, services, and tools across development, testing, and production environments
Develop platform capabilities to support scalable batch and real-time workflows, including low‑latency predictions and offline inference
Improve platform performance, automate operations, scale compute resources, and streamline deployment processes
Collaborate with foundational cloud teams to ensure platform reliability, security, efficiency, and operational excellence
Implement monitoring, observability, and governance frameworks, including registry systems, alerts, and compliance controls
Partner with cross-functional spoke teams to design AI/ML architecture, pipelines, and scalable deployment solutions
Champion self-service platform usage, IaC practices, and GitOps methodologies to enhance developer experience
Continuously refine platform usability, automation, and end‑user efficiency

Qualifications

AI/ML platforms, Python, AWS, Infrastructure as Code, Containerization, Microservices architecture, CI/CD pipelines, Version control, Performance optimization, Communication

Required

Bachelor's or Master's degree in Computer Science, Engineering, Data Science, Mathematics, Statistics, Operations Research, or related field; 5+ years of relevant experience
Proven experience building AI/ML & Analytics platforms for ML Researchers, Engineers, Data Scientists, or Analysts
Strong programming experience in Python, SQL, Spark, or similar languages and frameworks; familiarity with ML frameworks such as PyTorch or TensorFlow
Experience designing scalable self-service systems using microservices and/or event-driven architectures
Hands-on experience with AWS, particularly AI/ML-related services (e.g., SageMaker)
Proficiency with Infrastructure as Code (Terraform, OpenTofu, CDK, Pulumi, etc.) and CI/CD pipelines
Working knowledge of version control (GitHub/GitLab), CI/CD tools (GitHub Actions, Jenkins), and workflow systems (Jira)
Experience with containerization (Docker, Podman) and orchestration platforms (Kubernetes, Rancher)
Exposure to large-scale CPU/GPU/multi‑GPU environments (CUDA fundamentals a plus)
Understanding of operational capabilities, including observability, monitoring, tracking, and registries
Demonstrated ability to analyze performance, optimize systems, and manage cloud-scale cost efficiency
Strong communication skills with the ability to collaborate effectively across teams

Preferred

Experience in the pharmaceutical or biotech industry
Proficiency in statically typed languages such as C/C++, Java, Go, or Rust
Hands-on experience with distributed systems (Ray, Dask, Spark) or high-performance computing (Slurm)
Familiarity with data platforms such as Databricks, Snowflake, or AWS Lake Formation, and technologies like Delta, Iceberg, Hudi
Experience building or operating real-time/streaming systems (Kafka, Spark Streaming)
Knowledge of GitOps-based tools such as ArgoCD, Crossplane
Experience working in multi-cloud environments (AWS, GCP, Azure)
Familiarity with high-performance inference/training frameworks such as ONNX Runtime, TensorRT, or Triton

Company

PERMEVO

PERMEVO is a global talent solutions company that provides executive search, contract subject matter experts, and direct hire services to life, health, and data science-related sectors.

Funding

Current Stage
Early Stage
Company data provided by Crunchbase