This job has closed.

Penguin Ai · 2 days ago

AI Architect

Penguin Ai is an innovative company focused on transforming healthcare through advanced AI solutions. They are seeking an AI Architect to lead the design, development, and deployment of large-scale AI systems that bridge business requirements with technical implementation.

Artificial Intelligence (AI) · Health Care · Medical

Responsibilities

Design and build groundbreaking AI systems from the ground up, harnessing the power of large language models and generative AI technologies
Craft robust, production-ready AI applications that not only meet, but exceed, business objectives and performance demands
Evaluate, select, and seamlessly integrate LLM APIs from leading providers (OpenAI, Anthropic Claude, Google Gemini, and more) into our ecosystem
Establish and champion best practices for prompt engineering, model selection, and overall AI system optimization, making our solutions shine
Custom-tailor open-source models (like Llama, Mistral) to perfection for specific, high-impact business use cases
Design and implement custom training pipelines and rigorous evaluation frameworks to ensure model excellence
Optimize model performance, latency, and cost, ensuring our AI runs like a dream in production environments
Stay relentlessly current with the latest model architectures and cutting-edge fine-tuning techniques
Deploy and manage AI models at enterprise scale, mastering containerization (Docker) and orchestration (Kubernetes)
Build robust, scalable APIs using FastAPI and other modern frameworks to power our AI solutions
Design and implement end-to-end MLOps pipelines for seamless model versioning, continuous monitoring, and automated deployment
Ensure the high availability, iron-clad security, and peak performance of all AI systems in production
Partner with stakeholders to dissect business problems and elegantly translate them into precise, actionable technical requirements
Inspire and elevate our development teams by providing expert technical guidance and mentorship
Conduct thorough feasibility assessments and technical due diligence for new AI initiatives, charting the path forward
Craft clear, comprehensive technical documentation, architectural diagrams, and implementation roadmaps that illuminate the way

Qualifications

Generative AI · MLOps · Large Language Models · Python · FastAPI · Docker · Kubernetes · ML Frameworks · Cloud Platforms · Business Translation · Technical Documentation · Problem-Solving · Technical Mentorship

Required

5+ years of battle-tested experience in machine learning engineering or data science
1+ years of hands-on experience building, deploying, and managing generative AI models in production
A proven track record of successfully delivering large-scale ML solutions
Expert-level command of LLM APIs from major providers (OpenAI, Anthropic Claude, Google Gemini, etc.)
Hands-on experience fine-tuning powerful transformer models (Llama, Mistral, etc.)
Strong proficiency in FastAPI, Docker, and Kubernetes
Experience with key ML frameworks (PyTorch, TensorFlow, Hugging Face Transformers)
Deep proficiency in Python and modern software development practices
Experience with leading cloud platforms (AWS, GCP, or Azure) and their AI/ML services
Strong understanding of transformer architectures, attention mechanisms, and modern NLP techniques
Deep experience with MLOps tools and practices (model versioning, monitoring, CI/CD)
Exceptional ability to translate complex business requirements into elegant, actionable technical solutions
Strong problem-solving skills coupled with an architectural mindset
An advanced degree in Computer Science, Engineering, Data Science, or a related field

Preferred

Have worked with vector databases and RAG (retrieval-augmented generation) systems
Possess knowledge of distributed training and model parallelization techniques
Have experience with model quantization and optimization for edge deployment
Are familiar with AI safety, alignment, and responsible AI practices
Bring experience in specific domains like finance, healthcare, or legal
Have not just survived but thrived in the beautiful chaos of a high-growth startup

Benefits

Medical, vision, and dental coverage: Keeps you healthy and smiling
Generous vacation policy and company holidays: Recharge and conquer!

Company

Penguin Ai

Penguin AI uses AI to analyze health records and support care decisions, improving patient outcomes.

Funding

Current Stage: Early Stage
Total Funding: $25M
Key Investors: Greycroft

2025-09-11 · Series A · $25M
2024-01-04 · Convertible Note
Company data provided by Crunchbase