Full stack Machine Learning Engineer jobs in United States
cer-icon
Apply on Employer Site
company-logo

BrickRed Systems · 12 hours ago

Full stack Machine Learning Engineer

BrickRed Systems is a global leader in next-generation technology and consulting services. They are seeking a Full stack Machine Learning Engineer to build and scale production-grade AI systems, owning the end-to-end ML and GenAI pipelines while ensuring performance, reliability, and cost-efficiency.

ConsultingInformation ServicesInformation Technology
check
Growth Opportunities
check
H1B Sponsor Likelynote

Responsibilities

Own the full lifecycle of ML & GenAI features in production—from problem framing to monitoring and optimization
Build and scale low-latency, high-throughput inference services for OCR, Speech AI, Search, and LLM applications
Design and evolve the company’s central ML platform & MLOps foundations
Lead LLM fine-tuning, RAG workflows, and agentic AI systems for product use-cases
Drive cloud cost optimization, scalability, and reliability for AI workloads
Partner with Product, Backend, Infra, and Data teams to ship customer-facing AI features
Define engineering standards, reviews, and mentorship for the ML organization
Develop and deploy deep learning, NLP, and GenAI models
Fine-tune LLMs using SFT, QLoRA, FSDP
Implement RAG, federated search, and multi-agent AI systems
Build real-time ML services using NVIDIA Triton, TensorRT, and ONNX
Achieve sub-second inference latency and high RPS throughput
Design event-driven, microservice-based inference pipelines
Architect CI/CD pipelines for ML, automated validation, and model governance
Build and operate feature stores, training pipelines, and monitoring systems
Enable fast experimentation → safe production rollout
Deploy and scale on AWS & Azure using Kubernetes, GPU compute & autoscaling
Own cost-performance tradeoffs for large-scale inference and training

Qualification

PythonPyTorchTransformersNVIDIA TritonMLOpsAWSKubernetesDeep LearningGenAICI/CDSoft Skills

Required

Python
PyTorch
Transformers
HuggingFace
LLMs & GenAI: RAG, LangChain, LangGraph, CrewAI, OpenAI/Claude/LLaMA
Inference: NVIDIA Triton, TensorRT, ONNX, CUDA
MLOps: MLflow, Airflow, Databricks, Kedro, CI/CD
Infra: Kubernetes, Docker, KEDA, Helm
Data & Vector DBs: FAISS, Milvus, ChromaDB, Postgres, MongoDB
Cloud: AWS (S3, SageMaker, Bedrock), Azure
Master's or Bachelor's in Machine Learning, AI, Computer Science, or ECE

Preferred

Built real-time OCR, Speech-to-Text, and Search systems at scale
Designed ML platforms used across multiple teams/products
Proven cost savings in cloud spend and revenue-driving AI deployments
Experience building healthcare or enterprise-grade AI products
Patent, open-source contributions, or competitive ML achievements

Company

BrickRed Systems

twittertwittertwitter
company-logo
BrickRed Systems is an IT Consulting firm specializes in business intelligence, technology consulting, security consulting & more.

H1B Sponsorship

BrickRed Systems has a track record of offering H1B sponsorships. Please note that this does not guarantee sponsorship for this specific role. Below presents additional info for your reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (1)
2022 (1)
2020 (1)

Funding

Current Stage
Late Stage

Leadership Team

leader-logo
Abhay Singh
Director
linkedin
Company data provided by crunchbase