BrickRed Systems · 12 hours ago
Full stack Machine Learning Engineer
BrickRed Systems is a global leader in next-generation technology and consulting services. They are seeking a Full stack Machine Learning Engineer to build and scale production-grade AI systems, owning the end-to-end ML and GenAI pipelines while ensuring performance, reliability, and cost-efficiency.
ConsultingInformation ServicesInformation Technology
Responsibilities
Own the full lifecycle of ML & GenAI features in production—from problem framing to monitoring and optimization
Build and scale low-latency, high-throughput inference services for OCR, Speech AI, Search, and LLM applications
Design and evolve the company’s central ML platform & MLOps foundations
Lead LLM fine-tuning, RAG workflows, and agentic AI systems for product use-cases
Drive cloud cost optimization, scalability, and reliability for AI workloads
Partner with Product, Backend, Infra, and Data teams to ship customer-facing AI features
Define engineering standards, reviews, and mentorship for the ML organization
Develop and deploy deep learning, NLP, and GenAI models
Fine-tune LLMs using SFT, QLoRA, FSDP
Implement RAG, federated search, and multi-agent AI systems
Build real-time ML services using NVIDIA Triton, TensorRT, and ONNX
Achieve sub-second inference latency and high RPS throughput
Design event-driven, microservice-based inference pipelines
Architect CI/CD pipelines for ML, automated validation, and model governance
Build and operate feature stores, training pipelines, and monitoring systems
Enable fast experimentation → safe production rollout
Deploy and scale on AWS & Azure using Kubernetes, GPU compute & autoscaling
Own cost-performance tradeoffs for large-scale inference and training
Qualification
Required
Python
PyTorch
Transformers
HuggingFace
LLMs & GenAI: RAG, LangChain, LangGraph, CrewAI, OpenAI/Claude/LLaMA
Inference: NVIDIA Triton, TensorRT, ONNX, CUDA
MLOps: MLflow, Airflow, Databricks, Kedro, CI/CD
Infra: Kubernetes, Docker, KEDA, Helm
Data & Vector DBs: FAISS, Milvus, ChromaDB, Postgres, MongoDB
Cloud: AWS (S3, SageMaker, Bedrock), Azure
Master's or Bachelor's in Machine Learning, AI, Computer Science, or ECE
Preferred
Built real-time OCR, Speech-to-Text, and Search systems at scale
Designed ML platforms used across multiple teams/products
Proven cost savings in cloud spend and revenue-driving AI deployments
Experience building healthcare or enterprise-grade AI products
Patent, open-source contributions, or competitive ML achievements
Company
BrickRed Systems
BrickRed Systems is an IT Consulting firm specializes in business intelligence, technology consulting, security consulting & more.
H1B Sponsorship
BrickRed Systems has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2023 (1)
2022 (1)
2020 (1)
Funding
Current Stage
Late StageCompany data provided by crunchbase