NVIDIA · 5 days ago
Senior Architect, AI Solutions Engineering
NVIDIA is seeking an AI Solutions Architect to join its Infrastructure Planning and Process Team, focusing on scaling key AI solutions for NVIDIA's internal cloud infrastructure. The role involves managing tools for delivering solutions, identifying gaps, optimizing performance, and collaborating with various teams to solve complex problems.
Artificial Intelligence (AI)Consumer ElectronicsGPUHardwareSoftwareVirtual Reality
Responsibilities
Serve as an Architect developing internal AI systems used by thousands of NVIDIANs globally
Identify gaps and issues and resolve ones are better suited for AI solutions versus conventional approaches
Further divide the AI category into 'buy versus build' options by researching available tools in the market
Align with teams across Nvidia to establish overall AI system goals and break them down into specific objectives for each sub-system
Drive, motivate, convince, and mentor sub-system leads to achieve improvements with agility and speed
Identify performance bottlenecks and optimize the speed and cost efficiency of AI development and testing systems
Drive the planning of software/hardware capacity, covering both internal and public cloud, addressing the balance between time and utilization
Introduce technologies enabling massively parallel systems to improve turnaround time by an order of magnitude
Collaborate with AI product vendors to gain deep insights of the AI industry, and share them with leaders and developers internally
Qualification
Required
BS EE/CS or equivalent experience with 10+ years of systems software development with at least 1 year of experience in developing/exploring AI
Development with Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Fine-Tuning LLMs, AI Agentic workflows, LangChain, LangGraphs, and Cascading models
Experience in deploying in hybrid, multi-cloud architecture and edge computing
Extensive experience architecting and shipping large-scale distributed software systems
Ability to identify gaps and bottlenecks, and develop solutions to optimize performance
Strong programming and software development skills in JAVA, Python, Shell-script along with good understanding of distributed systems and REST APIs
Experience in working with SQL/NoSQL database systems such as MySQL, Cassandra, MongoDB or Elasticsearch
Excellent knowledge and working experience with Docker containers and Virtual Machines
Good background of Cloud technologies like: OpenStack, Docker, Kubernetes, Chef/Puppet, Hadoop/Ceph/SwiftStack, LXC, Git, Perforce, JFrog, Kafka
Ability to work across organizational boundaries optimally to improve alignment and productivity between teams in a multi-national, multi-time-zone corporate environment
Preferred
MS or PhD in EE/CS
Depth in AI, Machine Learning and Deep Learning algorithms and techniques
Strong collaborative and interpersonal skills, with a consistent record of guiding and influencing others in dynamic environments
Experience developing large-scale software systems using service-oriented architecture under real-time performance requirements
Background in designing high-performance, scalable software systems with a strong focus on hardware cost optimization
Benefits
Equity
Benefits
Company
NVIDIA
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
H1B Sponsorship
NVIDIA has a track record of offering H1B sponsorships. Please note that this does not
guarantee sponsorship for this specific role. Below presents additional info for your
reference. (Data Powered by US Department of Labor)
Distribution of Different Job Fields Receiving Sponsorship
Represents job field similar to this job
Trends of Total Sponsorships
2025 (1877)
2024 (1355)
2023 (976)
2022 (835)
2021 (601)
2020 (529)
Funding
Current Stage
Public CompanyTotal Funding
$4.09BKey Investors
ARPA-EARK Investment ManagementSoftBank Vision Fund
2023-05-09Grant· $5M
2022-08-09Post Ipo Equity· $65M
2021-02-18Post Ipo Equity
Recent News
2026-01-08
Company data provided by crunchbase