Trend Micro · 14 hours ago
Applied AI Architect - Austin, TX
Trend Micro, a global cybersecurity leader, helps make the world safe for exchanging digital information across enterprises, governments, and consumers. They are seeking an Applied AI Architect to lead the technical direction for model architecture selection, fine-tuning, and optimization, translating research into scalable solutions for cybersecurity.
Cloud Security, Cyber Security, Security, Virtualization
Responsibilities
Drive research-to-production of LLM/SLM systems — from design and fine-tuning to evaluation, deployment, and continual adaptation in enterprise agent workflows
Lead technical choices — determine when to apply context engineering, prompt tuning, continued pretraining, supervised fine-tuning, reasoning fine-tuning, LoRA, or RL
Architect high-performance inference and serving using vLLM, NVIDIA NIM, Triton, CUDA, or other optimized frameworks
Integrate reinforcement learning frameworks (veRL, SkyRL, PyTorch, Ray RLlib) to enhance reasoning, adaptability, and agent feedback loops
Develop and operationalize AI Ops pipelines — build benchmarks and metrics for model evaluation, observability, drift detection, and lifecycle automation
Advance agent interoperability using A2A (Agent-to-Agent) or MCP (Model Context Protocol) for large-scale coordination
Collaborate with cybersecurity researchers to embed threat reasoning, anomaly detection, and defensive logic directly into model behavior
Publish, document, and codify reusable AI blueprints for hybrid (cloud + on-prem) deployments and future research acceleration
Qualifications
Required
Proven end-to-end experience bringing LLM/SLM research into production — from fine-tuning and inference optimization to evaluation and AI Ops integration
Excellent knowledge of at least one of the following:
Deep understanding of data-model-infrastructure trade-offs and optimization under real business constraints
Hands-on experience fine-tuning LLMs using frameworks such as LLaMA Factory, NeMo, and PEFT (e.g., LoRA)
Strong knowledge of GPU-accelerated inference (e.g., vLLM, NIM, Triton, CUDA, NCCL, PyTorch/XLA)
Familiarity with AI Ops toolchains (e.g., Weights & Biases, MLflow, Ray Serve)
Proficiency in Python and experience building containerized AI microservices (e.g., Docker, Kubernetes, Ray)
3+ years of applied AI/ML research or engineering, including 2+ years in production-scale deployment
Preferred
Demonstrated success in building scalable infrastructure and launching LLM/SLM-based features and agent systems within enterprise platforms
Expertise in quantization, distillation, or GPU profiling to lower inference cost
Clear conceptual understanding of when to fine-tune vs prompt-engineer vs use RLHF — and evidence of having applied each effectively
Familiarity with agentic frameworks (LangChain, AWS Strands, AutoGen, etc.)
Deep understanding of A2A/MCP protocols for interoperable multi-agent systems
Benefits
Comprehensive medical, dental and vision insurance
Life insurance
Short & Long Term Disability
Pre-partum, maternity, parental and medical leave
Mental Health Wellness Program
Adoption Assistance
Wellness Incentive
Pet Insurance
401(k) with company match
Paid Time Off
14 Annual Holidays
Tuition Assistance
Employee Resource Groups
Company
Trend Micro
Trend Micro is an IT firm that offers cybersecurity solutions like cloud security, endpoint protection, and network threat detection.
Funding
Current Stage: Public Company
Total Funding: unknown
2000-08-17: IPO
Company data provided by Crunchbase