Red Teaming Domain Expert - AI Training (Contract) jobs in United States
cer-icon
Apply on Employer Site
company-logo

ChatGPT Jobs ยท 1 week ago

Red Teaming Domain Expert - AI Training (Contract)

Handshake is building the career network for the AI economy, and they are seeking a Red Teaming Domain Expert to stress-test AI models by designing creative, adversarial prompts that expose vulnerabilities. This role supports AI safety and model robustness for leading research labs and requires creativity, curiosity, and strong ethical judgment.

Computer Software
badNo H1Bnote

Responsibilities

Crafting creative prompts and scenarios to intentionally stress-test AI guardrails
Discovering ways around safety filters, restrictions, and defenses
Exploring edge cases to provoke disallowed, harmful, or incorrect outputs
Documenting experiments clearly, including what you tried and why
Reviewing and refining adversarial prompts generated by Fellows
Collaborating with engineers, tutors, and researchers to share findings and strengthen defenses
Working with potentially disturbing content, including violence, explicit topics, and hate speech
Staying current on jailbreaks, attack methods, and evolving model behaviors

Qualification

LLM experienceCreative problem-solvingWritten communicationAdversarial thinkingJailbreak techniquesDocumentation skillsEthical judgmentCuriosityTolerance for graphic contentCollaboration

Required

Creativity, curiosity, and an ability to think like an adversary while operating with strong ethical judgment
Strong hands-on experience using multiple LLMs
Intuition for crafting prompts; familiarity with jailbreak or evasion techniques is a plus
Creative, adversarial problem-solving skills
Clear and thoughtful written communication
Ability to tolerate emotionally heavy or graphic content
Curiosity, persistence, and comfort with frequent failure in experimentation
Strong ethical judgment and ability to separate adversarial thinking from personal values
Self-directed, collaborative, and comfortable in feedback-heavy environments
You go deep into unusual interests (fandoms, niche internet cultures, gaming exploits, Wikipedia rabbit holes, etc.)
You come from a creative background, writers, visual artists, etc
You are obsessed with AI and can't stop talking about it

Preferred

Prior red teaming, moderation, or adversarial testing experience
Background in writing, gaming, improv, or niche internet subcultures
Experience documenting complex processes or research
Familiarity with safety, trust & safety, or digital security concepts

Company

ChatGPT Jobs

twitter
company-logo
We find the best job offers for experts in ChatGPT and related technologies.

Funding

Current Stage
Early Stage
Company data provided by crunchbase