AI Red Teamer

Washington2mo ago

About the role

<div class="content-intro"><p><strong>About 10a Labs: </strong>10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. Our adversarial red teaming, model evaluations, and intelligence collection enable engineering, safety, and security teams to stay ahead of evolving threats and deploy AI systems safely.</p></div><h3><strong>In This Role, You Will</strong></h3> <ul> <li>Develop and run adversarial test suites—both manual and scripted—for LLMs and image / video models.</li> <li>Craft multilingual prompts, jailbreaks, and escalation chains targeting policy edge cases.<br>Analyze outputs, triage failures, and write concise vulnerability reports.</li> <li>Contribute to internal tooling (e.g., prompt libraries, scenario generators, dashboards).</li> </ul> <h3><strong>We’re Looking for Someone Who</strong></h3> <ul> <li>Has 2-4 years of experience in red-teaming, security research, trust & safety, or related fields.</li> <li>Is comfortable scripting basic tests (Python, Bash, or similar) and working in Jupyter or prompt-engineering tools.</li> <li>Communicates clearly in English and at least one additional language (ideally major non-English language relevant to global threat landscapes).</li> <li>Thinks like an adversary, documents findings crisply, and iterates quickly.</li> </ul> <h3><strong>Requirements</strong></h3> <ul> <li>Bachelor’s degree—or equivalent experience—in CS, data science, linguistics, international studies, or security.</li> <li>Basic proficiency with Python and command-line tools.</li> <li>Demonstrated interest in AI safety, adversarial ML, or abuse detection.</li> <li>Strong writing skills for short vulnerability reports and long-form analyses.</li> <li>Ability to rapidly context switch across domains, modalities, and abuse areas.</li> <li>Excited to work in a fast-paced and ambiguous space.</li> </ul> <h3><strong>Nice to Have</strong></h3> <ul> <li>Full professional proficiency in Arabic, Chinese, Farsi, Portuguese, Russian, or Spanish, as well as English.</li> <li>Prior work in content moderation, disinformation analysis, or cyber-threat intelligence.</li> <li>Experience with prompt-automation frameworks (e.g., Promptfoo, LangChain, Garak).<br>Familiarity with vector search or LLM fine-tuning workflows.</li> <li>Formal training or certification in red-teaming or penetration testing.</li> </ul> <h3><strong>Compensation & Benefits</strong></h3> <ul> <li>Salary range: $70K–$90K depending on experience.</li> <li>Opportunity for spot bonuses and annual performance-based bonus.</li> <li>Fully remote (U.S.-based) with flexible hours.</li> <li>Comprehensive health, dental, and vision.</li> <li>Generous PTO and paid holidays.</li> <li>401(k) plan.</li> <li>Professional-development stipend for courses, conferences, or language study.</li> <li>We reward excellence with growth—team members who excel have clear paths for promotion and skill development.</li> </ul> <p> </p>

Perks & benefits

401k
Paid Time Off

764,000+ hidden jobs like this

10a Labs and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

Unlimited applications — free stops at 5
Track every application in one place
Apply straight to the source, one click
Save & organize roles you love
Roles pulled from company boards before the big sites

Weekly

$9.99

$4.99/week

For an active search. Cancel anytime.

Get Weekly

Monthly

$24.99

$12.99/month

The smart pick. Save 35% vs weekly.

Get Monthly

Lifetime

$99

$49.99once

Pay once. Every future feature, forever.

Get Lifetime