DavidAU/Mistral-Nemo-Inst-2407-12B-Thinking-Uncensored-HERETIC-HI-Claude-Opus Text Generation • 12B • Updated 6 days ago • 743 • 8
view article Article Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model 17 days ago • 16
AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models Paper • 2506.14682 • Published Jun 17, 2025
MAIF: Enforcing AI Trust and Provenance with an Artifact-Centric Agentic Paradigm Paper • 2511.15097 • Published Nov 19, 2025