AIM Intelligence

company

https://aim-intelligence.com

AIM-Intelligence

AI & ML interests

AI Safety & AI Security

Papers

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates

View all Papers

AIM-Intelligence 's Papers 10

Submitted by

DongGeon Lee

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

AIM-Intelligence

AIM Intelligence

X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates

AIM-Intelligence

AIM Intelligence

ObjexMT: Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge under Multi-Turn Jailbreaks

AIM-Intelligence

AIM Intelligence

Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models

AIM-Intelligence

AIM Intelligence

Submitted by

DongGeon Lee

When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs

AIM-Intelligence

AIM Intelligence

Better Safe Than Sorry? Overreaction Problem of Vision Language Models in Visual Emergency Recognition

AIM-Intelligence

AIM Intelligence

HRET: A Self-Evolving LLM Evaluation Toolkit for Korean

AIM-Intelligence

AIM Intelligence

sudo rm -rf agentic_security

AIM-Intelligence

AIM Intelligence

One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs

AIM-Intelligence

AIM Intelligence

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

AIM-Intelligence

AIM Intelligence