Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AIM Intelligence

Team
company
https://aim-intelligence.com
AIM-Intelligence
Activity Feed

AI & ML interests

AI Safety & AI Security

Papers

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates

View all Papers

sangyoon yu's profile pictureHanwool Albert Lee's profile pictureyi's profile pictureaim-intelligence's profile pictureEuijun Lee's profile pictureHaon Park's profile pictureDasolChoi's profile pictureSiddhant's profile pictureArth SIngh's profile pictureKim Chaeyun's profile pictureKihyun Kim's profile pictureYumin Kim's profile picturemijin koo's profile picture
AIM-Intelligence 's Papers 10
Submitted by
DongGeon Lee
10

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

AIM-Intelligence AIM Intelligence
12 3
1

X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates

AIM-Intelligence AIM Intelligence
4
2

ObjexMT: Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge under Multi-Turn Jailbreaks

AIM-Intelligence AIM Intelligence
0
1

Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models

AIM-Intelligence AIM Intelligence
Submitted by
DongGeon Lee
5

When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs

AIM-Intelligence AIM Intelligence
10 2
2

Better Safe Than Sorry? Overreaction Problem of Vision Language Models in Visual Emergency Recognition

AIM-Intelligence AIM Intelligence
4
-

HRET: A Self-Evolving LLM Evaluation Toolkit for Korean

AIM-Intelligence AIM Intelligence
1

sudo rm -rf agentic_security

AIM-Intelligence AIM Intelligence
2

One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs

AIM-Intelligence AIM Intelligence
2

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

AIM-Intelligence AIM Intelligence
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs