Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models Paper • 2601.14004 • Published 17 days ago • 46
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models Paper • 2601.14004 • Published 17 days ago • 46
Beyond Over-Refusal: Scenario-Based Diagnostics and Post-Hoc Mitigation for Exaggerated Refusals in LLMs Paper • 2510.08158 • Published Oct 9, 2025 • 1
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks Paper • 2401.16589 • Published Jan 29, 2024 • 1
Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers Paper • 2402.11700 • Published Feb 18, 2024 • 1
Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models Paper • 2402.18397 • Published Feb 28, 2024 • 1
LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification Paper • 2506.01484 • Published Jun 2, 2025 • 6
LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification Paper • 2506.01484 • Published Jun 2, 2025 • 6
LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification Paper • 2506.01484 • Published Jun 2, 2025 • 6 • 3
Hallucinations Can Improve Large Language Models in Drug Discovery Paper • 2501.13824 • Published Jan 23, 2025 • 10 • 8
Graph-Guided Textual Explanation Generation Framework Paper • 2412.12318 • Published Dec 16, 2024 • 4
Hallucinations Can Improve Large Language Models in Drug Discovery Paper • 2501.13824 • Published Jan 23, 2025 • 10 • 8
Hallucinations Can Improve Large Language Models in Drug Discovery Paper • 2501.13824 • Published Jan 23, 2025 • 10
Hallucinations Can Improve Large Language Models in Drug Discovery Paper • 2501.13824 • Published Jan 23, 2025 • 10
Hallucinations Can Improve Large Language Models in Drug Discovery Paper • 2501.13824 • Published Jan 23, 2025 • 10 • 8
Graph-Guided Textual Explanation Generation Framework Paper • 2412.12318 • Published Dec 16, 2024 • 4