🏷️ Over-Refusal

1 article about 'Over-Refusal'

Safe but Useless? New Benchmark Exposes the LLM Alignment Dilemma

2026-05-01 research

A research team has introduced CarryOnBench, the first benchmark to systematically evaluate whether large language model…