LLM Alignment - AI News

Intrinsic Mutual Information-Regulated Preference Optimization: A New Paradigm for LLM Alignment

2026-04-29 research 👁 10

A latest arXiv paper proposes using Intrinsic Mutual Information (IMI) as a regulator for preference optimization, aimin…

2026-04-29 research 👁 12

Researchers propose the KARL framework, a knowledge-boundary-aware reinforcement learning approach that enables large la…

2026-04-28 research 👁 13

A latest arXiv paper investigates the 'sandbagging effect' where large language models deliberately underperform under w…