🏷️ Model Assessment

2 articles about 'Model Assessment'

Study Reveals Self-Preference Bias in LLM Judges and Proposes Mitigation Strategies

2026-04-29 research 👁 10

A latest arXiv paper systematically quantifies the 'self-preference bias' phenomenon when large language models serve as…

Who Judges the Judges? Bias Mitigation Strategies for LLM Evaluators Receive First Systematic Assessment

2026-04-28 research 👁 14

A large-scale empirical study systematically compared 9 debiasing strategies across 5 mainstream LLM judges, revealing t…

1