Study Reveals Self-Preference Bias in LLM Judges and Proposes Mitigation Strategies
A recent arXiv paper systematically quantifies the 'self-preference bias' phenomenon when large language models serve as…
2 articles about 'Model Assessment'
A recent arXiv paper systematically quantifies the 'self-preference bias' phenomenon when large language models serve as…
A large-scale empirical study systematically compared 9 debiasing strategies across 5 mainstream LLM judges, revealing t…