Multi-Agent LLM System Enables Reliable Self-Harm Risk Screening
A latest arXiv paper proposes a statistical framework based on multi-agent large language model pipelines, aimed at addr…
18 articles about 'SME'
A latest arXiv paper proposes a statistical framework based on multi-agent large language model pipelines, aimed at addr…
A latest arXiv paper systematically quantifies the 'self-preference bias' phenomenon when large language models serve as…
A latest arXiv paper proposes a utility-based dynamic data valuation framework that starts from token-level information …
A large-scale empirical study systematically compared 9 debiasing strategies across 5 mainstream LLM judges, revealing t…
A new arXiv paper proposes an LLM reliability auditing framework for psychiatric hospitalization risk scoring, systemati…
The officially released "Guidelines on Performance Assessment Management for Fund Management Companies" is driving the m…