LongSumEval: Reshaping Long-Document Summarization Evaluation with QA Feedback
A latest arXiv paper introduces LongSumEval, a framework that unifies summarization evaluation and generation optimizati…
3304 articles about 'AR'
A latest arXiv paper introduces LongSumEval, a framework that unifies summarization evaluation and generation optimizati…
A new study analyzing the intermediate reasoning steps of large language models reveals hidden biases and discrimination…
A latest arXiv paper proposes the Dual-Track CoT method, which uses a budget-aware stepwise guidance strategy to signifi…
A latest arXiv paper analyzes feature-level mechanisms to reveal why reinforcement learning post-training enhances out-o…
A latest arXiv survey systematically reviews research progress in LLM-based conversational user simulation, exploring ho…
A research team introduces the BenchGuard framework, the first to leverage frontier large language models to automatical…
Researchers introduce GAIA-v2-LILT, a refined pipeline combining functional alignment and cultural adaptation to address…
Researchers propose ADE (Adaptive Dictionary Embeddings), the first method to successfully extend multi-anchor word repr…
Mercedes-Benz reported a 17% year-over-year decline in Q1 2025 EBIT to €1.9 billion, with revenue falling 4.9% and autom…
A latest arXiv paper proposes Exploratory Sampling (ESamp), a decoding method that explicitly encourages large language …
A new study proposes a unified theoretical framework that brings unsupervised concept extraction techniques — including …
A new study proposes a data augmentation pipeline combining large language model text paraphrasing with text-to-speech s…