Language Model Training - AI News

New Study Decouples the True Contributions of Subword Tokenization to Large Language Model Training

2026-05-01 research 👁 11

A latest arXiv paper systematically disentangles the specific contributions of Subword Tokenization to large language mo…

2026-04-29 research 👁 10

A latest arXiv paper conducts an in-depth study on the training stability of Masked Diffusion Language Models (MDMs), co…

2026-04-28 research 👁 11

A new study finds that the power-law distribution inherent in natural language is not a barrier to model learning but ca…