MiniCPM-o 4.5 Released: Toward Real-Time Full-Duplex Omni-Modal Interaction
The ModelBest (面壁智能) team has released MiniCPM-o 4.5, the first on-device model to achieve real-time full-duplex omni-mo…
7 articles about 'Multimodal Large Model'
A recent arXiv paper proposes the AutoSurfer framework, which addresses the scarcity of training data and incomplete web…
A research team has released the M³-VQA benchmark, focusing on multi-entity and multi-hop reasoning visual question answ…
A research team has proposed PivotMerge, a method that leverages post-alignment model merging techniques to effectively …
SenseTime has officially released and open-sourced the SenseNova U1 series of models. Built on its proprietary NEO-unify…
A recent study proposes a multi-layered methodology that accelerates multimodal foundation models through hardware-softw…
SenseTime has launched and open-sourced its next-generation native multimodal large model SenseNova-U1. Hygon DCU was th…