MateClaw v1.2.0 Turns AI Agents Into Digital Employees
MateClaw v1.2.0 reimagines AI agents as persistent digital employees with roles, memory, and multimodal content generati…
40 articles about 'Multimodal AI'
MateClaw v1.2.0 reimagines AI agents as persistent digital employees with roles, memory, and multimodal content generati…
LG AI Research unveils EXAONE 4.0, a multimodal foundation model expanding beyond language into vision and reasoning for…
Google's Gemini API paired with Firebase creates a powerful stack for developers building multimodal AI applications at …
Carnegie Mellon team develops a multimodal AI system that integrates imaging, text, and lab data to diagnose diseases wi…
LG AI Research unveils a multimodal foundation model designed to power its smart home ecosystem with contextual AI under…
French AI startup Mistral AI launches its most ambitious multimodal foundation model yet, targeting enterprise customers…
Google DeepMind launches Gemini 3.0, featuring native multimodal reasoning that processes text, images, audio, and video…
Sony Research Tokyo reveals a new multimodal AI framework designed to power next-generation robotics and gaming experien…
Meta unveils Llama 4 Maverick, its largest open-source AI model featuring a mixture-of-experts architecture and multimod…
South Korea's Kakao Brain releases an open-source vision-language model optimized for Korean, advancing multilingual AI …
LINE Yahoo Japan develops a multimodal AI model optimized for Japanese language to automate content moderation across it…
Google DeepMind's Gemini 2.5 Ultra tops benchmarks across text, vision, code, and math, raising the bar for frontier AI …