Multimodal AI - AI News

MateClaw v1.2.0 Turns AI Agents Into Digital Employees

2026-05-06 app 👁 8

MateClaw v1.2.0 reimagines AI agents as persistent digital employees with roles, memory, and multimodal content generati…

2026-05-06 llm 👁 7

LG AI Research unveils EXAONE 4.0, a multimodal foundation model expanding beyond language into vision and reasoning for…

2026-05-06 tutorial 👁 8

Google's Gemini API paired with Firebase creates a powerful stack for developers building multimodal AI applications at …

2026-05-06 research 👁 9

Carnegie Mellon team develops a multimodal AI system that integrates imaging, text, and lab data to diagnose diseases wi…

2026-05-06 industry 👁 8

LG AI Research unveils a multimodal foundation model designed to power its smart home ecosystem with contextual AI under…

2026-05-06 llm 👁 9

French AI startup Mistral AI launches its most ambitious multimodal foundation model yet, targeting enterprise customers…

2026-05-06 llm 👁 8

Google DeepMind launches Gemini 3.0, featuring native multimodal reasoning that processes text, images, audio, and video…

2026-05-05 research 👁 9

Sony Research Tokyo reveals a new multimodal AI framework designed to power next-generation robotics and gaming experien…

2026-05-05 llm 👁 9

Meta unveils Llama 4 Maverick, its largest open-source AI model featuring a mixture-of-experts architecture and multimod…

2026-05-05 llm 👁 9

South Korea's Kakao Brain releases an open-source vision-language model optimized for Korean, advancing multilingual AI …

2026-05-05 industry 👁 9

LINE Yahoo Japan develops a multimodal AI model optimized for Japanese language to automate content moderation across it…

2026-05-05 llm 👁 10

Google DeepMind's Gemini 2.5 Ultra tops benchmarks across text, vision, code, and math, raising the bar for frontier AI …