Twelve Labs Raises $150M for AI Video Understanding
Korean AI startup Twelve Labs secures $150 million in funding to advance its multimodal video understanding platform and…
40 articles about 'multimodal AI'
Sony Research Tokyo introduces a multimodal AI system that autonomously creates and controls game characters with unprec…
A new Diffusion Transformer architecture promises to merge image, video, and 3D generation into a single unified model, …
Google DeepMind launches Gemini 2.5 Ultra, its most powerful AI model featuring native multimodal reasoning across text,…
Sony Research introduces a new multimodal AI model designed to generate and transform creative content across music, ima…
Sony Research has developed a new multimodal AI model designed to streamline creative content production across music, f…
Boston Dynamics integrates multimodal AI into its electric Atlas robot, targeting warehouse automation with advanced per…
An indie developer launches 'Sharp Tongue,' a WeChat mini-program that uses AI to deliver brutally funny critiques of an…
Sony AI Research Lab Tokyo introduces a new multimodal framework designed to generate music, images, and 3D assets from …
OpenAI unveils GPT-5 Turbo featuring advanced reasoning, native multimodal capabilities, and significant API improvement…
NVIDIA expands its NeMo framework with new multimodal capabilities for building autonomous AI agents, targeting enterpri…
DeepSeek quietly published a technical paper revealing its first multimodal vision capabilities, only to remove the docu…