llm - AI News | GogoAI News

Alibaba Qwen3.5 LiveTranslate: 2.8s Latency Breakthrough

2026-05-20 llm 👁 7

Alibaba's Qwen3.5-LiveTranslate-Flash slashes real-time translation latency to 2.8 seconds while preserving speaker voic…

2026-05-17 research 👁 13

New δ-mem framework slashes GPU memory usage for LLMs by 90%, enabling efficient online inference on consumer hardware.

2026-05-15 app 👁 11

AWS launches Assisted NLU for Amazon Lex, leveraging LLMs to improve intent recognition and slot filling without manual …

2026-05-15 llm 👁 12

DeepSeek's AI model accidentally outputs explicit content from China's V2EX forum, raising data privacy and training set…

2026-05-14 industry 👁 10

Brix is hiring AI engineers to build autonomous recruiting agents. This role focuses on LLM reasoning and multi-agent wo…

2026-05-13 research 👁 12

A developer systematically refutes three hypotheses on semantic units using geometric algebra and factor attention on du…

2026-05-13 llm 👁 12

Correcting an AI in chat does not instantly update its model. Learn how training data cycles and RAG systems impact long…

2026-05-12 industry 👁 9

Xiaomi's MiMo Orbit initiative distributes nearly 80 trillion tokens in under two weeks, signaling aggressive expansion …

2026-05-11 llm 👁 13

Developers now fine-tune powerful small language models on consumer hardware, reducing costs and boosting privacy for lo…

2026-05-11 research 👁 11

New study uses LLM judges and TrueSkill to rank 1,000 Show HN posts by merit.

2026-05-10 llm 👁 11

DeepSeek releases R1 model, offering open-source reasoning capabilities that rival top proprietary models at a fraction …

2026-05-07 industry 👁 8

Panasonic integrates large language models into its industrial IoT edge devices, enabling real-time AI inference on fact…