Honghao Wang

Current LLMs Waste 96% GPU — Inference Systems May Need a Rethink! NVIDIA’s Chinese Team Achieves Nearly 6× Token Speed with Free Token Slots, No Closed-Source Dependency

AI news

Current LLMs Waste 96% GPU — Inference Systems May Need a Rethink! NVIDIA’s Chinese Team Achieves Nearly 6× Token Speed with Free Token Slots, No Closed-Source Dependency

# 🚀 Another Masterpiece from a Chinese AI Team ![image](https://blog.aitoearn.ai/content/images/2025/11/img_001-450.jpg) ![image](https://blog.aitoearn.ai/content/images/2025/11/img_002-425.jpg) ![image](https://blog.aitoearn.ai/content/images/2025/11/img_003-401.jpg) **Edited by | Yun Zhao** --- ## Introduction If

Google Officially Announces Gemini 3: Team Reveals Two “Aha Moments” in Model Training — Hassabis: “Another Step Toward AGI” | [Jingwei Low-Key Share]

AI news

Google Officially Announces Gemini 3: Team Reveals Two “Aha Moments” in Model Training — Hassabis: “Another Step Toward AGI” | [Jingwei Low-Key Share]

Gemini 3: Another Step Toward AGI On November 18 (Pacific Time), Google officially launched Gemini 3, marking a significant leap forward in the journey toward Artificial General Intelligence (AGI). This new-generation model sets fresh boundaries for AI–human collaboration, delivering breakthroughs in reasoning, multimodal understanding, and coding. --- Key “Aha