AI news

Breaking the Terabyte-Scale Model “Memory Wall”: Collaborative Compression Framework Fits 1.3TB MoE Model into a 128GB Laptop

AI news

Breaking the Terabyte-Scale Model “Memory Wall”: Collaborative Compression Framework Fits 1.3TB MoE Model into a 128GB Laptop

Collaborative Compression: Breaking the Memory Wall for Trillion-Parameter MoE Models This article introduces the Collaborative Compression framework, which — for the first time — successfully deployed a trillion-parameter Mixture-of-Experts (MoE) large model on a consumer-grade PC with 128 GB RAM, achieving over 5 tokens/second in local inference. Developed by the Moxin

Translate the following blog post title into English, concise and natural. Return plain text only without quotes.

GKE 如何扩展至 13 万节点集群

AI news

Translate the following blog post title into English, concise and natural. Return plain text only without quotes. GKE 如何扩展至 13 万节点集群

At Google Cloud, we’re constantly pushing the scalability of Google Kubernetes Engine (GKE) so that it can keep up with increasingly demanding workloads — especially AI. GKE already supports massive 65,000-node clusters, and at KubeCon, we shared that we successfully ran a 130,000-node cluster in experimental mode — twice

AI news

Translate the following blog post title into English, concise and natural. Return plain text only without quotes. Poetry 管理 Python 项目指南

Python development looks simple from the outside. But managing real projects is rarely easy. You need to install packages, update them, avoid version conflicts, create virtual environments, and prepare your project for distribution. Many beginners think they can handle everything with pip and venv. This works for small scripts, but

AI news

Translate the following blog post title into English, concise and natural. Return plain text only without quotes. QConSF 2025:人在回路:混乱行业中的技术领导力

At QCon San Francisco 2025, Michelle Brush, engineering director of Site Reliability Engineering (SRE) at Google, delivered the closing keynote “Humans in the Loop: Engineering Leadership in a Chaotic Industry” on November 19, 2025. Speaking to a room of software leaders, she explored the broader changes underway in software engineering,

Translate the following blog post title into English, concise and natural. Return plain text only without quotes.

英特尔中国 40 年,未来战略明确:坚持代工业务,Intel 18A 是未来三代产品基石

AI news

Translate the following blog post title into English, concise and natural. Return plain text only without quotes. 英特尔中国 40 年,未来战略明确:坚持代工业务,Intel 18A 是未来三代产品基石

2025-11-22 13:30 浙江 “过去这一年,公司发生了很多积极的变化,也有了更多的坚持和坚守。 作者 | 褚杏娟 “过去这一年,公司发生了很多积极的变化,也有了更多的坚持和坚守。”英特尔市场营销集团副总裁、中国区总经理郭威,在近日的英特尔产业创新大会上说道。 在组织方面,英特尔不断进步和演进,一是整个组织扁平化,由下而上;二是进一步弘扬英特尔工程师文化,更专注客户和产品。 - 在战略层面,英特尔明确了以下关键方向: - 稳健运营:确保资源集中投入于最具战略价值的领域; - X86 生态:作为英特尔核心资产,覆盖从性能、软件到供应商、集成商及终端用户的全链条; 坚持发展代工业务,郭威强调“英特尔是唯一一家既设计芯片、又制造芯片的公司,同时能够利用英特尔庞大的 IP,帮助客户共同生产芯片。” 郭威表示,技术和产品是英特尔的核心价值。公司将从三方面持续发力:一是基于 Intel 18A 制程工艺,明年英特尔将推出更多基于 Intel