AI training - aitoearn

AI and seniors

Artificial Intelligence on the Shoulders of Elders

# Beijing – 2025‑11‑02

## What Can Seniors Offer AI?

### **Their Unique Value in the Age of Artificial Intelligence**

![image](https://blog.aitoearn.ai/content/images/2025/11/img_001-61.jpg)

We often assume seniors are *passive recipients* of new technology. In fact, over a lifetime they have cultivated **emotional intelligence*

AI training

The Secret to Boosting AI Learning Efficiency by 50×: On-Policy Distillation

Interpreting Thinking Machines Lab’s Latest Research: On‑Policy Distillation

---

Introduction: Rethinking How Machines Learn

Imagine you’re teaching a student to write an essay.

* Traditional way: Give them ten sample essays and tell them to imitate.
* → This is imitation learning.
* Problem: Faced with a new topic, they struggle.

GPU scheduling

HAMi × NVIDIA: Detailed Guide to GPU Topology-Aware Scheduling

# HAMi NVIDIA GPU Topology-Aware Scheduling: Design & Code Deep Dive

**Date:** 2025-10-25 13:30 (Zhejiang)

This article explains the **design philosophy**, **core principles**, and **code implementation** of HAMi’s new **topology-aware scheduling** capability for NVIDIA GPUs in version `v2.7.0`. We focus on how HAMi intelligently schedules GPU workloads

ExGRPO

New Paradigm for Large Model Reasoning Learning: the ExGRPO Framework, from Blind Practice to Smart Review

2025-10-24 00:01 Jilin

Beyond Traditional On-Policy RLVR Methods

---

Large Model Intelligence｜Sharing

Source: Quantum Bits

A joint research team from Shanghai Artificial Intelligence Laboratory, University of Macau, Nanjing University, and The Chinese University of Hong Kong has introduced a novel experience management and learning framework: ExGRPO. Goal: Scientifically

ExGRPO

New Paradigm for Large Model Reasoning: ExGRPO Framework — From Blind Practice to Smart Review

Large Models in Reinforcement Learning Finally Understand Which Experiences Are Most Valuable!

A research team from Shanghai Artificial Intelligence Laboratory, University of Macau, Nanjing University, and The Chinese University of Hong Kong has proposed a groundbreaking experience management and learning framework: ExGRPO. By identifying, storing, filtering, and learning truly valuable

nanochat

nanochat: Full-Stack LLM Implementation by Andrej Karpathy

nanochat is a fascinating new project from Andrej Karpathy, discussed in detail in this forum post. It delivers a complete ChatGPT-style LLM stack, including training, inference, and a web-based UI, all in a single, minimal, hackable, dependency-light codebase.

> "This repo