AI news
Z Tech | LMSYS Team Releases Large-Scale MoE Reinforcement Learning Framework Miles — Small Steps Lead to Long Journeys
!image Miles: Enterprise-Scale Reinforcement Learning for MoE & Production The lightweight RL framework slime quietly gained popularity thanks to its support for diverse post-training pipelines and Mixture-of-Experts (MoE) tasks — including GLM‑4.6. Building on that foundation, the LMSYS team has officially introduced Miles: a reinforcement learning framework purpose-built for