Open Source Today (2025-10-17): Facebook Releases MobileLLM-Pro with 128k Long Context Window and Outstanding Cross-Task Generalization

Open Source Today (2025-10-17): Facebook Releases MobileLLM-Pro with 128k Long Context Window and Outstanding Cross-Task Generalization

Daily Discovery of Latest LLMs

Date: 2025-10-17 · Location: Hong Kong, China

---

📋 Overview

Base Language Model: MobileLLM-Pro

Markdown Model: Nanonets-OCR2

Multimodal Model: Home-cooked Mistral Small Omni

Chat Interface Project: chat-ui

Document AI Engine: PaddleOCR

Optimization Method: EPO

---

🏆 Base Models

① MobileLLM-Pro

image

Description:

MobileLLM-Pro is part of the MobileLLM series with 1 billion parameters, optimized for on-device inference on mobile platforms.

It supports:

  • General-purpose language tasks: tool invocation, Q&A, rewriting, summarizing.
  • Context windows: up to 128k tokens for large document summarization & retrieval.
  • Efficient quantization: int4 checkpoints maintain near-lossless performance.

Model Variants:

  • Pre-trained base model
  • Instruction-tuned version

One-click access:

https://sota.jiqizhixin.com/project/mobilellm-pro2

---

② Nanonets-OCR2-3B

image

Description:

Nanonets-OCR2-3B is an advanced OCR-to-Markdown model capable of:

  • Extracting text and outputting semantic-structured Markdown.
  • Handling complex, multilingual documents.
  • Performing Visual Question Answering (VQA).

One-click access:

https://sota.jiqizhixin.com/project/nanonets-ocr2-3b

---

③ Home-cooked Mistral Small Omni

image

Description:

Home-Cook-Mistral-Small-Omni-24B merges:

  • Mistral Small 2506 (visual capabilities)
  • Voxtral 2507 (audio capabilities)

Features:

  • Multimodal processing (vision + audio)
  • Released under Apache 2.0 License

One-click access:

https://sota.jiqizhixin.com/project/home-cooked-mistral-small-omni

---

🛠️ Frameworks & Tools

① chat-ui

image

Description:

An open-source SvelteKit chat interface for LLMs:

  • Powers HuggingChat
  • Compatible with OpenAI APIs
  • Supports multi-service integration for NLP apps

One-click access:

https://sota.jiqizhixin.com/project/chat-ui

---

② PaddleOCR

image

Description:

PaddleOCR offers:

  • End-to-end OCR + Document AI processing
  • Conversion of PDF/images → structured data
  • Support for 100+ languages
  • Lightweight design with high accuracy
  • Widely used by developers & enterprises

One-click access:

https://sota.jiqizhixin.com/project/paddleocr2

---

💡 Monetization Tip:

Platforms like AiToEarn官网 enable simultaneous publishing across major social media networks, analytics tracking, model rankings, and monetization options — ideal for LLM & OCR integration workflows.

---

🤖 Agent Development

① EPO — Entropy-Regularized Policy Optimization

image

Description:

EPO introduces entropy regularization into reinforcement learning for LLM agents to improve:

  • Training stability
  • Generalization performance in multi-turn environments

Results:

  • ScienceWorld: PPO+EPO doubles reward compared to baseline, with smooth monotonic gains.
  • ALFWorld: GRPO+EPO shows steady upward trends.
  • Validation: Success rate >0.8 within 40 steps (baseline struggles at <0.4 after 100 steps).

One-click access:

https://sota.jiqizhixin.com/project/epo-2

---

image
image

---

📚 Original Source

Read original

Open in WeChat

---

🌐 Final Note

By combining PaddleOCR for intelligent document processing, EPO for stable agent training, and AiToEarn for global content publishing & monetization, creators can build end-to-end AI solutions that not only perform well but also generate sustainable revenue across multiple platforms.

---

Would you like me to add a comparison table of these projects for easier side-by-side analysis? That would make this Markdown even more concise for quick reference.

Read more

Translate the following blog post title into English, concise and natural. Return plain text only without quotes.

ChatGPT Atlas 发布,AI 浏览器大乱斗...

Translate the following blog post title into English, concise and natural. Return plain text only without quotes. ChatGPT Atlas 发布,AI 浏览器大乱斗...

# AI Browsers: When LLM Companies Step In 原创 lencx · 2025-10-22 07:00 · 上海 --- ## Overview Large Language Model (LLM) companies are making moves into the **AI browser** space. From new entrants like **Dia**[1], **Comet**[2], and **ChatGPT Atlas**[3], to established browsers like **Chrome** and **Edge** (which now feature

By Honghao Wang