open source

Multifunctional Automated Novel Generation Tool Based on Large Language Models | Open Source Daily No.764

AI tools

YILING0013/AI_NovelGenerator
Stars: 2.2k
License: AGPL-3.0
AI_NovelGenerator is a multifunctional automatic novel generation tool based on large language models, specializing in creating multi-chapter long-form novels while maintaining plot coherence and consistent world-building.
* Novel Setting Workshop module provides world-building frameworks, character profiles, and plot blueprints
* Supports multi-stage

By Honghao Wang
Open Source! High Performance, Strong Results, Strict Privacy: OPPO’s On-Device Large Model in Practice

AndesVL

# AndesVL: Next-Generation On-Device Multimodal Large Model
## Introduction
Multimodal large models running directly on devices often suffer from **insufficient performance**, **limited capabilities**, and **poor adaptability**, making it challenging to meet **high-performance**, **strong privacy**, and **low-latency** demands in edge AI applications. These issues create a bottleneck in the evolution of AI smartphones.

By Honghao Wang
Teleoperate 95% of Robotic Arms for 400 Yuan! Shanghai Jiao Tong University Launches Open-Source U-Arm, a Universal, Low-Cost Human-Machine Teleoperation Interface

robotic arm

400 RMB Remote-Control Robotic Arm: Shanghai Jiao Tong University’s U-Arm Open-Source Project
Shanghai Jiao Tong University has unveiled LeRobot-Anything-U-Arm, an open-source, low-cost teleoperation system tested successfully on multiple mainstream robotic arms including XArm6, Dobot CR5, and ARX R5.
Why U-Arm? Lower Cost, Higher Efficiency
Teleoperation Challenges
* Mainstream approach:

By Honghao Wang
AI Algorithm Open Source | Logics-Parsing: End-to-End Structured Processing for Complex PDF Documents

AI document parsing

Logics-Parsing: Advanced Document Parsing for Complex Layouts
In both work and study, extracting usable content from images or PDFs is often frustrating, especially when tools struggle with:
* Converting messy handwritten content into clean notes
* Importing tables from references into presentation slides
* Editing papers with specialized formats (e.g., chemistry)
Even

By Honghao Wang
New Approach to Document Image Parsing: Efficient Recognition and Structuring with Multimodal Models | Open Source Daily No.760

Multimodal AI

Dolphin: Multimodal Document Image Parsing
Repo: bytedance/Dolphin
Stars: 6.4k
License: MIT
Dolphin is a multimodal model for document image parsing, using heterogeneous anchor prompts to enable an “analyze first, then parse” workflow.
Key Features
* Two-stage processing:
  * Layout Analysis: Page-level layout detection that produces an element sequence in natural

By Honghao Wang