Building intelligent systems that can handle open-ended, multi-step problems has long been a challenge in AI development. Traditional multi-agent frameworks…
Amphion: A Unified Open-Source Toolkit for Zero-Shot Speech, Singing, and Audio Generation 9539
Amphion is an open-source toolkit purpose-built for audio, music, and speech generation that dramatically lowers the entry barrier for junior…
PP-FormulaNet: High-Accuracy and High-Speed Math Formula Recognition for Document Intelligence 5930
In the world of scientific publishing, academic research, and educational technology, one persistent bottleneck remains: converting handwritten or printed mathematical…
SGLang: High-Performance LLM Serving for Structured, Multi-Step, and Multimodal AI Applications 21238
Large language models (LLMs) are no longer just tools for answering questions—they power agents, structured data pipelines, multi-turn conversations, and…
PP-OCR: Ultra-Lightweight, Multilingual OCR and Document AI for Real-World Applications 66154
In today’s AI-driven world, turning unstructured visual data—like scanned invoices, handwritten notes, or multilingual PDFs—into structured, machine-readable formats is a…
MinerU: High-Precision Open-Source Document Parsing for Real-World PDFs, Tables, and Formulas 50296
Converting real-world documents—especially PDFs containing mixed content like equations, tables, multi-column layouts, and scanned text—into clean, structured, machine-readable formats remains…
MetaGPT: Automate Full Software Development with AI Agents That Work Like a Real Engineering Team 60511
Building reliable software from natural language prompts remains a major challenge—even for today’s most capable large language models (LLMs). While…
LlamaFactory: Fine-Tune 100+ Language Models Effortlessly—No Coding Required 63856
Fine-tuning large language models (LLMs) used to be a complex, time-consuming endeavor—requiring deep expertise in deep learning frameworks, custom code…
llama.cpp: Run Large Language Models Anywhere—Fast, Lightweight, and Offline 91182
In an era where large language models (LLMs) power everything from chatbots to code assistants, deploying them outside of cloud…
BitNet: Run 1.58-Bit LLMs Locally on CPUs with 6x Speedup and 82% Less Energy 24452
Running large language models (LLMs) used to require powerful GPUs, expensive cloud infrastructure, or specialized hardware—until BitNet changed the game.…