PaperCodex
Chinese CLIP: Enable Zero-Shot Chinese Vision-Language AI Without Custom Training


Multimodal AI models like OpenAI’s CLIP have transformed how developers build systems that understand both images and text. But there’s…

12/27/2025 · Cross-Modal Retrieval, Vision-Language Pretraining, Zero-Shot Image Classification
XLNet: Bidirectional Language Understanding Without Masked Input Limitations


XLNet is a breakthrough in language modeling that effectively bridges the gap between autoregressive (AR) and autoencoding (AE) pretraining paradigms.…

12/27/2025 · Question Answering, Reading Comprehension, Text Classification
Qwen-VL: Open-Source Vision-Language AI for Text Reading, Object Grounding, and Multimodal Reasoning


In the rapidly evolving landscape of multimodal artificial intelligence, developers and technical decision-makers need models that go beyond basic image…

12/27/2025 · Multimodal Reasoning, Vision-Language Modeling, Visual Question Answering
NeMo: Build Production-Grade Speech, LLM, and Multimodal AI Faster with NVIDIA’s Optimized Framework


NVIDIA NeMo is a cloud-native, open-source framework designed for developers, research engineers, and technical decision-makers who need to build, customize,…

12/27/2025 · Automatic Speech Recognition, Large Language Models, Multimodal Learning
MetaCLIP: Superior Vision-Language Models Through Transparent, High-Quality Data Curation


If you’ve worked with OpenAI’s CLIP, you know its power—but also its opacity. CLIP revolutionized zero-shot vision-language understanding, yet it…

12/27/2025 · Contrastive Learning, Multilingual Vision-Language Modeling, Zero-Shot Image Classification
SPIN: Boost Your LLM’s Performance Without New Human Annotations—Just Use Self-Play Fine-Tuning


Imagine you’ve fine-tuned a language model using a standard Supervised Fine-Tuning (SFT) dataset—like Zephyr-7B on UltraChat—but you don’t have access…

12/27/2025 · Language Model Alignment, Preference-Free Optimization, Self-Supervised Fine-Tuning
RFBNet: High-Accuracy, Real-Time Object Detection Without Heavy Backbones


When building real-world computer vision systems—whether for autonomous drones, industrial inspection, or mobile apps—one of the toughest trade-offs is between…

12/27/2025 · Edge AI, Object Detection, Real-Time Inference
3DDFA_V2: Real-Time, CPU-Efficient 3D Face Alignment for Video and Edge Applications


If you’re building applications that require real-time 3D facial understanding—like video conferencing enhancements, augmented reality filters, biometric verification, or character…

12/27/2025 · 3D Face Alignment, Dense Facial Landmark Estimation, Real-Time Face Tracking
Bunny: High-Performance Multimodal AI Without the Heavy Compute Burden


Multimodal Large Language Models (MLLMs) are transforming how machines understand and reason about visual content. Yet, their adoption remains out…

12/27/2025 · Efficient Inference, Multimodal Reasoning, Vision-Language Modeling
Step-Video-T2V: Generate High-Quality, Long-Form Videos from Text in English and Chinese


Step-Video-T2V is a state-of-the-art open-source text-to-video foundation model developed by StepFun AI. With 30 billion parameters and the ability to…

12/27/2025 · Multimodal Foundation Models, Text-to-Video Generation, Video Diffusion Models

Copyright © 2026 PaperCodex.