Awesome Retrieval-Augmented Generation Papers and Source Codes

DeepRetrieval: Boost Search Accuracy by 2.6× Without Any Labeled Data—Powered by Reinforcement Learning 679

Imagine you’re building a retrieval-augmented generation (RAG) system, a scientific literature assistant, or a natural-language interface to a clinical trial…

01/13/2026Information Retrieval, Query Rewriting, Retrieval-Augmented Generation

IoA: Enable Heterogeneous AI Agents to Collaborate Like the Internet — Solve Complex Tasks Beyond Single-Agent Limits 770

Imagine a world where AI agents—each with unique skills like web browsing, code execution, or data analysis—can autonomously find one…

01/13/2026Embodied AI, Multi-agent Collaboration, Retrieval-Augmented Generation

DISC-FinLLM: A Specialized Chinese Financial LLM for Accurate, Context-Aware Financial Intelligence 818

If you’re building AI-powered tools for the Chinese financial sector—whether for banking, fintech, investment research, or regulatory compliance—you’ve likely run…

01/13/2026Financial Large Language Model, Financial Text Analysis, Retrieval-Augmented Generation

RAG Foundry (RAG-FiT): Build, Train, and Evaluate Domain-Specific RAG Systems Without the Complexity 750

Building effective Retrieval-Augmented Generation (RAG) systems is notoriously difficult. Practitioners must juggle data preparation, retrieval integration, prompt engineering, model fine-tuning,…

01/13/2026Parameter-Efficient Fine-Tuning, RAG Evaluation, Retrieval-Augmented Generation

EasyRAG: A Lightweight, High-Accuracy RAG Framework for Resource-Constrained Network Operations and Enterprise QA 584

In today’s fast-paced IT and enterprise environments, teams increasingly rely on retrieval-augmented generation (RAG) systems to provide accurate, context-aware answers…

01/09/2026Automated Network Operations, Question Answering, Retrieval-Augmented Generation

FinTeam: A Multi-Agent Financial Intelligence System That Generates Human-Accepted Reports and Outperforms GPT-4o 779

Financial analysis is rarely a solo endeavor. In real-world institutions—from investment banks to asset management firms—complex tasks like producing quarterly…

01/05/2026Financial Reasoning, Multi-agent Systems, Retrieval-Augmented Generation

LMCache: Slash LLM Inference Latency and Multiply Throughput with Enterprise-Grade KV Cache Reuse 6375

Deploying large language models (LLMs) at scale introduces a familiar bottleneck: the growing size of Key-Value (KV) caches rapidly outpaces…

01/04/2026KV Cache Reuse, LLM Inference Optimization, Retrieval-Augmented Generation

FlagEmbedding: High-Performance, Task-Aware Text Embeddings for Multilingual RAG and Semantic Search 10677

Modern AI applications—from customer support chatbots to enterprise knowledge retrieval—rely heavily on high-quality text embeddings to power semantic search and…

12/27/2025Retrieval-Augmented Generation, Semantic Search, Text Embedding