Skip to content

PaperCodex

Subscribe
FlashRAG: A Modular, Lightweight Toolkit for Reproducible and Efficient Retrieval-Augmented Generation Research

FlashRAG: A Modular, Lightweight Toolkit for Reproducible and Efficient Retrieval-Augmented Generation Research 3208

Retrieval-Augmented Generation (RAG) has emerged as a cornerstone technique for enhancing the factual grounding, knowledge scope, and reasoning capabilities of…

12/17/2025Multimodal RAG, Reasoning-Augmented QA, Retrieval-Augmented Generation
HunyuanVideo: Open-Source, High-Fidelity Video Generation That Rivals Closed Models

HunyuanVideo: Open-Source, High-Fidelity Video Generation That Rivals Closed Models 11437

HunyuanVideo is a groundbreaking open-source video foundation model developed by Tencent, designed to deliver professional-grade video generation capabilities without the…

12/17/2025Image-to-video Generation, Multimodal Video Synthesis, Text-to-Video Generation
FireRedASR: Industrial-Grade Mandarin Speech Recognition with SOTA Accuracy and LLM Integration

FireRedASR: Industrial-Grade Mandarin Speech Recognition with SOTA Accuracy and LLM Integration 1658

FireRedASR is an open-source, industrial-grade automatic speech recognition (ASR) system specifically engineered for Mandarin Chinese—but with strong capabilities in Chinese…

12/17/2025Automatic Speech Recognition, LLM-Integrated Speech Processing, Multilingual ASR
UltraRAG: Build Adaptive, Multimodal RAG Systems Without Writing Complex Code

UltraRAG: Build Adaptive, Multimodal RAG Systems Without Writing Complex Code 2325

Retrieval-Augmented Generation (RAG) has become a cornerstone technique for grounding large language models (LLMs) in real-world knowledge. However, building effective…

12/16/2025Adaptive Knowledge Integration, Multimodal Reasoning, Retrieval-Augmented Generation
VLMEvalKit: One-Command Evaluation for 200+ Vision-Language Models Across 80+ Benchmarks

VLMEvalKit: One-Command Evaluation for 200+ Vision-Language Models Across 80+ Benchmarks 3536

Evaluating large vision-language models (LVLMs) used to be a fragmented, time-consuming chore—juggling dozens of benchmark repositories, writing custom data loaders,…

12/16/2025Benchmarking, Multi-modal Evaluation, vision-language modeling
HunFlair: State-of-the-Art Biomedical Named Entity Recognition with Just Four Lines of Code

HunFlair: State-of-the-Art Biomedical Named Entity Recognition with Just Four Lines of Code 14333

Biomedical text is dense with critical information—gene names, chemical compounds, diseases, species—but extracting that information manually is time-consuming and error-prone.…

12/15/2025Biomedical Text Mining, Named Entity Recognition, Sequence Labeling
RAG-Anything: The First All-in-One RAG Framework for Multimodal Documents—Text, Images, Tables & Equations in One System

RAG-Anything: The First All-in-One RAG Framework for Multimodal Documents—Text, Images, Tables & Equations in One System 11048

Retrieval-Augmented Generation (RAG) has revolutionized how we use large language models by grounding their responses in external knowledge. But here’s…

12/15/2025Cross-Modal Knowledge Retrieval, Multimodal Document Understanding, Multimodal Retrieval-Augmented Generation
MiniRAG: Enable Small Language Models to Deliver Powerful RAG with Minimal Resources

MiniRAG: Enable Small Language Models to Deliver Powerful RAG with Minimal Resources 1605

Retrieval-Augmented Generation (RAG) has become a cornerstone technique for grounding language models in factual knowledge. However, traditional RAG pipelines struggle…

12/15/2025Knowledge Graph Reasoning, On-Device AI, Retrieval-Augmented Generation
EASYTOOL: Streamline LLM Agent Tool Usage with Concise, Unified Instructions

EASYTOOL: Streamline LLM Agent Tool Usage with Concise, Unified Instructions 24492

Building capable AI agents that interact with real-world tools—like APIs, software libraries, or external services—is a core challenge in deploying…

12/15/2025Multi-tool Agent Coordination, Task Automation, Tool-augmented Reasoning
AutoGL: Automate Graph Learning Pipelines Without Manual Tuning or Expert GNN Knowledge

AutoGL: Automate Graph Learning Pipelines Without Manual Tuning or Expert GNN Knowledge 1131

Graph-based machine learning has become essential across domains—from social network analysis and fraud detection to drug discovery and recommendation systems.…

12/15/2025Graph Classification, Link Prediction, Node Classification

Posts pagination

Previous 1 … 38 39 40 … 43 Next
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex