PaperCodex

Multimodal Document Understanding

OmniDocBench: A Real-World, Fine-Grained Benchmark for Fair and Comprehensive PDF Document Parsing Evaluation

Evaluating document parsing systems has long been a frustrating exercise in inconsistency. Many existing benchmarks focus narrowly on clean academic…

12/22/2025 | Document Parsing, Layout Analysis, Multimodal Document Understanding

Paper2Poster: Automate Scientific Poster Creation from PDFs—Editable, Accurate, and Under $0.01

Creating professional academic posters from dense, multi-page scientific papers is a universal pain point for researchers, PhD students, and lab…

12/19/2025 | Automated Academic Communication, Multimodal Document Understanding, Scientific Poster Generation

RAG-Anything: The First All-in-One RAG Framework for Multimodal Documents—Text, Images, Tables & Equations in One System

Retrieval-Augmented Generation (RAG) has revolutionized how we use large language models by grounding their responses in external knowledge. But here’s…

12/15/2025 | Cross-Modal Knowledge Retrieval, Multimodal Document Understanding, Multimodal Retrieval-Augmented Generation

Copyright © 2026 PaperCodex.