PaperCodex

vision-language modeling

Mini-InternVL: Achieve 90% of Multimodal Performance with Just 5% of Model Size for Edge and Consumer Deployments

In an era where multimodal large language models (MLLMs) are rapidly advancing, a critical barrier remains: most high-performing vision-language models…

12/18/2025 | Edge AI, Multimodal Reasoning, vision-language modeling

VLMEvalKit: One-Command Evaluation for 200+ Vision-Language Models Across 80+ Benchmarks

Evaluating large vision-language models (LVLMs) used to be a fragmented, time-consuming chore—juggling dozens of benchmark repositories, writing custom data loaders,…

12/16/2025 | Benchmarking, Multi-modal Evaluation, vision-language modeling

PP-OCR: Ultra-Lightweight, Multilingual OCR and Document AI for Real-World Applications

In today’s AI-driven world, turning unstructured visual data—like scanned invoices, handwritten notes, or multilingual PDFs—into structured, machine-readable formats is a…

12/12/2025 | Document Parsing, Optical Character Recognition, vision-language modeling

MonkeyOCR: High-Accuracy Document Parsing for Complex Layouts with Tables, Formulas, and Multilingual Text: Fast, Lightweight, and Deployable

Parsing complex documents—especially those containing tables, mathematical formulas, mixed layouts, or multilingual content—remains a persistent challenge in real-world AI applications.…

12/11/2025 (updated 12/15/2025) | Document Parsing, Optical Character Recognition (OCR), vision-language modeling

Copyright © 2026 PaperCodex.