PaperCodex

DB-GPT-Hub: Fine-Tune LLMs for Accurate Text-to-SQL Without Breaking the Bank 1945

If you’ve ever tried building a natural language interface to a relational database, you know the real bottleneck isn’t the…

12/26/2025Natural Language To Code, Parameter-Efficient Fine-Tuning, Text-to-SQL

MedSegDiff: Accurate Medical Image Segmentation Using Diffusion Models and Vision Transformers 1335

Medical image segmentation—the process of delineating organs, tumors, or tissues in scans like MRI or dermoscopic images—is a foundational task…

12/26/2025Diffusion Probabilistic Models, Medical Image Segmentation, Vision Transformers

mPLUG-DocOwl: High-Accuracy, OCR-Free Document Understanding for Enterprise and Research Workflows 2261

In today’s data-driven world, extracting structured, actionable insights from digital documents—such as invoices, reports, scientific papers, or web pages—is a…

12/26/2025Chart And Table Interpretation, Multimodal Document Question Answering, OCR-free Document Understanding

OpenHands: Empower AI Agents to Code, Debug, and Ship Like Human Developers 65759

In today’s fast-paced software landscape, developers are under constant pressure to write, test, debug, and deploy code faster than ever—often…

12/26/2025Agent-Based Automation, AI Software Development, Autonomous Coding Agents

PaperDebugger: AI-Powered In-Editor Academic Writing Assistant for Overleaf Users 1144

Academic writing is a deeply iterative and often fragmented process. Researchers routinely juggle LaTeX editors like Overleaf, reference managers, peer…

12/26/2025Academic Writing Assistance, In-Editor AI Editing, Multi-agent Systems

AutoAgent: Build Powerful LLM Agents with Zero Code—Just Use Natural Language 8280

Building AI agents today usually means writing code. Frameworks like LangChain and AutoGen have unlocked incredible capabilities—but they also demand…

12/26/2025LLM Agents, Natural Language Programming, Zero-code AI

VMamba: A Linear-Time Vision Backbone for High-Resolution, Scalable Computer Vision Tasks 2969

In the rapidly evolving landscape of computer vision, model efficiency and scalability are no longer optional—they’re essential. Enter VMamba, a…

12/26/2025Image Classification, Object Detection, Semantic Segmentation

OMG-Seg: One Unified Model for All Segmentation Tasks—No More Fragmented Pipelines 1338

For years, computer vision practitioners have juggled a patchwork of specialized models to tackle different segmentation tasks—semantic, instance, panoptic, video,…

12/26/2025Instance Segmentation, Panoptic Segmentation, Semantic Segmentation

EvoX: Distributed GPU-Accelerated Evolutionary Computation for Large-Scale Optimization 1598

Evolutionary Computation (EC) has long been a powerful approach for solving complex optimization problems—especially where gradients are unavailable, environments are…

12/26/2025Evolutionary Computation, Hyperparameter Optimization, Neuroevolution

Agents: Build, Evolve, and Deploy Autonomous Language Agents Without Heavy Coding 5778

In today’s fast-moving AI landscape, organizations and researchers increasingly need intelligent systems that don’t just respond to commands—but plan, collaborate,…

12/26/2025Autonomous Agents, Language Agent Learning, Multi-agent Systems