PaperCodex

TEQ: Accurate 3- and 4-Bit LLM Quantization Without Inference Overhead 2544

Deploying large language models (LLMs) in production often runs into a hard trade-off: reduce model size and latency through quantization,…

12/22/2025Efficient LLM Inference, Large Language Model Quantization, Weight-only Quantization

YOLOv9: Train-from-Scratch Object Detection That Beats Pretrained Models with Programmable Gradient Information 9391

YOLOv9 marks a significant leap forward in real-time object detection by directly confronting a long-standing but often overlooked problem in…

12/22/2025Instance Segmentation, Object Detection, Panoptic Segmentation

Mulberry: Step-by-Step Multimodal Reasoning with o1-Like Reflection for Trustworthy AI Decisions 1217

Traditional multimodal large language models (MLLMs) often produce answers without revealing how they got there—especially when dealing with complex questions…

12/22/2025Interpretable AI, Multimodal Reasoning, Visual Question Answering

ELF: Train Real-Time Strategy AI Bots 10x Faster with a Lightweight, Flexible RL Platform 2094

Reinforcement learning (RL) for real-time strategy (RTS) games has long been bottlenecked by slow simulation, rigid environment interfaces, and high…

12/22/2025Multi-Agent Training, Real-Time Strategy Game AI, Reinforcement Learning

AdaNet: Automate High-Quality Model Ensembling with Minimal Effort in TensorFlow 3462

In modern machine learning workflows, teams often face a tough trade-off: spend days or weeks manually tuning architectures and hyperparameters,…

12/22/2025Automated Machine Learning, Ensemble Learning, Structured Data Classification

GhostNet: High-Accuracy Vision Models with Minimal Compute for Edge Deployment 4355

Overview Deploying powerful computer vision models on resource-constrained devices—such as smartphones, IoT sensors, or drones—has long been a major engineering…

12/22/2025Edge AI, Image Classification, Object Detection

EfficientViT-SAM: Real-Time, High-Accuracy Image Segmentation Without Compromise 3102

If you’ve worked with Meta’s Segment Anything Model (SAM), you know its power—and its pain points. While SAM delivers state-of-the-art…

12/22/2025Image Segmentation, Real-time Computer Vision, Zero-shot Segmentation

Bitnet.cpp: Run 1.58-Bit LLMs at the Edge with Lossless Speed and Efficiency 24456

Large language models (LLMs) are becoming increasingly central to real-world applications—but their computational demands remain a major barrier for edge…

12/22/2025Edge Inference, Low-bit LLMs, On-Device AI

SWE-Lancer: Benchmark Real-World Freelance Coding Tasks to Measure LLMs’ True Engineering Value 1438

Evaluating large language models (LLMs) on synthetic coding benchmarks often fails to reflect their real-world utility. Enter SWE-Lancer—a rigorously constructed…

12/22/2025Code Generation, Software Engineering Evaluation, Technical Decision-making

Open-Sora Plan: Open-Source High-Quality Long Video Generation for Real-World Applications 12044

Open-Sora Plan is an open-source initiative designed to democratize access to state-of-the-art video generation capabilities. Inspired by the promise of…

12/22/2025Image-to-Video Synthesis, Open-source Video Diffusion Models, Text-to-Video Generation