For engineers, researchers, and product teams building real-time vision systems—whether for surveillance cameras, autonomous drones, or mobile apps—achieving high detection…
Object Detection
DocLayout-YOLO: Real-Time, High-Accuracy Document Layout Detection Without the Speed-Accuracy Trade-Off 1870
Document layout analysis (DLA) is a foundational task in building real-world document understanding systems—whether you’re extracting structured data from invoices,…
RFBNet: High-Accuracy, Real-Time Object Detection Without Heavy Backbones 1422
When building real-world computer vision systems—whether for autonomous drones, industrial inspection, or mobile apps—one of the toughest trade-offs is between…
GCNet: Boost Vision Models with Lightweight Global Context for Better Accuracy and Efficiency 1217
If you’ve worked on computer vision tasks like object detection or instance segmentation, you’ve likely encountered the challenge of modeling…
VMamba: A Linear-Time Vision Backbone for High-Resolution, Scalable Computer Vision Tasks 2969
In the rapidly evolving landscape of computer vision, model efficiency and scalability are no longer optional—they’re essential. Enter VMamba, a…
MambaVision: Achieve SOTA Image Classification & Downstream Vision Tasks with Hybrid Mamba-Transformer Efficiency 1946
If you’re building computer vision systems that demand both high accuracy and real-world efficiency—without getting bogged down in architectural complexity—MambaVision…
YOLOv6: Real-Time Object Detection Optimized for Speed, Accuracy, and Industrial Deployment 5869
YOLOv6 is a high-performance, single-stage object detection framework developed by Meituan with a strong emphasis on real-world industrial applications. Unlike…
FastViT: Achieve State-of-the-Art Speed and Accuracy for Vision Tasks on Mobile and Edge Devices 1974
FastViT is a high-performance hybrid vision transformer designed to deliver exceptional speed and accuracy—especially on resource-constrained platforms like mobile phones…
YOLOv9: Train-from-Scratch Object Detection That Beats Pretrained Models with Programmable Gradient Information 9391
YOLOv9 marks a significant leap forward in real-time object detection by directly confronting a long-standing but often overlooked problem in…
GhostNet: High-Accuracy Vision Models with Minimal Compute for Edge Deployment 4355
Overview Deploying powerful computer vision models on resource-constrained devices—such as smartphones, IoT sensors, or drones—has long been a major engineering…