Skip to content

PaperCodex

Subscribe

Image Classification

LSKNet: A Lightweight, High-Performance Backbone for Remote Sensing Object Detection, Segmentation, and Classification

LSKNet: A Lightweight, High-Performance Backbone for Remote Sensing Object Detection, Segmentation, and Classification 639

Remote sensing imagery—captured from satellites, drones, or aircraft—presents unique challenges for computer vision systems. Objects are often small, densely packed,…

01/13/2026Image Classification, Remote Sensing Object Detection, Semantic Segmentation
PytorchInsight: Boost CNN Performance with Lightweight, Plug-and-Play Attention Modules for Vision Tasks

PytorchInsight: Boost CNN Performance with Lightweight, Plug-and-Play Attention Modules for Vision Tasks 871

PytorchInsight is a practical, research-oriented PyTorch library designed to accelerate deep learning development—especially for computer vision practitioners who need reliable,…

01/13/2026CNN Attention Mechanisms, Image Classification, Object Detection
DynamicViT: Slash Vision Transformer Compute by 30% Without Sacrificing Accuracy

DynamicViT: Slash Vision Transformer Compute by 30% Without Sacrificing Accuracy 641

Vision Transformers (ViTs) have revolutionized computer vision, but their computational demands remain a major barrier for real-world deployment—especially on edge…

01/13/2026Efficient Vision Transformers, Image Classification, Model Acceleration
UniRepLKNet: A Universal Large-Kernel ConvNet for Faster, Stronger, and Truly Multimodal AI

UniRepLKNet: A Universal Large-Kernel ConvNet for Faster, Stronger, and Truly Multimodal AI 1053

In the era of Vision Transformers and increasingly complex multimodal architectures, convolutional neural networks (ConvNets) have often been written off…

01/04/2026Image Classification, Multimodal Perception, Time-series Forecasting
VMamba: A Linear-Time Vision Backbone for High-Resolution, Scalable Computer Vision Tasks

VMamba: A Linear-Time Vision Backbone for High-Resolution, Scalable Computer Vision Tasks 2969

In the rapidly evolving landscape of computer vision, model efficiency and scalability are no longer optional—they’re essential. Enter VMamba, a…

12/26/2025Image Classification, Object Detection, Semantic Segmentation
MambaVision: Achieve SOTA Image Classification & Downstream Vision Tasks with Hybrid Mamba-Transformer Efficiency

MambaVision: Achieve SOTA Image Classification & Downstream Vision Tasks with Hybrid Mamba-Transformer Efficiency 1946

If you’re building computer vision systems that demand both high accuracy and real-world efficiency—without getting bogged down in architectural complexity—MambaVision…

12/26/2025Image Classification, Object Detection, Semantic Segmentation
AutoTrain: No-Code, Multi-Modal Model Training for Technical Decision-Makers

AutoTrain: No-Code, Multi-Modal Model Training for Technical Decision-Makers 4541

In today’s fast-moving AI landscape, fine-tuning state-of-the-art models on custom data is no longer a luxury—it’s a necessity for building…

12/26/2025Image Classification, LLM Fine-tuning, Text Classification
MambaOut: High-Accuracy Vision Models Without the Mamba Overhead

MambaOut: High-Accuracy Vision Models Without the Mamba Overhead 2609

The vision community has recently seen a surge in adopting sequence modeling architectures—especially Mamba—for image tasks. Inspired by its linear…

12/26/2025Efficient Deep Learning, Image Classification, Vision Backbone
FlexiViT: One Vision Transformer for All Patch Sizes—Deploy Faster or More Accurate Models Without Retraining

FlexiViT: One Vision Transformer for All Patch Sizes—Deploy Faster or More Accurate Models Without Retraining 3276

Vision Transformers (ViTs) have become a cornerstone of modern computer vision, offering strong performance across a wide range of tasks.…

12/22/2025Image Classification, Image-text Retrieval, Semantic Segmentation
FastViT: Achieve State-of-the-Art Speed and Accuracy for Vision Tasks on Mobile and Edge Devices

FastViT: Achieve State-of-the-Art Speed and Accuracy for Vision Tasks on Mobile and Edge Devices 1974

FastViT is a high-performance hybrid vision transformer designed to deliver exceptional speed and accuracy—especially on resource-constrained platforms like mobile phones…

12/22/2025Image Classification, Object Detection, Semantic Segmentation

Posts pagination

1 2 Next
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex