Awesome Visual Reasoning Grounding Papers and Source Codes

Visual-RFT: Boost Vision-Language Model Performance with Minimal Data Using Reinforcement Fine-Tuning 2276

When labeled visual data is scarce—think dozens or hundreds of examples per category—traditional supervised fine-tuning (SFT) often falls short. Enter…

12/19/2025Few-shot Object Detection, Fine-grained Image Classification, Visual Reasoning Grounding