Awesome Video-text Retrieval Papers and Source Codes

InternVideo: Build Powerful Video-Language AI Without Massive Compute or Data 2131

Building capable video-language AI systems has long been a resource-intensive endeavor—requiring vast video datasets, weeks of training on dozens of…

12/27/2025Video Question Answering, Video-text Retrieval, Zero-shot Video Classification

VideoMamba: Efficient Long- and Short-Term Video Understanding Without the Compute Overhead 1044

Video understanding has long been bottlenecked by two competing demands: capturing fine-grained local motion while simultaneously modeling long-range temporal dependencies.…

12/26/2025Action Recognition, Video Understanding, Video-text Retrieval