Skip to content

PaperCodex

Subscribe

Visual Representation Learning

SEED-Voken: Scalable, High-Fidelity Visual Tokenization for Autoregressive Image and Video Generation

SEED-Voken: Scalable, High-Fidelity Visual Tokenization for Autoregressive Image and Video Generation 984

SEED-Voken is an open-source toolkit developed by Tencent ARC that delivers state-of-the-art visual tokenizers tailored for autoregressive visual generation. Built…

01/13/2026Autoregressive Image Generation, Video Tokenization, Visual Representation Learning
MIEB: Benchmark 130 Image & Image-Text Tasks Across 38 Languages for Reliable Model Evaluation

MIEB: Benchmark 130 Image & Image-Text Tasks Across 38 Languages for Reliable Model Evaluation 3016

Evaluating image embedding models has long been a fragmented and inconsistent process. Researchers and engineers often test models on narrow,…

12/19/2025Cross-Modal Retrieval, Image Embedding Evaluation, Visual Representation Learning
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex