PaperCodex

GRUtopia: Scale Embodied AI Development with a City-Scale Simulated Society for General-Purpose Robots

Developing general-purpose robots that can navigate, interact, and manipulate in real-world urban environments remains one of the most demanding challenges…

12/26/2025 | Embodied AI, Robot Navigation, Sim2Real
IMAGDressing: Generate Controllable, High-Fidelity Virtual Outfits Without Retraining Models

Online fashion retailers, digital content studios, and marketing teams increasingly rely on realistic human imagery to showcase garments—but traditional virtual…

12/26/2025 | Controllable Image Generation, Garment-conditioned Synthesis, Virtual Dressing
MambaOut: High-Accuracy Vision Models Without the Mamba Overhead

The vision community has recently seen a surge in adopting sequence modeling architectures—especially Mamba—for image tasks. Inspired by its linear…

12/26/2025 | Efficient Deep Learning, Image Classification, Vision Backbone
StudioGAN: A Unified, Reproducible Benchmark for Training and Evaluating GANs at Scale

Generative Adversarial Networks (GANs) have long been at the forefront of realistic image synthesis—but using them effectively in research or…

12/26/2025 | GAN Benchmarking, Generative Modeling, Image Synthesis
FlexiViT: One Vision Transformer for All Patch Sizes—Deploy Faster or More Accurate Models Without Retraining

Vision Transformers (ViTs) have become a cornerstone of modern computer vision, offering strong performance across a wide range of tasks.…

12/22/2025 | Image Classification, Image-text Retrieval, Semantic Segmentation
3D-Speaker-Toolkit: Multimodal Speaker Verification and Diarization with Acoustic, Semantic, and Visual Fusion

Speaker analysis—whether for verifying identity, recognizing who’s speaking, or separating voices in a multi-person conversation—is a fundamental task in speech…

12/22/2025 | Multimodal Speech Processing, Speaker Diarization, Speaker Verification
TFB: The Fair, Comprehensive Benchmark for Time Series Forecasting That Solves Reproducibility and Bias Problems

Time series forecasting powers critical decisions across industries—from predicting electricity demand and traffic congestion to estimating disease spread and stock…

12/22/2025 | Multivariate Forecasting, Time-series Forecasting, Univariate Forecasting
CKnowEdit: Fix Chinese Linguistic, Factual & Logical Errors in LLMs Without Retraining

Large language models (LLMs) have made remarkable progress in multilingual understanding—but their performance in Chinese remains uneven, especially when it…

12/22/2025 | Chinese NLP, Factual Correction, Knowledge Editing
FastViT: Achieve State-of-the-Art Speed and Accuracy for Vision Tasks on Mobile and Edge Devices

FastViT is a high-performance hybrid vision transformer designed to deliver exceptional speed and accuracy—especially on resource-constrained platforms like mobile phones…

12/22/2025 | Image Classification, Object Detection, Semantic Segmentation
iTransformer: Invert Your Time Series Forecasting Architecture for Better Scalability, Generalization, and Simplicity

Time series forecasting is a foundational task across finance, energy, logistics, and digital platforms—yet traditional Transformer-based models often struggle with…

12/22/2025 | Long-sequence Forecasting, Multivariate Time Series Forecasting, Zero-shot Time Series Generalization

Copyright © 2026 PaperCodex.