PaperCodex

LocAgent: Pinpoint Code Changes Instantly with Graph-Guided LLM Reasoning

Locating the precise files or functions that need modification when addressing a bug report or feature request is one of…

01/09/2026 | Code Localization, Graph-based Reasoning, LLM-guided Software Maintenance
Strawberry Fields: Build, Simulate, and Optimize Photonic Quantum Circuits in Python

If you’re exploring quantum computing but want to move beyond abstract theory and into hands-on experimentation—especially with light-based (photonic) systems—Strawberry…

01/09/2026 | Graph Optimization, Quantum Machine Learning, Quantum Simulation
XVerse: Precise Multi-Subject Image Generation with Independent Identity and Attribute Control

Generating realistic images with multiple distinct subjects—each retaining their unique identity and visual attributes like pose, lighting, or clothing style—has…

01/09/2026 | Controllable Image Generation, Multi-subject Image Synthesis, Text-to-Image Generation
ICPC-Eval: Stress-Test LLM Reasoning with Real-World Competitive Programming Challenges

Evaluating the true reasoning capabilities of large language models (LLMs) in coding has long been hampered by benchmarks that are…

01/09/2026 | Algorithmic Reasoning, Code Generation, Model Evaluation
EmbodiedScan: A First-Person 3D Perception Suite for Building Language-Grounded Embodied AI Agents

In the rapidly evolving field of embodied artificial intelligence (AI), agents—whether physical robots or virtual avatars—must understand complex indoor environments…

01/09/2026 | 3D Visual Grounding, Embodied AI Perception, Multi-view 3D Object Detection
Cosmos-Transfer1: Generate Realistic, Controllable World Simulations from Multimodal Inputs for Robotics and Autonomous Driving

Cosmos-Transfer1 is a powerful conditional world generation model developed by NVIDIA as part of its Cosmos World Foundation Models (WFMs)…

01/09/2026 | Conditional Image Generation, Multimodal Synthesis, World-to-world Transfer
NextStep-1: High-Fidelity Autoregressive Image Generation Without Diffusion or Discrete Token Loss

Autoregressive (AR) models have long dominated natural language generation, but applying the same step-by-step prediction approach to images has been…

01/09/2026 | Autoregressive Modeling, Image Editing, Text-to-Image Generation
RLinf: Accelerate Large-Scale Reinforcement Learning for Agentic AI and Embodied Intelligence

Reinforcement learning (RL) is rapidly becoming the engine behind next-generation agentic AI—powering everything from math-reasoning language models to vision-guided robotic…

01/09/2026 | Embodied Intelligence, Reasoning Agents, Reinforcement Learning
LongLive: Real-Time Interactive Long Video Generation with Seamless Prompt Control

Creating long, coherent, and high-quality videos from text has long been a formidable challenge in generative AI. Existing approaches—especially diffusion-based…

01/09/2026 | Interactive Video Synthesis, Long-form Video Generation, Text-to-Video Generation
Cube: Generate 3D Assets from Text Prompts—No Modeling Skills Required

Imagine describing a “mechanical lobster with tank treads” in plain English and instantly getting a usable 3D model—no Blender expertise,…

01/09/2026 | 3D Shape Modeling, Multimodal AI, Text-to-3D Generation

Copyright © 2026 PaperCodex.
