Skip to content

PaperCodex

Subscribe
Chronos: The First AI Built for Debugging—Not Code Generation

Chronos: The First AI Built for Debugging—Not Code Generation 5310

Despite massive advances in large language models (LLMs) for coding, a silent crisis persists: debugging remains largely unsolved. Top models…

12/18/2025AI-powered Software Maintenance, Autonomous Debugging, Repository-scale Code Understanding
Dolphin: Lightweight, Accurate Document Image Parsing for Real-World Mixed-Content Pages

Dolphin: Lightweight, Accurate Document Image Parsing for Real-World Mixed-Content Pages 7904

Parsing complex document images—those containing intertwined text paragraphs, tables, mathematical formulas, figures, and code—is a persistent challenge in applied AI.…

12/18/2025Document Image Parsing, Layout Analysis, Multimodal Understanding
VLM-R1: Boost Visual Reasoning and Generalization with R1-Style Reinforcement Learning for Vision-Language Models

VLM-R1: Boost Visual Reasoning and Generalization with R1-Style Reinforcement Learning for Vision-Language Models 5743

If you’re working on vision-language tasks that require precise reasoning—like identifying objects based on natural language descriptions, detecting UI defects…

12/18/2025Multimodal Reasoning, Open-Vocabulary Detection, Referring Expression Comprehension
LiteCUA: Bridge the Gap Between LLMs and Real Computers with Lightweight, Context-Aware Automation

LiteCUA: Bridge the Gap Between LLMs and Real Computers with Lightweight, Context-Aware Automation 4853

Imagine an AI agent that doesn’t just talk about using a computer—it actually uses one. That’s the promise of LiteCUA,…

12/18/2025Computer Use Agent, Contextualized Agent Environment, OS-level Automation
RSL-RL: A Lightweight, Robotics-Optimized RL Library for Fast Sim-to-Real Transfer

RSL-RL: A Lightweight, Robotics-Optimized RL Library for Fast Sim-to-Real Transfer 1956

Reinforcement learning (RL) has become a cornerstone of modern robotics research, yet many general-purpose RL libraries fall short when it…

12/18/2025Reinforcement Learning For Robotics, Robotic Control, Sim-to-real Transfer
SmolVLA: High-Performance Vision-Language-Action Robotics on a Single GPU

SmolVLA: High-Performance Vision-Language-Action Robotics on a Single GPU 20075

SmolVLA is a compact yet capable Vision-Language-Action (VLA) model designed to bring state-of-the-art robot control within reach of researchers, educators,…

12/18/2025Imitation Learning, Robotic Manipulation, Vision-Language-Action Modeling
StableVideo: Text-Driven Video Editing with Frame-to-Frame Consistency

StableVideo: Text-Driven Video Editing with Frame-to-Frame Consistency 1444

Editing objects in existing videos while preserving their appearance across time has long been a challenge for diffusion-based models. While…

12/18/2025Temporal Consistency, Text-to-Video Generation, Video Editing
ElizaOS: The Web3-Friendly AI Agent Framework That Just Works

ElizaOS: The Web3-Friendly AI Agent Framework That Just Works 17177

In today’s fast-evolving landscape of artificial intelligence and decentralized systems, developers increasingly need tools that bridge the gap between large…

12/18/2025Autonomous AI Agents, Multi-agent Systems, Web3 Integration
ComfyUI-R1: Automate Complex AI Art Workflows with Reasoning-Powered Generation and Debugging

ComfyUI-R1: Automate Complex AI Art Workflows with Reasoning-Powered Generation and Debugging 3890

Building visual AI workflows in ComfyUI offers immense creative flexibility—but mastering its node-based interface demands significant expertise. Users often struggle…

12/18/2025Automated Debugging, Parameter Optimization, Workflow Generation
Paper2Video: Automatically Turn Scientific Papers into Ready-to-Use Presentation Videos

Paper2Video: Automatically Turn Scientific Papers into Ready-to-Use Presentation Videos 1860

Creating high-quality academic presentation videos is notoriously time-consuming. Researchers often spend hours designing slides, recording voiceovers, editing footage, and syncing…

12/17/2025Academic Video Automation, Automatic Video Generation, Multimodal Presentation Synthesis

Posts pagination

Previous 1 … 34 35 36 … 43 Next
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex