Skip to content

PaperCodex

Subscribe
PaperDebugger: AI-Powered In-Editor Academic Writing Assistant for Overleaf Users

PaperDebugger: AI-Powered In-Editor Academic Writing Assistant for Overleaf Users 1144

Academic writing is a deeply iterative and often fragmented process. Researchers routinely juggle LaTeX editors like Overleaf, reference managers, peer…

12/26/2025Academic Writing Assistance, In-Editor AI Editing, Multi-agent Systems
AutoAgent: Build Powerful LLM Agents with Zero Code—Just Use Natural Language

AutoAgent: Build Powerful LLM Agents with Zero Code—Just Use Natural Language 8280

Building AI agents today usually means writing code. Frameworks like LangChain and AutoGen have unlocked incredible capabilities—but they also demand…

12/26/2025LLM Agents, Natural Language Programming, Zero-code AI
VMamba: A Linear-Time Vision Backbone for High-Resolution, Scalable Computer Vision Tasks

VMamba: A Linear-Time Vision Backbone for High-Resolution, Scalable Computer Vision Tasks 2969

In the rapidly evolving landscape of computer vision, model efficiency and scalability are no longer optional—they’re essential. Enter VMamba, a…

12/26/2025Image Classification, Object Detection, Semantic Segmentation
OMG-Seg: One Unified Model for All Segmentation Tasks—No More Fragmented Pipelines

OMG-Seg: One Unified Model for All Segmentation Tasks—No More Fragmented Pipelines 1338

For years, computer vision practitioners have juggled a patchwork of specialized models to tackle different segmentation tasks—semantic, instance, panoptic, video,…

12/26/2025Instance Segmentation, Panoptic Segmentation, Semantic Segmentation
EvoX: Distributed GPU-Accelerated Evolutionary Computation for Large-Scale Optimization

EvoX: Distributed GPU-Accelerated Evolutionary Computation for Large-Scale Optimization 1598

Evolutionary Computation (EC) has long been a powerful approach for solving complex optimization problems—especially where gradients are unavailable, environments are…

12/26/2025Evolutionary Computation, Hyperparameter Optimization, Neuroevolution
Agents: Build, Evolve, and Deploy Autonomous Language Agents Without Heavy Coding

Agents: Build, Evolve, and Deploy Autonomous Language Agents Without Heavy Coding 5778

In today’s fast-moving AI landscape, organizations and researchers increasingly need intelligent systems that don’t just respond to commands—but plan, collaborate,…

12/26/2025Autonomous Agents, Language Agent Learning, Multi-agent Systems
HeterPS: Accelerate Deep Learning Training Across Mixed Hardware with Reinforcement Learning-Based Scheduling

HeterPS: Accelerate Deep Learning Training Across Mixed Hardware with Reinforcement Learning-Based Scheduling 23500

Training large-scale deep neural networks (DNNs) efficiently is a persistent challenge—especially when your infrastructure includes a mix of hardware like…

12/26/2025Distributed Training, Heterogeneous Computing, Large-scale Deep Learning
Video-LLaVA: One Unified Model for Both Image and Video Understanding—No More Modality Silos

Video-LLaVA: One Unified Model for Both Image and Video Understanding—No More Modality Silos 3417

If you’re evaluating vision-language models for a project that involves both images and videos, you’ve probably faced a frustrating trade-off:…

12/26/2025Multimodal Understanding, Video-language Modeling, Visual Question Answering
SpeechAlign: Bridging the Gap Between Realistic and Human-Preferred Speech Generation

SpeechAlign: Bridging the Gap Between Realistic and Human-Preferred Speech Generation 1396

Recent advances in speech language models (SLMs) have made it possible to generate highly realistic speech—often indistinguishable from human voices…

12/26/2025Neural Codec Modeling, Preference Alignment, Speech Generation
mPLUG-Owl: Modular Multimodal AI for Real-World Vision-Language Tasks

mPLUG-Owl: Modular Multimodal AI for Real-World Vision-Language Tasks 2537

In today’s AI-driven product landscape, the ability to understand both images and text isn’t just a research novelty—it’s a practical…

12/26/2025Instruction-following Multimodal Models, Multimodal Reasoning, Vision-language Understanding

Posts pagination

Previous 1 … 17 18 19 … 43 Next
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex