PaperCodex

Reinforcement Learning For LLMs

Search-R1: Train LLMs to Reason and Search Like Human Researchers Using Open-Source Reinforcement Learning

In the rapidly evolving landscape of large language models (LLMs), a critical limitation persists: despite their impressive fluency, LLMs often…

12/27/2025
Reinforcement Learning For LLMs, Retrieval-Augmented Generation, Tool-augmented Reasoning

Puppeteer: Dynamic Multi-Agent Orchestration for Efficient, Adaptive LLM Collaboration

Managing complex tasks with large language models (LLMs) often hits a ceiling: while single models excel at narrow tasks, scaling…

12/26/2025
Dynamic Orchestration, Multi-agent Systems, Reinforcement Learning For LLMs

Copyright © 2026 PaperCodex.