Awesome Reasoning Papers and Source Codes

TTRL: Boost LLM Reasoning Without Labels Using Test-Time Reinforcement Learning 836

Imagine being able to improve a large language model’s (LLM) reasoning capabilities after deployment, using only unlabeled test data—no ground-truth…

01/05/2026Reasoning, Reinforcement Learning, Test-time Scaling

Tree of Thoughts: Unlock Strategic Reasoning in LLMs for Complex Problem Solving 5714

Large language models (LLMs) have transformed how we approach tasks ranging from coding assistance to content generation. Yet, their standard…

12/27/2025Planning, Reasoning, Search

Reasoning Gym: Train and Evaluate Reasoning Models with Infinite, Verifiable Reinforcement Learning Environments 1265

If you’re building or evaluating reasoning-capable AI systems—especially large language models (LLMs)—you’ve likely hit a wall with static benchmarks. Traditional…

12/19/2025Procedural Task Generation, Reasoning, Reinforcement Learning