Awesome Test-time Scaling Papers and Source Codes

TTRL: Boost LLM Reasoning Without Labels Using Test-Time Reinforcement Learning 836

Imagine being able to improve a large language model’s (LLM) reasoning capabilities after deployment, using only unlabeled test data—no ground-truth…

01/05/2026Reasoning, Reinforcement Learning, Test-time Scaling

SkyThought: Boost Code Generation Accuracy Without Retraining—Even Small Models Beat GPT-4o-mini 3358

SkyThought is an open-source framework built around S*—a breakthrough test-time scaling approach designed specifically to elevate code generation performance in…

12/18/2025Code Generation, Program Synthesis, Test-time Scaling

S1: Boost Reasoning Performance with Just 1,000 Examples and Smart Test-Time Scaling 6613

In the rapidly evolving landscape of large language models (LLMs), achieving strong reasoning capabilities often comes at the cost of…

12/18/2025Mathematical Reasoning, Structured Reasoning, Test-time Scaling