Awesome Reinforcement Learning From AI Feedback Papers and Source Codes

PokeeResearch: Open-Source, High-Accuracy Deep Research Agent with Self-Verification and RL-Optimized Reasoning 1595

In today’s fast-moving technical and research environments, teams need reliable, up-to-date answers to complex questions—without the black-box limitations or high…

01/04/2026Deep Research Agent, Reinforcement Learning From AI Feedback, Retrieval-augmented Reasoning