AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play Paper • 2509.24193 • Published Sep 29 • 6
Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards Paper • 2509.21882 • Published Sep 26
Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs Paper • 2511.19773 • Published Nov 24 • 9
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration Paper • 2504.04915 • Published Apr 7
MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale Paper • 2506.04405 • Published Jun 4 • 7
Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models Paper • 2311.00287 • Published Nov 1, 2023
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers Paper • 2404.18443 • Published Apr 29, 2024
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Paper • 2503.07459 • Published Mar 10 • 16