Divye Dixit
Sankhya0
·
AI & ML interests
None yet
Organizations
RL
-
ExGRPO: Learning to Reason from Experience
Paper • 2510.02245 • Published • 80 -
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
Paper • 2508.07407 • Published • 98 -
rStar2-Agent: Agentic Reasoning Technical Report
Paper • 2508.20722 • Published • 117 -
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Paper • 2508.19828 • Published • 7
Embodied ai
RL
-
ExGRPO: Learning to Reason from Experience
Paper • 2510.02245 • Published • 80 -
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
Paper • 2508.07407 • Published • 98 -
rStar2-Agent: Agentic Reasoning Technical Report
Paper • 2508.20722 • Published • 117 -
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Paper • 2508.19828 • Published • 7
models
0
None public yet
datasets
0
None public yet