-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 69 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 38 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
Collections
Discover the best community collections!
Collections including paper arxiv:2507.09477
-
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs
Paper • 2507.09477 • Published • 88 -
BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering
Paper • 2507.04127 • Published • 10 -
Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers
Paper • 2507.06223 • Published • 14 -
SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension
Paper • 2508.01959 • Published • 60
-
π^3: Scalable Permutation-Equivariant Visual Geometry Learning
Paper • 2507.13347 • Published • 67 -
Voxtral
Paper • 2507.13264 • Published • 34 -
SingLoRA: Low Rank Adaptation Using a Single Matrix
Paper • 2507.05566 • Published • 116 -
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs
Paper • 2507.09477 • Published • 88
-
Efficient Agents: Building Effective Agents While Reducing Cost
Paper • 2508.02694 • Published • 86 -
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
Paper • 2508.07407 • Published • 99 -
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
Paper • 2508.09736 • Published • 58 -
Memp: Exploring Agent Procedural Memory
Paper • 2508.06433 • Published • 36
-
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs
Paper • 2507.09477 • Published • 88 -
Replacing thinking with tool usage enables reasoning in small language models
Paper • 2507.05065 • Published • 17 -
AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research
Paper • 2507.13300 • Published • 20 -
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models
Paper • 2507.07104 • Published • 46
-
Reinforcement Pre-Training
Paper • 2506.08007 • Published • 265 -
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
Paper • 2506.06395 • Published • 135 -
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models
Paper • 2506.05176 • Published • 82 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 282
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 69 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 38 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
-
Efficient Agents: Building Effective Agents While Reducing Cost
Paper • 2508.02694 • Published • 86 -
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
Paper • 2508.07407 • Published • 99 -
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
Paper • 2508.09736 • Published • 58 -
Memp: Exploring Agent Procedural Memory
Paper • 2508.06433 • Published • 36
-
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs
Paper • 2507.09477 • Published • 88 -
BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering
Paper • 2507.04127 • Published • 10 -
Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers
Paper • 2507.06223 • Published • 14 -
SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension
Paper • 2508.01959 • Published • 60
-
π^3: Scalable Permutation-Equivariant Visual Geometry Learning
Paper • 2507.13347 • Published • 67 -
Voxtral
Paper • 2507.13264 • Published • 34 -
SingLoRA: Low Rank Adaptation Using a Single Matrix
Paper • 2507.05566 • Published • 116 -
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs
Paper • 2507.09477 • Published • 88
-
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs
Paper • 2507.09477 • Published • 88 -
Replacing thinking with tool usage enables reasoning in small language models
Paper • 2507.05065 • Published • 17 -
AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research
Paper • 2507.13300 • Published • 20 -
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models
Paper • 2507.07104 • Published • 46
-
Reinforcement Pre-Training
Paper • 2506.08007 • Published • 265 -
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
Paper • 2506.06395 • Published • 135 -
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models
Paper • 2506.05176 • Published • 82 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 282