VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph Paper • 2602.12735 • Published Feb 13 • 8
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant Paper • 2603.01059 • Published Mar 1 • 1
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Paper • 2603.29620 • Published Mar 31 • 47
SkillFlow: Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents Paper • 2604.17308 • Published Apr 19 • 22
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published May 6 • 102
SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation Paper • 2605.08043 • Published May 8 • 10
Unlocking On-Policy Distillation for Any Model Family: Visualize on-policy distillation token alignment Space • Running • 111