AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
daVinci-Dev: Agent-native Mid-training for Software Engineering Paper • 2601.18418 • Published 4 days ago • 120
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 8 days ago • 176
ViDoRe V3: A Comprehensive Evaluation of Retrieval Augmented Generation in Complex Real-World Scenarios Paper • 2601.08620 • Published 17 days ago • 10
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts Paper • 2601.11044 • Published 15 days ago • 34
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance Paper • 2601.14171 • Published 10 days ago • 48
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents Paper • 2601.12346 • Published 12 days ago • 49
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper • 2601.13836 • Published 10 days ago • 34
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development Paper • 2601.11077 • Published 14 days ago • 64
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 580 items • Updated 1 day ago • 80