Toward an Evaluation Science for Generative AI Systems Paper • 2503.05336 • Published Mar 7, 2025 • 2
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning Paper • 2502.11271 • Published Feb 16, 2025 • 18
Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies Paper • 2502.02533 • Published Feb 4, 2025 • 4