MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform Paper • 2506.00308 • Published May 30, 2025
Who's Asking? Simulating Role-Based Questions for Conversational AI Evaluation Paper • 2510.16829 • Published Oct 19, 2025
Knowledge Graph Guided Evaluation of Abstention Techniques Paper • 2412.07430 • Published Dec 10, 2024
Evaluating Large Language Models for Health-related Queries with Presuppositions Paper • 2312.08800 • Published Dec 14, 2023