arxiv:2601.11518
Jonathan Roberts PRO
jonathan-roberts1
AI & ML interests
VLMs, LLMs, LMMs
Recent Activity
updated a dataset 1 day ago
jonathan-roberts1/zerobench liked a dataset 21 days ago
google/deepsearchqa upvoted a paper 2 months ago
Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games