th

CHEN1594

upvoted a paper 3 months ago

Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Paper • 2507.05257 • Published Jul 7, 2025 • 14

upvoted a collection 3 months ago

ReasonMap

A fine-grained visual reasoning benchmark (We show more question types in the extension dataset.) • 3 items • Updated Oct 1, 2025 • 8

upvoted a paper 3 months ago

RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

Paper • 2510.02240 • Published Oct 2, 2025 • 17

upvoted a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

upvoted 6 papers 6 months ago

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3, 2025 • 53

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 89

Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

Paper • 2506.10357 • Published Jun 12, 2025 • 21

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9, 2025 • 93

AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

Paper • 2506.14205 • Published Jun 17, 2025 • 7

Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search

Paper • 2507.02652 • Published Jul 3, 2025 • 26

upvoted a paper 7 months ago

Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

Paper • 2505.18675 • Published May 24, 2025 • 26

upvoted a paper 8 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 154

upvoted a paper 9 months ago

Efficient Reasoning Models: A Survey

Paper • 2504.10903 • Published Apr 15, 2025 • 21