Zichen Ding

heroding77

https://heroding77.github.io/

Recent Activity

submitted a paper about 1 month ago

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

upvoted 4 papers about 1 month ago

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

Paper • 2605.28424 • Published May 27 • 32

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published May 25 • 103

ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions

Paper • 2605.20087 • Published May 19 • 18

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published May 18 • 50

upvoted a paper 2 months ago

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Paper • 2604.15093 • Published Apr 16 • 30

upvoted 2 papers 3 months ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published Apr 2 • 103

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 134

upvoted 4 papers 5 months ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published Feb 5 • 61

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published Feb 2 • 33

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published Feb 2 • 35

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Paper • 2601.12346 • Published Jan 18 • 52

upvoted a collection 6 months ago

Paper

Check out our paper list! • 13 items • Updated Feb 20 • 3

upvoted 2 papers 6 months ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Paper • 2601.07779 • Published Jan 12 • 28

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 60

upvoted a collection 7 months ago

PaCo-RL

Data and Model collection for PaCo-RL • 8 items • Updated Mar 2 • 9

upvoted a paper 7 months ago

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Paper • 2512.04784 • Published Dec 2, 2025 • 25

upvoted 4 papers 8 months ago

InteractScience: Programmatic and Visually-Grounded Evaluation of Interactive Scientific Demonstration Code Generation

Paper • 2510.09724 • Published Oct 10, 2025 • 11

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 73

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 99

QueST: Incentivizing LLMs to Generate Difficult Problems

Paper • 2510.17715 • Published Oct 20, 2025 • 36