Zhibei

zhibei1204

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

Ockham98/TryOn-Adapter

upvoted a paper about 1 month ago

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

upvoted a paper about 1 month ago

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

Organizations

None yet

liked a model about 1 month ago

Ockham98/TryOn-Adapter

Updated Jun 8, 2025 • 4

upvoted 3 papers about 1 month ago

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Paper • 2512.04784 • Published Dec 2, 2025 • 24

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

Paper • 2508.09874 • Published Aug 13, 2025 • 10

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 52

upvoted a paper 3 months ago

BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

Paper • 2510.05318 • Published Oct 6, 2025 • 21

updated a dataset 3 months ago

zhibei1204/DiagramQG

Updated Sep 28, 2025 • 150 • 6

published a dataset 3 months ago

zhibei1204/DiagramQG

Updated Sep 28, 2025 • 150 • 6

liked a model 4 months ago

omlab/VLM-R1-Qwen2.5VL-3B-Math-0305

Visual Question Answering • 4B • Updated Apr 14, 2025 • 73 • 8

liked 2 datasets 5 months ago

VTSNLP/instruct_general_dataset

Viewer • Updated Sep 30, 2024 • 4.53M • 173 • 47

VTSNLP/vietnamese_curated_dataset

Viewer • Updated Nov 24, 2024 • 12.2M • 1.83k • 67

upvoted a paper 5 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28, 2025 • 82

upvoted a paper 6 months ago

From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios

Paper • 2506.20279 • Published Jun 25, 2025 • 19

upvoted a paper 7 months ago

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2, 2025 • 48

updated a dataset 7 months ago

zhibei1204/PhysReason

Updated May 29, 2025 • 133 • 17

upvoted a paper 7 months ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26, 2025 • 104

liked a dataset 8 months ago

zhibei1204/PhysReason

Updated May 29, 2025 • 133 • 17

liked a dataset 9 months ago

TheEighthDay/SeekWorld

Preview • Updated Apr 20, 2025 • 142 • 6

upvoted 2 papers 9 months ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published Apr 11, 2025 • 55

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31, 2025 • 54

liked a model 10 months ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 2.29M • 1.41k
