Zhang Xu's picture

4 6

Zhang Xu

texzhang

·

CheungXu

AI & ML interests

None yet

Recent Activity

liked a model 21 days ago

tencent/Youtu-LLM-2B

liked a model 21 days ago

tencent/Youtu-LLM-2B-Base

upvoted a paper about 1 month ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

View all activity

Organizations

None yet

liked 2 models 21 days ago

tencent/Youtu-LLM-2B

Text Generation • 2B • Updated 11 days ago • 6.81k • 214

tencent/Youtu-LLM-2B-Base

Text Generation • 2B • Updated 16 days ago • 5.4k • 38

upvoted a paper about 1 month ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 38

upvoted 2 papers about 2 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 101

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

Paper • 2508.07534 • Published Aug 11, 2025 • 1

upvoted a paper 3 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 272

liked a model 6 months ago

tencent/Hunyuan-1.8B-Instruct

Text Generation • 2B • Updated Aug 6, 2025 • 245 • 344

liked 3 models over 2 years ago

moka-ai/m3e-base

0.1B • Updated Jul 14, 2023 • 86.6k • 980

baichuan-inc/Baichuan-7B

Text Generation • Updated Jan 9, 2024 • 18.8k • 842

IDEA-CCNL/Ziya-LLaMA-13B-v1

Text Generation • Updated Sep 13, 2023 • 1.11k • 275