Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

liked a model 12 days ago

Qwen/Qwen3.5-122B-A10B

liked a model 12 days ago

Qwen/Qwen3.5-35B-A3B-Base

liked a model 12 days ago

Qwen/Qwen3.5-35B-A3B

View all activity

Organizations

upvoted a paper 29 days ago

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

Paper • 2602.06079 • Published Feb 4 • 18

upvoted 3 papers about 1 month ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published Feb 2 • 60

Qwen3-TTS Technical Report

Paper • 2601.15621 • Published Jan 22 • 70

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Paper • 2601.18137 • Published Jan 26 • 29

upvoted 2 papers 3 months ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published Nov 25, 2025 • 42

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 105

upvoted a collection 6 months ago

Qwen3-Next

Collection

4 items • Updated Dec 31, 2025 • 186

upvoted a paper 7 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 272

upvoted 2 papers 8 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 319

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8, 2025 • 93

upvoted a paper 9 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 188

upvoted 4 papers 10 months ago

upvoted a collection 10 months ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.71k

upvoted 4 papers about 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 214

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 109

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21, 2025 • 67

Chujie Zheng

AI & ML interests

Recent Activity

Organizations

chujiezheng's activity