arxiv:2602.02276
Zichen Wen
zichenwen
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL liked a model 16 days ago
moonshotai/Kimi-K2.6