QP-OneModel: A Unified Generative LLM for Multi-Task Query Understanding in Xiaohongshu Search Paper • 2602.09901 • Published 17 days ago • 6
Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training Paper • 2602.00747 • Published 27 days ago • 9
Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors Paper • 2601.15625 • Published Jan 22 • 8
RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services Paper • 2511.07070 • Published Nov 10, 2025 • 20
RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services Paper • 2507.10605 • Published Jul 13, 2025 • 9
GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets Paper • 2504.19898 • Published Apr 28, 2025 • 5
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Dec 31, 2025 • 696
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Dec 31, 2025 • 376