Diffusion Language Models are Super Data Learners Paper • 2511.03276 • Published Nov 5, 2025 • 128
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper • 2509.22638 • Published Sep 26, 2025 • 70
Fostering Video Reasoning via Next-Event Prediction Paper • 2505.22457 • Published May 28, 2025 • 29
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18, 2025 • 19
🔱 Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated Nov 19, 2025 • 30
LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation Paper • 2410.13846 • Published Oct 17, 2024 • 2
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 1 day ago • 211