LLM-in-Sandbox Elicits General Agentic Intelligence Paper • 2601.16206 • Published 1 day ago • 52
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper • 2601.15892 • Published 2 days ago • 43
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation Paper • 2601.15369 • Published 3 days ago • 15
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders Paper • 2601.16208 • Published 1 day ago • 43
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 10 days ago • 31
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders Paper • 2601.10332 • Published 9 days ago • 28
Changelog Team & Enterprise Articles Now Featured on the Hugging Face Blog • Dec 8, 2025 • 91
MAXS: Meta-Adaptive Exploration with LLM Agents Paper • 2601.09259 • Published 10 days ago • 93
UM-Text: A Unified Multimodal Model for Image Understanding Paper • 2601.08321 • Published 11 days ago • 8
Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering Paper • 2601.09697 • Published 10 days ago • 8
Article How We Built a Semantic Highlight Model To Save Token Cost for RAG • 9 days ago • 59
Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization Paper • 2601.04582 • Published 16 days ago • 10
User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale Paper • 2601.08225 • Published 11 days ago • 50