MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation Paper • 2510.18692 • Published Oct 21, 2025 • 41
Hulu-Med: A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding Paper • 2510.08668 • Published Oct 9, 2025 • 9
Token Activation Map to Visually Explain Multimodal LLMs Paper • 2506.23270 • Published Jun 29, 2025 • 5