JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 21 days ago • 206
Echo-Memory: A Controlled Study of Memory in Action World Models Paper • 2606.09803 • Published 23 days ago • 32
Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation Paper • 2606.04527 • Published 28 days ago • 28
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization Paper • 2605.15980 • Published May 15 • 36
TextLDM: Language Modeling with Continuous Latent Diffusion Paper • 2605.07748 • Published May 8 • 26
Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation Paper • 2605.04128 • Published May 5 • 17