Exploring MLLM-Diffusion Information Transfer with MetaCanvas Paper • 2512.11464 • Published 3 days ago • 5
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder Paper • 2512.11749 • Published 3 days ago • 26
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published 3 days ago • 9
The N-Body Problem: Parallel Execution from Single-Person Egocentric Video Paper • 2512.11393 • Published 3 days ago • 1
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties Paper • 2512.11799 • Published 3 days ago • 23
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper • 2512.10881 • Published 4 days ago • 26
Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale Paper • 2512.10398 • Published 4 days ago • 3
Evaluating Gemini Robotics Policies in a Veo World Simulator Paper • 2512.10675 • Published 4 days ago • 11
The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality Paper • 2512.10791 • Published 4 days ago • 4
UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving Paper • 2512.09864 • Published 5 days ago • 10
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published 5 days ago • 66
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 5 days ago • 43
Learning Unmasking Policies for Diffusion Language Models Paper • 2512.09106 • Published 6 days ago • 6
EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce Paper • 2512.08868 • Published 6 days ago • 2
TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels Paper • 2512.08358 • Published 6 days ago • 3
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 6 days ago • 121
TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models Paper • 2512.08153 • Published 6 days ago • 6