The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text Paper • 2512.16924 • Published 10 days ago • 24
MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues Paper • 2512.03046 • Published 26 days ago • 11
Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality Paper • 2512.07951 • Published 20 days ago • 47
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives Paper • 2510.20822 • Published Oct 23 • 40
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper • 2510.15742 • Published Oct 17 • 50
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Paper • 2403.11627 • Published Mar 18, 2024
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition Paper • 2405.13870 • Published May 22, 2024
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis Paper • 2412.15214 • Published Dec 19, 2024 • 15
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior Paper • 2407.04947 • Published Jul 6, 2024
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration Paper • 2505.20256 • Published May 26 • 18
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models Paper • 2508.09138 • Published Aug 12 • 37
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published Nov 14, 2024 • 80
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models Paper • 2303.17599 • Published Mar 30, 2023
CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose Paper • 2206.11752 • Published Jun 23, 2022 • 1
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale Paper • 2211.07636 • Published Nov 14, 2022 • 1
Images Speak in Images: A Generalist Painter for In-Context Visual Learning Paper • 2212.02499 • Published Dec 5, 2022