12 53 13

Wen Wang

wwen1997

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

upvoted a paper 4 days ago

SpatialTree: How Spatial Abilities Branch Out in MLLMs

upvoted a paper 6 days ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

View all activity

Organizations

authored a paper 3 days ago

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

Paper • 2512.16924 • Published 10 days ago • 24

authored 2 papers 16 days ago

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

Paper • 2512.03046 • Published 26 days ago • 11

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Paper • 2512.07951 • Published 20 days ago • 47

authored 2 papers 2 months ago

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Paper • 2510.20822 • Published Oct 23 • 40

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17 • 50

authored 9 papers 5 months ago

Object-aware Inversion and Reassembly for Image Editing

Paper • 2310.12149 • Published Oct 18, 2023

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

Paper • 2403.11627 • Published Mar 18, 2024

FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition

Paper • 2405.13870 • Published May 22, 2024

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Paper • 2505.20256 • Published May 26 • 18

GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding

Paper • 2507.15846 • Published Jul 21 • 133

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published Aug 12 • 37

authored 6 papers about 1 year ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 80

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Paper • 2303.17599 • Published Mar 30, 2023

SegGPT: Segmenting Everything In Context

Paper • 2304.03284 • Published Apr 6, 2023 • 1

CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose

Paper • 2206.11752 • Published Jun 23, 2022 • 1

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

Paper • 2211.07636 • Published Nov 14, 2022 • 1

Images Speak in Images: A Generalist Painter for In-Context Visual Learning

Paper • 2212.02499 • Published Dec 5, 2022

Wen Wang

AI & ML interests

Recent Activity

Organizations

wwen1997's activity