Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published 3 days ago • 3
Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers Paper • 2506.03065 • Published Jun 3, 2025 • 27
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published Apr 24, 2025 • 92
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published Apr 8, 2025 • 182
xtuner/llava-llama-3-8b-v1_1-transformers Image-Text-to-Text • 8B • Updated Apr 28, 2024 • 43.6k • 81
google/siglip-so400m-patch14-384 Zero-Shot Image Classification • 0.9B • Updated Sep 26, 2024 • 6.14M • 634