TrajectoryMover: Generative Movement of Object Trajectories in Videos Paper • 2603.29092 • Published 3 days ago • 1
OmniRoam: World Wandering via Long-Horizon Panoramic Video Generation Paper • 2603.30045 • Published 2 days ago • 2
TokenDial: Continuous Attribute Control in Text-to-Video via Spatiotemporal Token Offsets Paper • 2603.27520 • Published 5 days ago • 2
Meta-Harness: End-to-End Optimization of Model Harnesses Paper • 2603.28052 • Published 4 days ago • 4
All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models Paper • 2604.00479 • Published 1 day ago • 13
Dynin-Omni: Omnimodal Unified Large Diffusion Language Model Paper • 2604.00007 • Published 24 days ago • 14
UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems Paper • 2604.00590 • Published 1 day ago • 3
Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference Paper • 2603.29002 • Published 3 days ago • 2
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding Paper • 2604.00528 • Published 1 day ago • 4
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners? Paper • 2603.25823 • Published 7 days ago • 36
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published 8 days ago • 165
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants Paper • 2604.00842 • Published 1 day ago • 5
MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation Paper • 2603.25406 • Published 7 days ago • 3
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification Paper • 2603.26648 • Published 6 days ago • 32
QuitoBench: A High-Quality Open Time Series Forecasting Benchmark Paper • 2603.26017 • Published 7 days ago • 25
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper • 2603.28407 • Published 3 days ago • 52
FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration Paper • 2603.29557 • Published 2 days ago • 12
VectorGym: A Multitask Benchmark for SVG Code Generation, Sketching, and Editing Paper • 2603.29852 • Published Feb 22 • 4
CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Paper • 2603.29664 • Published 2 days ago • 39