MTV Collection Revisiting Multi-Task Visual Representation Learning • 2 items • Updated about 21 hours ago
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval Paper • 2503.00540 • Published Mar 1, 2025 • 3
LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding Paper • 2407.15754 • Published Jul 22, 2024 • 20