Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models Paper • 2505.24164 • Published May 30, 2025
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query Paper • 2506.03144 • Published Jun 3, 2025 • 7
Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models Paper • 2411.09691 • Published Nov 14, 2024
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published Apr 8, 2025 • 64