Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 246k • 2.83k
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • 4B • Updated Mar 11 • 1.23M • 844
UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation Paper • 2504.21336 • Published Apr 30, 2025 • 5
view article Article SigLIP 2: A better multilingual vision language encoder +1 ariG23498, merve, qubvel-hf • Feb 21, 2025 • 212