OmniBrainBench: A Comprehensive Multimodal Benchmark for Brain Imaging Analysis Across Multi-stage Clinical Tasks Paper • 2511.00846 • Published Nov 2, 2025 • 1
Unveiling and Bridging the Functional Perception Gap in MLLMs: Atomic Visual Alignment and Hierarchical Evaluation via PET-Bench Paper • 2601.02737 • Published Jan 6
MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning Paper • 2602.03320 • Published 17 days ago • 2
MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning Paper • 2602.03320 • Published 17 days ago • 2
VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation Paper • 2601.10124 • Published Jan 15 • 4
Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension Paper • 2512.02791 • Published Dec 2, 2025 • 1
Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation? Paper • 2411.03670 • Published Nov 6, 2024
Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA Paper • 2312.17670 • Published Dec 29, 2023
Medal S: Spatio-Textual Prompt Model for Medical Segmentation Paper • 2511.13001 • Published Nov 17, 2025 • 3
Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models Paper • 2511.11910 • Published Nov 14, 2025 • 35
Large Language Models Do NOT Really Know What They Don't Know Paper • 2510.09033 • Published Oct 10, 2025 • 17
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published Oct 13, 2025 • 103
Medical Reasoning in the Era of LLMs: A Systematic Review of Enhancement Techniques and Applications Paper • 2508.00669 • Published Aug 1, 2025
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation Paper • 2405.06948 • Published May 11, 2024
Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion Paper • 2501.16679 • Published Jan 28, 2025
EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis Paper • 2505.23601 • Published May 29, 2025
MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs Paper • 2504.06897 • Published Apr 9, 2025 • 1
POSTER++: A simpler and stronger facial expression recognition network Paper • 2301.12149 • Published Jan 28, 2023 • 1
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning Paper • 2507.22607 • Published Jul 30, 2025 • 47
SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning Paper • 2506.21355 • Published Jun 26, 2025 • 10