VLM with textual-driven GRPO training for vision-grounded decision making (https://arxiv.org/pdf/2503.16965, NeurIPS 2025)
Derek Zhe Hu
zhehuderek
AI & ML interests
NLP, Multimodality
Recent Activity
updated
a model
1 day ago
zhehuderek/qwen25_vl_7b_stage2_virl39k_step80
published
a model
1 day ago
zhehuderek/qwen25_vl_7b_stage2_virl39k_step80
updated
a dataset
14 days ago
zhehuderek/ViRL39K_proc
Organizations
None yet