PSFT+RL models
SII-Wenhong
wh-zhu
AI & ML interests
None yet
Organizations
models
57
wh-zhu/Qwen2.5-7B-Instruct-SFT-lr-5e6
8B
•
Updated
wh-zhu/Qwen2.5-7B-Instruct-16-1300
8B
•
Updated
wh-zhu/Qwen2.5-7B-Instruct-ref-1300
8B
•
Updated
wh-zhu/Qwen2.5-7B-Instruct-update4-600
8B
•
Updated
wh-zhu/Qwen2.5-7B-Instruct-VL-SFT-RL120
8B
•
Updated
wh-zhu/Qwen2.5-7B-Instruct-VL-SFT-RL165
8B
•
Updated
•
6
wh-zhu/Qwen2.5-7B-Instruct-VL-PSFT-RL165
8B
•
Updated
wh-zhu/Qwen2.5-7B-Instruct-VL-ORI-RL140
8B
•
Updated
•
7
wh-zhu/Qwen2.5-7B-Instruct-edit-ruilin400
8B
•
Updated
wh-zhu/Qwen2.5-7B-Instruct-VL-RL100
8B
•
Updated
•
9