mlxha
/

Qwen-2.5-3B-grpo-code

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Qwen-2.5-3B-grpo-code / training_args.bin

Commit History

Training in progress, step 500

d702b7f
verified

mlxha commited on Apr 17, 2025

Training in progress, step 250

7eba746
verified

mlxha commited on Apr 17, 2025

Training in progress, step 225

f3b5551
verified

mlxha commited on Apr 16, 2025

Training in progress, step 25

1a12ca7
verified

mlxha commited on Apr 16, 2025