koutch/paper_llama_llama3.1-8b_train_sft_all_train_code Text Generation • 8B • Updated 16 days ago • 128
koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_code Text Generation • 4B • Updated 16 days ago • 107
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_code Text Generation • 4B • Updated 16 days ago • 137
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated 16 days ago • 178
koutch/paper_qwen_qwen3-instruct-4b_train_grpo_v1_train_code Text Generation • 4B • Updated 20 days ago • 6
koutch/paper_llama_llama3.1-8b_train_sft_train_thought Text Generation • 8B • Updated 22 days ago • 29
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_thought Text Generation • 4B • Updated 22 days ago • 24
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_dual Text Generation • 4B • Updated 22 days ago • 76