koutch/short_paper_llama_llama3.1-8b_train_sft_all_train_no_think Text Generation • 8B • Updated 6 minutes ago • 89
koutch/short_paper_llama_llama3.1-8b_train_sft_train_no_think Text Generation • 8B • Updated 19 minutes ago • 234
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_all_train_no_think Text Generation • 4B • Updated 34 minutes ago • 70
koutch/short_paper_llama_llama3.1-8b_train_sft_train_para Text Generation • 8B • Updated 34 minutes ago • 105
koutch/short_paper_smol_2.json_train_dpo_v2_train_no_think Text Generation • 3B • Updated about 1 hour ago
koutch/short_paper_smol_2.json_train_dpo_v1_train_no_think Text Generation • 3B • Updated about 1 hour ago
koutch/short_paper_smol_smol3-3B_train_sft_train_no_think Text Generation • 3B • Updated about 2 hours ago • 263
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated about 2 hours ago • 104
koutch/short_paper_smol_smol3-3B_train_sft_train_para Text Generation • 3B • Updated about 2 hours ago • 114
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_train_no_think Text Generation • 4B • Updated about 2 hours ago • 226