shuoxing/qwen2.5-0.5b-instruct-full-pretrain-mix-low-tweet-1m-en-sft Text Generation • 0.5B • Updated May 4 • 6
shuoxing/qwen2.5-0.5b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft Text Generation • 0.5B • Updated May 4 • 3
shuoxing/qwen2.5-0.5b-instruct-full-pretrain-mix-high-tweet-1m-en-sft Text Generation • 0.5B • Updated May 4 • 6
shuoxing/qwen2.5-0.5b-instruct-full-pretrain-control-tweet-1m-en-sft Text Generation • 0.5B • Updated May 4 • 3
shuoxing/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated May 4 • 4
shuoxing/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-sft Text Generation • 196k • Updated May 4 • 5
shuoxing/qwen3-4b-thinking-full-pretrain-mix-mid-tweet-1m-en Text Generation • 196k • Updated May 4 • 8
shuoxing/qwen3-4b-thinking-full-pretrain-mix-mid-tweet-1m-en-sft Text Generation • 196k • Updated May 4 • 6
shuoxing/qwen3-4b-thinking-full-pretrain-mix-high-tweet-1m-en Text Generation • 196k • Updated May 4 • 5
shuoxing/qwen3-4b-thinking-full-pretrain-mix-high-tweet-1m-en-sft Text Generation • 196k • Updated May 4 • 34
shuoxing/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en-sft Text Generation • 196k • Updated May 4 • 8
shuoxing/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en Text Generation • 196k • Updated May 4 • 6
shuoxing/qwen2.5-7b-instruct-full-pretrain-mix-low-tweet-1m-en-sft Text Generation • 333k • Updated May 4 • 5
shuoxing/qwen2.5-7b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft Text Generation • 333k • Updated May 4 • 3
shuoxing/qwen2.5-7b-instruct-full-pretrain-mix-high-tweet-1m-en-sft Text Generation • 333k • Updated May 4 • 7
shuoxing/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-sft Text Generation • 333k • Updated May 4 • 13