Active filters: dpo, trl
mradermacher/Llama-2-7b-sft-SPIN-gpt4o-GGUF
7B • Updated • 23
mradermacher/Mistral-7B-v0.1-sft-SPIN-Mistral-8x7B-Instruct-v0.1-GGUF
7B • Updated • 17
mradermacher/Mistral-7B-v0.1-sft-SPIN-gpt4o-GGUF
7B • Updated • 12
mradermacher/Mistral-7B-v0.1-sft-SPIN-Mistral-8x7B-Instruct-v0.1-i1-GGUF
7B • Updated • 18
mradermacher/Mistral-7B-v0.1-sft-SPIN-gpt4o-i1-GGUF
7B • Updated • 30
mradermacher/Llama-2-7b-sft-SPIN-gpt4o-i1-GGUF
7B • Updated • 13
michaelnguyen11/TwinLlama-3.2-3B-DPO
Text Generation
• 3B • Updated • 2
tensorblock/chat_gpt2_dpo-GGUF
Text Generation
• 0.2B • Updated • 102
Text Generation
• 0.1B • Updated • 1
bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen
Text Generation
• 7B • Updated • 1
mradermacher/Mistral-7B-v0.3-sft-SPIN-self-GGUF
7B • Updated
mradermacher/Llama-3.1-8B-sft-SPIN-self-GGUF
8B • Updated • 5 • 1
mradermacher/TwinLlama-3.2-3B-DPO-GGUF
3B • Updated • 19
bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-norandom
Text Generation
• 7B • Updated • 3
bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-noshort
Text Generation
• 7B • Updated • 5
mradermacher/GEITje-7B-ultra-GGUF
7B • Updated • 68
mradermacher/GEITje-7B-ultra-i1-GGUF
7B • Updated • 154
LBK95/Llama-2-7b-hf-DPO-LookAhead-5_Q2_TTree1.4_TT0.9_TP0.7_TE0.2_V3
mradermacher/selfbiorag-7b-dpo-full-sft-wo-kqa_silver_wogold-GGUF
7B • Updated • 36
mradermacher/Mistral-7B-v0.1-Llama-3.1-8B-mix-GGUF
7B • Updated • 35
bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-bad
Text Generation
• 7B • Updated • 1
mradermacher/Mistral-7B-v0.3-sft-SPIN-human-dataset-GGUF
7B • Updated • 100
mradermacher/Mistral-7B-v0.3-sft-SPIN-human-dataset-i1-GGUF
7B • Updated • 186
mradermacher/Mistral-7B-v0.3-sft-SPIN-self-human-dataset-GGUF
7B • Updated • 1
mradermacher/Mistral-7B-v0.3-sft-SPIN-self-human-dataset-i1-GGUF
7B • Updated • 66
mradermacher/Llama-3.1-8B-sft-SPIN-human-dataset-GGUF
8B • Updated • 6
mradermacher/Llama-3.1-8B-sft-SPIN-human-dataset-i1-GGUF
8B • Updated • 149 • 1
mradermacher/Llama-3.1-8B-sft-SPIN-self-human-dataset-GGUF
8B • Updated • 2
mradermacher/Gemma-2-9B-sft-SPIN-human-dataset-GGUF
9B • Updated • 5
mradermacher/Gemma-2-9B-sft-SPIN-human-dataset-i1-GGUF
9B • Updated • 28