Active filters: llamacpp
rob-x-ai/neural-chat-7b-v3-1-GGUF
7B • Updated • 58
Druvith/mistralmed-7b-v1.5.gguf
7B • Updated • 37
rxavier/Taurus-7B-1.0-GGUF
7B • Updated • 9
BramVanroy/GEITje-7B-ultra-GGUF
7B • Updated • 189 • 10
Vikhrmodels/it-5.3-fp16-32k-GGUF
8B • Updated • 387 • 2
rubra-ai/Meta-Llama-3-8B-Instruct-GGUF
9B • Updated • 63 • 4
Vikhrmodels/it-5.4-fp16-orpo-v2-GGUF
8B • Updated • 174 • 4
Dracones/gemma-2-9b-it-GGUF
Text Generation • 9B • Updated • 9
Dracones/gemma-2-27b-it-GGUF
Text Generation • 27B • Updated • 13
Vikhrmodels/Vikhr-Gemma-2B-instruct-GGUF
Text Generation • 3B • Updated • 750 • 19
flowaicom/Flow-Judge-v0.1-GGUF
Text Generation • 4B • Updated • 33 • 10
Vikhrmodels/Vikhr-Llama-3.2-1B-instruct-GGUF
Text Generation • 1B • Updated • 1.08k • 14
Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct-GGUF
Text Generation • 0.5B • Updated • 363 • 9
DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated • 185
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO_GGUF
2B • Updated • 249 • 10
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-r_GGUF
2B • Updated • 341 • 4
vicharai/ViCoder-html-32B-preview-GGUF
Text Generation • 33B • Updated • 63 • 4
Gardeviance/MS-Gardventure-MW-V1-22B-IQ4_NL-GGUF
Text Generation • 22B • Updated • 5
Dca3271144691983/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated
lucky087/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated