Active filters: sparsity
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation
• 8B • Updated • 12
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation
• 8B • Updated • 1
RedHatAI/Sparse-Llama-3.1-8B-2of4
Text Generation
• 8B • Updated • 41
• 62
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16
Text Generation
• 2B • Updated • 4
• 3
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4
Text Generation
• 8B • Updated • 3
• 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4
Text Generation
• 8B • Updated • 10
• 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16
Text Generation
• 2B • Updated • 7
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16
Text Generation
• 2B • Updated • 3
bartowski/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation
• 8B • Updated • 291
• 3
QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation
• 8B • Updated • 302
• 4
tensorblock/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation
• 8B • Updated • 8
nintwentydo/pixtral-12b-2409-2of4-sparse
Image-Text-to-Text
• 13B • Updated • 1
• 1
HangGuo/Llama2-70B-QuaRot-OBR-GPTQ-W4A4KV4S50
Text Generation
• Updated • 1
HangGuo/Llama2-70B-QuaRot-OBR-RTN-W4A4KV4S50
Text Generation
• Updated
HangGuo/Llama2-70B-SpinQuant-OBR-RTN-W4A4KV4S50
Text Generation
• Updated
HangGuo/Llama2-70B-SpinQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
• Updated
HangGuo/Llama3-70B-SpinQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
• Updated
HangGuo/Llama3-70B-SpinQuant-OBR-RTN-W4A4KV4S50
Text Generation
• Updated
HangGuo/Llama3-70B-QuaRot-OBR-RTN-W4A4KV16S50
Text Generation
• Updated
HangGuo/Llama3-70B-QuaRot-OBR-GPTQ-W4A4KV16S50
Text Generation
• Updated
HangGuo/QWen2.5-7B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
• Updated
HangGuo/QWen2.5-32B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
• Updated
HangGuo/QWen2.5-1.5B-FlatQuant-OBR-GPTQ-W4A8KV16S50
Text Generation
• Updated • 1
HangGuo/QWen2.5-3B-FlatQuant-OBR-GPTQ-W4A8KV16S50
Text Generation
• Updated • 1
HangGuo/QWen2.5-3B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
• Updated
HangGuo/QWen2.5-1.5B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
• Updated • 1
ay933/BDA-Botanical-Dormancy
Updated