Active filters: sparsity
RedHatAI/oBERT-6-downstream-pruned-block4-90-squadv1
Updated • 12
RedHatAI/oBERT-3-downstream-dense-squadv1
RedHatAI/oBERT-3-downstream-pruned-unstructured-80-squadv1
Updated • 12
RedHatAI/oBERT-3-downstream-pruned-unstructured-90-squadv1
Updated • 20
RedHatAI/oBERT-3-downstream-pruned-block4-80-squadv1
RedHatAI/oBERT-3-downstream-pruned-block4-90-squadv1
Updated • 10
RedHatAI/oBERT-12-downstream-dense-QAT-squadv1
Updated • 15
RedHatAI/oBERT-12-downstream-pruned-block4-80-QAT-squadv1
Updated • 19
RedHatAI/oBERT-12-downstream-pruned-block4-90-QAT-squadv1
Updated • 19
RedHatAI/oBERT-6-downstream-dense-QAT-squadv1
Updated • 12
RedHatAI/oBERT-6-downstream-pruned-block4-80-QAT-squadv1
Updated • 12
RedHatAI/oBERT-6-downstream-pruned-block4-90-QAT-squadv1
Updated • 10
RedHatAI/oBERT-3-downstream-dense-QAT-squadv1
Updated • 10
RedHatAI/oBERT-3-downstream-pruned-block4-80-QAT-squadv1
Updated • 10
RedHatAI/oBERT-3-downstream-pruned-block4-90-QAT-squadv1
RedHatAI/oBERT-12-upstream-pruned-unstructured-90-v2
Updated • 12
RedHatAI/oBERT-12-upstream-pruned-unstructured-97-v2
Updated • 12
RedHatAI/oBERT-12-upstream-pruned-unstructured-90-finetuned-squadv1-v2
RedHatAI/oBERT-12-upstream-pruned-unstructured-97-finetuned-squadv1-v2
Updated • 15
RedHatAI/oBERT-12-upstream-pruned-unstructured-90-finetuned-mnli-v2
Updated • 10
RedHatAI/oBERT-12-upstream-pruned-unstructured-97-finetuned-mnli-v2
Updated • 13
RedHatAI/oBERT-12-upstream-pruned-unstructured-90-finetuned-qqp-v2
Updated • 11
RedHatAI/oBERT-12-upstream-pruned-unstructured-97-finetuned-qqp-v2
RedHatAI/sst2-distilbert-sparse-blog
Text Classification • Updated • 8 • 4
RedHatAI/bge-small-en-v1.5-quant
Feature Extraction • Updated • 136 • 9
RedHatAI/bge-base-en-v1.5-quant
Feature Extraction • Updated • 278 • 4
RedHatAI/bge-large-en-v1.5-quant
Feature Extraction • Updated • 101 • 22
facebook/superblock-vit-b-16
Image Classification • Updated • 2
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4
Text Generation • 8B • Updated • 15 • 1
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic
Text Generation • 8B • Updated • 8 • 2