Active filters: 8-bit
Text Generation • 120B • Updated • 2.88M • 4.4k
Text Generation • 22B • Updated • 6.43M • 4.26k
GadflyII/GLM-4.7-Flash-NVFP4
Text Generation • 18B • Updated • 183k • 48
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
Text Generation • 18B • Updated • 7.1k • 26
mlx-community/Qwen3-TTS-12Hz-0.6B-CustomVoice-8bit
Text-to-Speech • 0.5B • Updated • 1.67k • 9
microsoft/bitnet-b1.58-2B-4T
Text Generation • 0.8B • Updated • 5.88k • 1.26k
openai/gpt-oss-safeguard-20b
Text Generation • 22B • Updated • 17.1k • 184
mlx-community/GLM-4.7-Flash-8bit
Text Generation • 30B • Updated • 8.16k • 17
MultiverseComputingCAI/HyperNova-60B
Text Generation • 60B • Updated • 1.56k • 48
mlx-community/GLM-4.7-Flash-8bit-gs32
Text Generation • 30B • Updated • 531 • 5
GadflyII/GLM-4.7-Flash-MXFP4
Text Generation • 18B • Updated • 661 • 5
FabioSarracino/VibeVoice-Large-Q8
Text-to-Audio • 9B • Updated • 2.62k • 80
Text Generation • 177B • Updated • 5.82k • 11
Text Generation • 5B • Updated • 6.83k • 13
ig1/Qwen3-VL-30B-A3B-Instruct-NVFP4
Image-Text-to-Text • 18B • Updated • 2.28k • 6
lukealonso/MiniMax-M2.1-NVFP4
115B • Updated • 26.9k • 19
nvidia/DeepSeek-V3.2-NVFP4
Text Generation • 394B • Updated • 1.54k • 3
lmstudio-community/GLM-4.7-Flash-MLX-8bit
Text Generation • 30B • Updated • 393k • 4
mlx-community/Qwen3-TTS-12Hz-1.7B-VoiceDesign-8bit
Text-to-Speech • 0.8B • Updated • 1.02k • 3
ragraph-ai/stable-cypher-instruct-3b
Text Generation • 3B • Updated • 359 • 31
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 151k • 10
Text Generation • 397B • Updated • 10.2k • 269
tiiuae/Falcon-E-3B-Instruct
Text Generation • 0.9B • Updated • 284 • 36
MaziyarPanahi/Qwen3-1.7B-GGUF
Text Generation • 2B • Updated • 229k • 6
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation • 133B • Updated • 5.22k • 13
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation • 16B • Updated • 4.16k • 6
mlx-community/DeepSeek-OCR-8bit
Image-Text-to-Text • 1B • Updated • 1.39k • 30
kldzj/gpt-oss-120b-heretic-v2
Text Generation • 117B • Updated • 304 • 17
MaziyarPanahi/NVIDIA-Nemotron-Nano-12B-v2-GGUF
Text Generation • 12B • Updated • 73.4k • 2
Disty0/Z-Image-Turbo-SDNQ-int8
Text-to-Image • Updated • 1.89k • 17