Inference Providers
Active filters: GPTQ
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
• 253B • Updated • 11
• 4
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 270
• 9
QuantTrio/GLM-4.5-GPTQ-Int4-Int8Mix
Text Generation
• 55B • Updated • 59
• 5
QuantTrio/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 121
• 2
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8
Text Generation
• 31B • Updated • 716
• 8
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated • 39
• 4
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 46
• 5
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
• 34B • Updated • 11
• 3
QuantTrio/DeepSeek-V3.1-AWQ
Text Generation
• 485B • Updated • 217
• 5
QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation
• 286B • Updated • 26
• 1
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 78.3k
• 3
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 198
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 115
• 1
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 87
• 2
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 2.39k
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 9
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 21
JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated • 4
JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
• 8B • Updated • 2
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 3
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 8
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated • 4
QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix
Text Generation
• 69B • Updated • 12
• 4
QuantTrio/KAT-Dev-GPTQ-Int4
Text Generation
• 33B • Updated • 4
• 1
QuantTrio/KAT-Dev-GPTQ-Int8
Text Generation
• 33B • Updated • 3
• 1
QuantTrio/Kimi-Dev-72B-GPTQ-Int4
Text Generation
• 73B • Updated • 40
• 2
QuantTrio/Kimi-Dev-72B-GPTQ-Int8
Text Generation
• 73B • Updated • 18
• 2
AXERA-TECH/Qwen3-VL-2B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 69
• 1
AXERA-TECH/Qwen3-VL-4B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 41
AXERA-TECH/Qwen3-VL-8B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 22
• 1