Models

372

Full-text search

Active filters: speculative-decoding

Joysulem/FireEcho

Text Generation • Updated Feb 17 • 6

z-lab/gpt-oss-20b-DFlash

Text Generation • 0.8B • Updated Apr 7 • 1.93k • 22

togethercomputer/Aurora-Spec-Minimax-M2.5

Text Generation • 0.9B • Updated Mar 19 • 355 • 6

StentorLabs/Stentor-30M

Text Generation • 30.4M • Updated Feb 21 • 110 • 3

mradermacher/Stentor-30M-GGUF

Text Generation • 30.4M • Updated Feb 21 • 147 • 3

mradermacher/Stentor-30M-i1-GGUF

Text Generation • 30.4M • Updated Feb 21 • 152 • 2

StentorLabs/Stentor2-12M-Preview

Text Generation • 12.3M • Updated Feb 25 • 11

husj576/GTO-deepseek-8B

Text Generation • Updated Mar 4 • 5

husj576/GTO-llama33-instruct-70B

Text Generation • Updated Mar 4 • 2

husj576/GTO-vicuna-13b

Text Generation • Updated Mar 4 • 1.23k

husj576/GTO-qwen3-8B

Text Generation • Updated Mar 4 • 5

AICP-Labs/qwen3-32b-dflash-en-zh

Text Generation • 0.5B • Updated Mar 1 • 388 • 3

osoleve/Qwen3.5-27B-Text-NVFP4-MTP

Text Generation • 17B • Updated Mar 5 • 11.5k • 19

surogate/Qwen3.5-27B-NVFP4

Text Generation • Updated Feb 28 • 3.53k

festr2/GLM-5-NVFP4-MTP

435B • Updated Mar 1 • 494 • 2

z-lab/Qwen3.5-35B-A3B-DFlash

Text Generation • 0.5B • Updated Apr 7 • 5.75k • 36

Cloudriver/MSD-Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated Mar 2 • 34

Cloudriver/EAGLE3-Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated Mar 2 • 11

ping69852/Medusa-LLaVA1.5-7B

Image-Text-to-Text • Updated Mar 5 • 1

ping69852/Medusa-Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated Mar 5 • 1

celestialcreator/Llama-3.2-1B-MTP-k8

Text Generation • Updated Mar 5 • 2.85k • • 3

lightseekorg/kimi-k2.5-eagle3

3B • Updated Mar 16 • 55.9k • 11

StentorLabs/Stentor2-12M

Text Generation • 12.3M • Updated Mar 8 • 140 • 2

BLR2/Qwen3.5-9B-Eagle3-ShareGPT

Updated Mar 7 • 90 • 4

StentorLabs/Stentor2-30M

Text Generation • 30.4M • Updated Mar 21 • 166 • 2

Thr45h/MEDUSA-Llama-3.1-8B-Instruct

Text Generation • 3B • Updated Mar 17 • 39

darkmaniac7/TokForge-AccelerationPack-Draft

Text Generation • Updated 26 days ago • 212

darkmaniac7/TokForge-AccelerationPack-Qwen35-Draft

Text Generation • Updated Mar 25 • 11

inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch1

2B • Updated Mar 19 • 5

inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch2

2B • Updated Mar 19 • 2