Alexzander85/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-NVFP4-MLP-FP8KV Text Generation • 8B • Updated 13 days ago • 549 • 1
dimitribarbot/CodeLlama-13B-Instruct-GPTQ-TensorRT-LLM-RTX-4090 Text Generation • Updated Oct 25, 2024 • 2 • 1
rungalileo/mistral-7b-instruct-v0.3-trtllm-ckpt-wq_int4_awq-kv_int8 Text Generation • Updated Feb 8 • 86
rungalileo/llama-3.1-8b-instruct-trtllm-ckpt-wq_fp8-kv_fp8 Text Generation • Updated about 14 hours ago • 13
rungalileo/mistral-7b-instruct-v0.3-trtllm-ckpt-wq_fp8-kv_fp8 Text Generation • Updated about 14 hours ago • 13
rungalileo/llama-3.2-3b-instruct-trtllm-ckpt-wq_fp8-kv_fp8 Text Generation • Updated about 13 hours ago • 10
rungalileo/llama-3.2-3b-instruct-trtllm-ckpt-wq_nvfp4-kv_fp8 Text Generation • Updated about 10 hours ago • 10
rungalileo/llama-3.1-8b-instruct-trtllm-ckpt-wq_nvfp4-kv_fp8 Text Generation • Updated about 10 hours ago • 6
rungalileo/mistral-7b-instruct-v0.3-trtllm-ckpt-wq_nvfp4-kv_fp8 Text Generation • Updated about 10 hours ago • 14