Models: 40

Active filters: audio-language-model

OpenMOSS-Team/MOSS-Music-8B-Instruct

Audio-Text-to-Text • 9B • Updated May 1 • 1.18k • 27

OpenMOSS-Team/MOSS-Music-8B-Thinking

Audio-Text-to-Text • 9B • Updated May 1 • 226 • 38

mispeech/midashenglm-7b-0804-fp32

Audio-Text-to-Text • 8B • Updated Mar 17 • 123k • 82

nvidia/audio-flamingo-next-captioner-hf

Audio-Text-to-Text • 8B • Updated May 13 • 1.43k • 19

mlx-community/MOSS-Music-8B-Thinking-8bit

Audio-Text-to-Text • 3B • Updated 18 days ago • 486 • 7

maitrix-org/Voila-base

Audio-to-Audio • 8B • Updated May 6, 2025 • 33 • 13

maitrix-org/Voila-chat

Audio-to-Audio • Updated May 6, 2025 • 84 • 55

moonshotai/Kimi-Audio-7B-Instruct

Text-to-Speech • 10B • Updated May 29, 2025 • 82.1k • 405

moonshotai/Kimi-Audio-7B

Text-to-Speech • 10B • Updated May 29, 2025 • 96 • 84

rsxdalv/Kimi-Audio-7B-Instruct

Text-to-Speech • 10B • Updated May 23, 2025 • 13

zh794390558/Kimi-Audio-7B

Text-to-Speech • 10B • Updated Jun 9, 2025 • 7

mispeech/midashenglm-7b-0804-4bit-bnb

Audio-Text-to-Text • 8B • Updated Oct 20, 2025 • 15 • 1

mispeech/midashenglm-7b-0804-bf16

Audio-Text-to-Text • 8B • Updated Mar 17 • 458

mispeech/midashenglm-7b-0804-fp8

Audio-Text-to-Text • 8B • Updated Oct 31, 2025 • 7

mispeech/midashenglm-7b-0804-w4a16-gptq

Audio-Text-to-Text • 3B • Updated Oct 31, 2025 • 14

mispeech/midashenglm-7b-1021-bf16

Audio-Text-to-Text • 8B • Updated Mar 17 • 1.04k • 3

mispeech/midashenglm-7b-1021-fp8

Audio-Text-to-Text • 8B • Updated Oct 31, 2025 • 26 • 5

mispeech/midashenglm-7b-1021-fp32

Audio-Text-to-Text • 8B • Updated Oct 31, 2025 • 39 • 2

mispeech/midashenglm-7b-1021-w4a16-gptq

Audio-Text-to-Text • 3B • Updated Oct 31, 2025 • 13 • 1

FunAudioLLM/Fun-Audio-Chat-8B

Any-to-Any • 9B • Updated Dec 24, 2025 • 769 • 184

Mayank022/Audio-Language-Model

Audio-Text-to-Text • Updated Feb 26

tunglinwood/Kimi-Audio-7B-Instruct

Text-to-Speech • 10B • Updated Feb 12 • 4

cslys1999/Eureka-Audio-Instruct

Audio-Text-to-Text • 3B • Updated Feb 26 • 105 • 6

teamvizuara/Vocal-LLM

Audio-Text-to-Text • Updated Feb 26

mispeech/midashenglm-0.6b-fp32

Audio-Text-to-Text • 0.7B • Updated Apr 3 • 360 • 4

tencent/Unified_Audio_Schema

Audio-Text-to-Text • 8B • Updated Apr 16 • 15 • 10

mlx-community/kimi-audio-7b

Text-to-Speech • 10B • Updated Apr 5 • 111

nvidia/audio-flamingo-next-hf

Audio-Text-to-Text • 8B • Updated May 13 • 5.16k • 56

nvidia/audio-flamingo-next-think-hf

Audio-Text-to-Text • 8B • Updated May 13 • 4.41k • 9

mispeech/midashenglm-0.6b-gguf

Audio-Text-to-Text • 0.6B • Updated Apr 17 • 223 • 1