mozilla-foundation/common_voice_13_0
Updated • 2.42k • 3
How to use ptah23/whisper-small-dv with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="ptah23/whisper-small-dv") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("ptah23/whisper-small-dv")
model = AutoModelForSpeechSeq2Seq.from_pretrained("ptah23/whisper-small-dv")This model is a fine-tuned version of openai/whisper-small on the Common Voice 13 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Wer Ortho |
|---|---|---|---|---|---|
| 0.1248 | 1.63 | 500 | 0.1684 | 12.9881 | 62.0447 |
| 0.0484 | 3.26 | 1000 | 0.1629 | 11.6493 | 58.6113 |
| 0.0315 | 4.89 | 1500 | 0.1878 | 11.7224 | 58.9386 |
| 0.0125 | 6.51 | 2000 | 0.2308 | 11.0895 | 57.2185 |
| 0.0058 | 8.14 | 2500 | 0.2671 | 11.0773 | 57.6224 |
| 0.0049 | 9.77 | 3000 | 0.2843 | 11.2564 | 57.6572 |
| 0.0033 | 11.4 | 3500 | 0.2845 | 11.0982 | 57.1558 |
| 0.0046 | 13.03 | 4000 | 0.2863 | 57.7129 | 11.2494 |
Base model
openai/whisper-small