Automatic Speech Recognition
Transformers
Safetensors
PyTorch
English
avhubert_avsr
audio-visual-speech-recognition
multimodal
speech-recognition
lip-reading
cocktail-party
noise-robust
av-hubert
transformer
audio
video
english
lrs2
voxceleb2
ctc
attention
beam-search
multi-speaker
noisy-speech
Instructions to use nguyenvulebinh/AVSRCocktail with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use nguyenvulebinh/AVSRCocktail with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="nguyenvulebinh/AVSRCocktail")

# Load model directly
from transformers import AVHubertAVSR

model = AVHubertAVSR.from_pretrained("nguyenvulebinh/AVSRCocktail", dtype="auto")
- Notebooks
- Google Colab
- Kaggle