Non-English Embeddings and Models
updated
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper
• 2211.05100
• Published • 37
Contrastive Language-Image Pre-training for the Italian Language
Paper
• 2108.08688
• Published • 2
IT5: Large-scale Text-to-text Pretraining for Italian Language
Understanding and Generation
Paper
• 2203.03759
• Published • 5
Spanish Pre-trained BERT Model and Evaluation Data
Paper
• 2308.02976
• Published • 3
German FinBERT: A German Pre-trained Language Model
Paper
• 2311.08793
• Published • 3
German Text Embedding Clustering Benchmark
Paper
• 2401.02709
• Published • 6
AfroDigits: A Community-Driven Spoken Digit Dataset for African
Languages
Paper
• 2303.12582
• Published • 23
Text Generation
• 7B • Updated • 8.13k
• 68
Updated • 335
• 24
Aya Model: An Instruction Finetuned Open-Access Multilingual Language
Model
Paper
• 2402.07827
• Published • 48
Viewer
• Updated • 206k • 10.8k
• 345
CohereLabs/c4ai-command-r-v01
Text Generation
• Updated • 14.6k
• 1.11k