mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9, 2025 • 53
view article Article Swift Transformers Reaches 1.0 – and Looks to the Future +2 pcuenq, FL33TW00D-HF, mattt, reach-vb • Sep 26, 2025 • 43
view article Article `LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot` +9 fracapuano, aractingi, lhoestq, CarolinePascal, pepijn223, jadechoghari, cadene, aliberts, AdilZtn, nepyope, imstevenpmwork • Sep 16, 2025 • 54
view article Article Deploying Your FastAPI Applications on Huggingface Via Docker HemanthSai7 • Dec 11, 2023 • 41
view article Article Introducing Ghost 8B Beta: A Game-Changing Language Model lamhieu • Jul 17, 2024 • 7
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 37 items • Updated Mar 2 • 376