view article Article Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs 8 days ago • 19
view article Article AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality 15 days ago • 31
view article Article NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI 30 days ago • 61
AgriLLM Collection A collection of the artifacts for the AgriLLM initiative. • 5 items • Updated Dec 15, 2025 • 5
view article Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator Dec 17, 2025 • 47
view article Article Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture about 1 month ago • 37
Dalla models Collection Dalla is a family of Arabic language models optimized for Arabic text processing through advanced tokenization techniques. • 4 items • Updated Dec 16, 2025 • 2
Can a Multichoice Dataset be Repurposed for Extractive Question Answering? Paper • 2404.17342 • Published Apr 26, 2024 • 2
Jais-2-Family Collection The 2nd generation of the Jais Large Language Models Family • 2 items • Updated Dec 9, 2025 • 13
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 149
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 87
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability Collection A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated Dec 16, 2025 • 23
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 92
Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 325