Complete Guide: Training and Inference with π₀.₅ (pi05) on Custom Datasets about 23 hours ago • 1
Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 11 days ago • 60
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models 26 days ago • 31
Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms 25 days ago • 34
Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks 24 days ago • 22
Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications 12 days ago • 22
DeepMath: A lightweight math reasoning Agent with smolagents 11 days ago • 27
Introducing swift-huggingface: The Complete Swift Client for Hugging Face 10 days ago • 30