Running on CPU Upgrade Featured 2.83k The Smol Training Playbook 📚 2.83k The secrets to building world-class LLMs
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy +4 Sep 18, 2024 • 272
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 613k • • 1.41k
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 177
view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8, 2025 • 29