OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17, 2025 • 92
view article Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k
view article Article Faster Assisted Generation with Dynamic Speculation +5 jmamou, orenpereg, joaogante, lewtun, danielkorat, Nadav-Timor, moshew • Oct 8, 2024 • 51
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5, 2024 • 29
view article Article Beyond Traditional Fine-tuning: Exploring Advanced Techniques to Mitigate LLM Hallucinations Imama • Feb 11, 2024 • 5
Context-aware Decoding Reduces Hallucination in Query-focused Summarization Paper • 2312.14335 • Published Dec 21, 2023 • 1