SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 205
📚 LLM pretraining datasets Collection A collection of datasets for LLM pretraining • 9 items • Updated May 5, 2025 • 15
Common Corpus Collection Largest multilingual pretraining data. • 1 item • Updated Nov 13, 2024 • 14
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 139
view article Article You could have designed state of the art positional encoding Nov 25, 2024 • 442
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control +2 Feb 4, 2025 • 187