distilbert/distilbert-base-uncased-finetuned-sst-2-english Text Classification • 67M • Updated Dec 19, 2023 • 2.92M • • 861
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published Jan 28, 2025 • 31
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 202