[NeurIPS 2025] Vocabulary Frequency Imbalance Dataset and pre-trained models for "Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training (Neurips 2025)" gartland/finewebedu-24K-30B 0.1B • Updated Oct 13 • 7 gartland/finewebedu-49K-30B 0.2B • Updated Oct 13 • 6 gartland/finewebedu-98K-30B 0.2B • Updated Oct 13 • 7 gartland/finewebedu-196K-30B 0.4B • Updated Oct 13 • 5
[NeurIPS 2025] Vocabulary Frequency Imbalance Dataset and pre-trained models for "Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training (Neurips 2025)" gartland/finewebedu-24K-30B 0.1B • Updated Oct 13 • 7 gartland/finewebedu-49K-30B 0.2B • Updated Oct 13 • 6 gartland/finewebedu-98K-30B 0.2B • Updated Oct 13 • 7 gartland/finewebedu-196K-30B 0.4B • Updated Oct 13 • 5