Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Woojin Chung's picture

Woojin Chung PRO

gartland
PenPaperKeyCode's profile picture
·

AI & ML interests

None yet

Organizations

Pre-training's profile picture Cambridge-KAIST's profile picture Cambridge-KAIST2's profile picture token-frequency's profile picture

gartland 's collections 1

[NeurIPS 2025] Vocabulary Frequency Imbalance
Dataset and pre-trained models for "Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training (Neurips 2025)"
  • gartland/finewebedu-24K-30B

    0.1B • Updated Oct 13 • 7
  • gartland/finewebedu-49K-30B

    0.2B • Updated Oct 13 • 6
  • gartland/finewebedu-98K-30B

    0.2B • Updated Oct 13 • 7
  • gartland/finewebedu-196K-30B

    0.4B • Updated Oct 13 • 5
[NeurIPS 2025] Vocabulary Frequency Imbalance
Dataset and pre-trained models for "Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training (Neurips 2025)"
  • gartland/finewebedu-24K-30B

    0.1B • Updated Oct 13 • 7
  • gartland/finewebedu-49K-30B

    0.2B • Updated Oct 13 • 6
  • gartland/finewebedu-98K-30B

    0.2B • Updated Oct 13 • 7
  • gartland/finewebedu-196K-30B

    0.4B • Updated Oct 13 • 5
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs