view article Article Exploring Environments Hub: Your Language Model needs better (open) environments to learn Sep 4 • 28
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 5 days ago • 62
DiLoCo: Distributed Low-Communication Training of Language Models Paper • 2311.08105 • Published Nov 14, 2023 • 16
Contra (Bottleneck T5) Collection Text autoencoders capable of embedding and generating text in a fixed-size latent space, useful for embeddings and latent space text editing. • 4 items • Updated Oct 3, 2023 • 28