language-models
Paper
• 2310.06825
• Published • 58
BloombergGPT: A Large Language Model for Finance
Paper
• 2303.17564
• Published • 31
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper
• 1810.04805
• Published • 26
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper
• 1910.01108
• Published • 22
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper
• 2307.09288
• Published • 250
Attention Is All You Need
Paper
• 1706.03762
• Published • 117
Universal Language Model Fine-tuning for Text Classification
Paper
• 1801.06146
• Published • 8
Language Models are Few-Shot Learners
Paper
• 2005.14165
• Published • 20
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper
• 2211.05100
• Published • 37
Self-Rewarding Language Models
Paper
• 2401.10020
• Published • 152