language-models
Paper
• 2310.06825
• Published
• 58
BloombergGPT: A Large Language Model for Finance
Paper
• 2303.17564
• Published
• 30
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper
• 1810.04805
• Published
• 26
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper
• 1910.01108
• Published
• 21
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper
• 2307.09288
• Published
• 250
Attention Is All You Need
Paper
• 1706.03762
• Published
• 115
Universal Language Model Fine-tuning for Text Classification
Paper
• 1801.06146
• Published
• 8
Language Models are Few-Shot Learners
Paper
• 2005.14165
• Published
• 19
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper
• 2211.05100
• Published
• 37
Self-Rewarding Language Models
Paper
• 2401.10020
• Published
• 152