Running on CPU Upgrade Featured 2.98k The Smol Training Playbook 📚 2.98k The secrets to building world-class LLMs
Running Featured 142 Chat Template Playground 💻 142 Visualize JSON data in an interactive split-view editor
Running 3.69k The Ultra-Scale Playbook 🌌 3.69k The ultimate guide to training LLM on large GPU Clusters
CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models Paper • 2410.18505 • Published Oct 24, 2024 • 11