Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
jablonkagroup 's Collections
Count-Bench
Shoutouts 🥳
SECS
ChemPile
MaCBench-collection
ChemBench-Collection

ChemPile

updated Oct 23, 2025

The ChemPile is a dataset with over 77 billion curated multimodal tokens about chemistry. For more information, visit https://chempile.lamalab.org/.

Upvote
16

  • jablonkagroup/chempile-education

    Viewer • Updated Jun 23, 2025 • 66.9k • 199 • 5

  • jablonkagroup/chempile-paper

    Viewer • Updated Aug 13, 2025 • 11.7M • 550 • 5

  • jablonkagroup/chempile-mlift

    Viewer • Updated Jul 27, 2025 • 51.5M • 2.63k • 11

  • jablonkagroup/chempile-code

    Viewer • Updated Aug 13, 2025 • 2.27M • 313 • 4

  • jablonkagroup/chempile-caption

    Viewer • Updated Jun 23, 2025 • 100k • 140 • 7

  • jablonkagroup/chempile-lift

    Viewer • Updated Jul 26, 2025 • 176M • 2.49k • 6

  • jablonkagroup/chempile-reasoning

    Viewer • Updated Jul 27, 2025 • 73k • 453 • 5

  • jablonkagroup/chempile-instruction

    Viewer • Updated Jul 30, 2025 • 410k • 222 • 6
Upvote
16
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs