Running 132 TxT360: Trillion Extracted Text π 132 Explore and analyze the TxT360 dataset for LLM pre-training