omkarenator (Omkar Pangarkar)

liked a Space 4 months ago

The Smol Training Playbook

📚

3.04k

The secrets to building world-class LLMs

liked a dataset 4 months ago

bigcode/the-stack-github-issues

Viewer • Updated Mar 20, 2023 • 31M • 243 • 48

liked a Space 10 months ago

Predict Memory

🧮

106

Calculate and visualize model memory usage from config

liked a dataset 11 months ago

WebOrganizer/Corpus-200B

Preview • Updated Feb 19, 2025 • 8.47k • 11

liked a Space 11 months ago

TxT360: Trillion Extracted Text

📖

133

Explore and download the TxT360 LLM pre‑training dataset

liked a model about 1 year ago

mlfoundations/fasttext-oh-eli5

Updated Aug 1, 2024 • 29

liked 2 Spaces about 1 year ago

The Ultra-Scale Playbook

🌌

3.73k

The ultimate guide to training LLM on large GPU Clusters

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

89

Evaluate multilingual models using FineTasks

liked a dataset over 1 year ago

LLM360/TxT360

Updated May 26, 2025 • 19.9k • 248

liked a Space over 1 year ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Generate a curated web‑text dataset for LLM training

liked 2 datasets over 1 year ago

Trelis/touch-rugby-rules-memorisation

Viewer • Updated Feb 28, 2024 • 363 • 12 • 2

commoncrawl/statistics

Viewer • Updated 17 days ago • 611k • 351 • 26

liked 2 models about 2 years ago

bigcode/starencoder

Updated May 10, 2023 • 2.95k • 57

microsoft/phi-2

Text Generation • 3B • Updated Dec 8, 2025 • 1.68M • 3.43k

liked 4 models over 2 years ago

liked a model almost 3 years ago

stanfordnlp/backpack-gpt2

Text Generation • Updated Aug 14, 2023 • 16 • 16

Omkar Pangarkar

AI & ML interests

Organizations

The Smol Training Playbook

bigcode/the-stack-github-issues

Predict Memory

WebOrganizer/Corpus-200B

TxT360: Trillion Extracted Text

mlfoundations/fasttext-oh-eli5

The Ultra-Scale Playbook

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

LLM360/TxT360

FineWeb: decanting the web for the finest text data at scale

Trelis/touch-rugby-rules-memorisation

commoncrawl/statistics

bigcode/starencoder

microsoft/phi-2

microsoft/phi-1_5

microsoft/phi-1

adept/fuyu-8b

adept/persimmon-8b-base

stanfordnlp/backpack-gpt2

Omkar Pangarkar

AI & ML interests

Organizations

omkarenator's activity

The Smol Training Playbook

Predict Memory

TxT360: Trillion Extracted Text

The Ultra-Scale Playbook

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

FineWeb: decanting the web for the finest text data at scale