Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

upvoted an article 15 days ago

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

upvoted a collection 17 days ago

Physics of Language Models: Part 4.2

upvoted an article 21 days ago

We Got Claude to Fine-Tune an Open Source LLM

View all activity

Organizations

None yet

upvoted an article 15 days ago

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

18 days ago

•

upvoted a collection 17 days ago

Physics of Language Models: Part 4.2

Collection

16 items • Updated Jul 29 • 14

upvoted an article 21 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

23 days ago

•

535

upvoted a paper 2 months ago

The Massive Legal Embedding Benchmark (MLEB)

Paper • 2510.19365 • Published Oct 22 • 17

upvoted an article 2 months ago

Article

Australian-made LLM beats OpenAI and Google at legal retrieval

Oct 23

•

upvoted an article 3 months ago

Article

There is no such thing as a tokenizer-free lunch

Sep 25

•

updated a dataset 3 months ago

soates/australian-insurance-dspy-corpus

Viewer • Updated Sep 17 • 359 • 20

published a dataset 3 months ago

soates/australian-insurance-dspy-corpus

Viewer • Updated Sep 17 • 359 • 20

upvoted 2 papers 3 months ago

Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12 • 26

The Majority is not always right: RL training for solution aggregation

Paper • 2509.06870 • Published Sep 8 • 16

updated a dataset 4 months ago

soates/tictactoe-gemma-dataset

Viewer • Updated Aug 15 • 93.6k • 15

published a dataset 4 months ago

soates/tictactoe-gemma-dataset

Viewer • Updated Aug 15 • 93.6k • 15

liked a model 5 months ago

Menlo/Lucy-128k

Text Generation • 2B • Updated Aug 4 • 294 • 108

liked a model 6 months ago

chandar-lab/NeoBERT

Feature Extraction • 0.2B • Updated Mar 25 • 3.19k • 186

upvoted 2 papers 7 months ago

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30 • 14

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Paper • 2505.11711 • Published May 16 • 11

upvoted an article 7 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

•

244

upvoted an article 8 months ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25

•

303

upvoted a paper 8 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 139

upvoted an article 8 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18

•

Stephen Oates PRO

AI & ML interests

Recent Activity

Organizations

soates's activity

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

We Got Claude to Fine-Tune an Open Source LLM

Australian-made LLM beats OpenAI and Google at legal retrieval

There is no such thing as a tokenizer-free lunch

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Tiny Agents: an MCP-powered agent in 50 lines of code

Gotchas in Tokenizer Behavior Every Developer Should Know