Tom Goldstein's Lab at University of Maryland, College Park

university

http://www.cs.umd.edu/~tomg/

tomgoldsteincs

Activity Feed

AI & ML interests

AI security & privacy, algorithmic bias, foundations of ML

Recent Activity

kaiyuyue authored a paper 7 days ago

Image Generation with a Sphere Encoder

kaiyuyue updated a collection 19 days ago

Sphere Encoder

kaiyuyue submitted a paper 19 days ago

Image Generation with a Sphere Encoder

View all activity

Papers

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

View all Papers

tomg-group-umd 's collections 15

Sphere Encoder

Image Generation with a Sphere Encoder

Image Generation with a Sphere Encoder

Paper • 2602.15030 • Published 28 days ago • 15
kaiyuyue/sphere-encoder-fid-artifacts

Viewer • Updated 18 days ago • 50k • 37

Retrofitting Recurrence

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19
smcleish/Recurrent-Llama-3.2-train-recurrence-32

Text Generation • 1B • Updated Nov 11, 2025 • 384 • 1
smcleish/Recurrent-Llama-3.2-train-recurrence-16

Text Generation • 1B • Updated Nov 11, 2025 • 2
smcleish/Recurrent-Llama-3.2-train-recurrence-8

Text Generation • 1B • Updated Nov 11, 2025 • 1

Refusal Token Models

This collection contains models described in the refusal token paper published in COLM 2025.

tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast

8B • Updated Jul 22, 2025
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens

8B • Updated Jul 22, 2025 • 1
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-single-token

8B • Updated Jul 22, 2025 • 1 • 1
tomg-group-umd/zephyr-llama3-8b-sft-no-refusal-messages

8B • Updated Jul 22, 2025

LoRI Adapters

LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.

tomg-group-umd/LoRI-S_safety_mistral7b_rank_64

Text Generation • Updated Apr 14, 2025 • 1
tomg-group-umd/LoRI-S_safety_mistral7b_rank_32

Text Generation • Updated Apr 14, 2025 • 2
tomg-group-umd/LoRI-S_safety_llama3_rank_64

Text Generation • Updated Aug 13, 2025 • 2
tomg-group-umd/LoRI-S_safety_llama3_rank_32

Text Generation • Updated Apr 14, 2025 • 1

Recurrent Models

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.

tomg-group-umd/huginn-0125

Text Generation • Updated Jul 29, 2025 • 3.18k • 294
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 153
tomg-group-umd/huginn_swa_100_10_avg_0.9_merge

Text Generation • 4B • Updated Jul 17, 2025 • 2
tomg-group-umd/step-00010752-recurrence_full_512_0

Text Generation • 4B • Updated Jul 17, 2025

GenQA

tomg-group-umd/GenQA

Viewer • Updated Jun 21, 2024 • 11.1M • 286 • 54
tomg-group-umd/GenQA_raw

Viewer • Updated Jun 13, 2024 • 11.1M • 41
tomg-group-umd/GenQA_rebalanced

Viewer • Updated Jun 13, 2024 • 6.47M • 12 • 3
tomg-group-umd/GenQA-Subset-llama-3

Text Generation • 8B • Updated Jun 17, 2024 • 1

Zero-Shot Grafting

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Paper • 2505.22664 • Published May 28, 2025 • 7
tomg-group-umd/zero-model-checkpoints

Image-Text-to-Text • Updated Aug 5, 2025 • 2

Goldfish Loss: Mitigating Memorization in LLMs

This collection contains artifacts from our paper titled: "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Paper • 2406.10209 • Published Jun 14, 2024 • 8
tomg-group-umd/3-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 1
tomg-group-umd/4-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024
tomg-group-umd/8-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 1

MTP-LM

Models to accompany "Multi-Token Prediction via Self-Distillation" (arxiv:2602.06019)

Multi-Token Prediction via Self-Distillation

Paper • 2602.06019 • Published Feb 5
jwkirchenbauer/L3-1-8B-Magpie-MTP

8B • Updated Feb 10 • 6
jwkirchenbauer/Qwen3-4B-Inst-2507-MTP

4B • Updated Feb 10 • 68 • 1
jwkirchenbauer/metamathqa-grouped-split

Viewer • Updated Feb 9 • 395k • 118

DynaGuard

https://arxiv.org/abs/2509.02563

tomg-group-umd/DynaGuard-8B

Text Generation • 8B • Updated Sep 3, 2025 • 461 • 15
tomg-group-umd/DynaGuard-4B

Text Generation • 4B • Updated Sep 3, 2025 • 21 • 2
tomg-group-umd/DynaGuard-1.7B

Text Generation • Updated Sep 3, 2025 • 169 • 3
montehoover/DynaBench

Viewer • Updated Nov 22, 2025 • 140k • 405 • 5

FictionalQA

jwkirchenbauer/fictionalqa

Viewer • Updated 15 days ago • 39.2k • 111 • 2
jwkirchenbauer/fictionalqa_training_splits

Viewer • Updated 15 days ago • 219k • 401
jwkirchenbauer/fictionalqa_reformatted_triviaqa

Viewer • Updated 15 days ago • 16.4k • 35

Gemstone Models

Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80.

tomg-group-umd/Gemstone-768x45

Text Generation • 0.5B • Updated Feb 9, 2025 • 5
tomg-group-umd/Gemstone-1280x15

Text Generation • 0.5B • Updated Feb 6, 2025 • 6
tomg-group-umd/Gemstone-512x13

Text Generation • 0.1B • Updated Feb 6, 2025 • 6
tomg-group-umd/Gemstone-1536x50

Text Generation • 2B • Updated Feb 7, 2025 • 11

Style Descriptors

How to extract style from images? Model, dataset, and the paper

Measuring Style Similarity in Diffusion Models

Paper • 2404.01292 • Published Apr 1, 2024 • 17
tomg-group-umd/CSD-ViT-L

Image Feature Extraction • Updated Sep 4, 2024 • 16 • 5
tomg-group-umd/ContraStyles

Viewer • Updated Jul 31, 2024 • 498k • 26 • 5

CLRS-Text

Hugging Face collection for all things CLRS-Text

The CLRS-Text Algorithmic Reasoning Language Benchmark

Paper • 2406.04229 • Published Jun 6, 2024 • 4
tomg-group-umd/CLRS-Text-train

Viewer • Updated Jul 14, 2024 • 2.15M • 328 • 2
tomg-group-umd/CLRS-Text-test

Viewer • Updated Jul 10, 2024 • 503k • 403

PixelProse

From Pixels to Prose: A Large Dataset of Dense Image Captions

Paper • 2406.10328 • Published Jun 14, 2024 • 18
tomg-group-umd/pixelprose

Viewer • Updated Dec 13, 2025 • 15.6M • 473 • 164
pixelprose/pixelprose-shards

Viewer • Updated Dec 14, 2025 • 7.66M • 1.72k • 1
pixelprose/pixelprose-jsons

Preview • Updated Jul 3, 2025 • 49

Sphere Encoder

Image Generation with a Sphere Encoder

Image Generation with a Sphere Encoder

Paper • 2602.15030 • Published 28 days ago • 15
kaiyuyue/sphere-encoder-fid-artifacts

Viewer • Updated 18 days ago • 50k • 37

MTP-LM

Models to accompany "Multi-Token Prediction via Self-Distillation" (arxiv:2602.06019)

Multi-Token Prediction via Self-Distillation

Paper • 2602.06019 • Published Feb 5
jwkirchenbauer/L3-1-8B-Magpie-MTP

8B • Updated Feb 10 • 6
jwkirchenbauer/Qwen3-4B-Inst-2507-MTP

4B • Updated Feb 10 • 68 • 1
jwkirchenbauer/metamathqa-grouped-split

Viewer • Updated Feb 9 • 395k • 118

Retrofitting Recurrence

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19
smcleish/Recurrent-Llama-3.2-train-recurrence-32

Text Generation • 1B • Updated Nov 11, 2025 • 384 • 1
smcleish/Recurrent-Llama-3.2-train-recurrence-16

Text Generation • 1B • Updated Nov 11, 2025 • 2
smcleish/Recurrent-Llama-3.2-train-recurrence-8

Text Generation • 1B • Updated Nov 11, 2025 • 1

DynaGuard

https://arxiv.org/abs/2509.02563

tomg-group-umd/DynaGuard-8B

Text Generation • 8B • Updated Sep 3, 2025 • 461 • 15
tomg-group-umd/DynaGuard-4B

Text Generation • 4B • Updated Sep 3, 2025 • 21 • 2
tomg-group-umd/DynaGuard-1.7B

Text Generation • Updated Sep 3, 2025 • 169 • 3
montehoover/DynaBench

Viewer • Updated Nov 22, 2025 • 140k • 405 • 5

Refusal Token Models

This collection contains models described in the refusal token paper published in COLM 2025.

tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast

8B • Updated Jul 22, 2025
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens

8B • Updated Jul 22, 2025 • 1
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-single-token

8B • Updated Jul 22, 2025 • 1 • 1
tomg-group-umd/zephyr-llama3-8b-sft-no-refusal-messages

8B • Updated Jul 22, 2025

FictionalQA

jwkirchenbauer/fictionalqa

Viewer • Updated 15 days ago • 39.2k • 111 • 2
jwkirchenbauer/fictionalqa_training_splits

Viewer • Updated 15 days ago • 219k • 401
jwkirchenbauer/fictionalqa_reformatted_triviaqa

Viewer • Updated 15 days ago • 16.4k • 35

LoRI Adapters

LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.

tomg-group-umd/LoRI-S_safety_mistral7b_rank_64

Text Generation • Updated Apr 14, 2025 • 1
tomg-group-umd/LoRI-S_safety_mistral7b_rank_32

Text Generation • Updated Apr 14, 2025 • 2
tomg-group-umd/LoRI-S_safety_llama3_rank_64

Text Generation • Updated Aug 13, 2025 • 2
tomg-group-umd/LoRI-S_safety_llama3_rank_32

Text Generation • Updated Apr 14, 2025 • 1

Gemstone Models

Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80.

tomg-group-umd/Gemstone-768x45

Text Generation • 0.5B • Updated Feb 9, 2025 • 5
tomg-group-umd/Gemstone-1280x15

Text Generation • 0.5B • Updated Feb 6, 2025 • 6
tomg-group-umd/Gemstone-512x13

Text Generation • 0.1B • Updated Feb 6, 2025 • 6
tomg-group-umd/Gemstone-1536x50

Text Generation • 2B • Updated Feb 7, 2025 • 11

Recurrent Models

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.

tomg-group-umd/huginn-0125

Text Generation • Updated Jul 29, 2025 • 3.18k • 294
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 153
tomg-group-umd/huginn_swa_100_10_avg_0.9_merge

Text Generation • 4B • Updated Jul 17, 2025 • 2
tomg-group-umd/step-00010752-recurrence_full_512_0

Text Generation • 4B • Updated Jul 17, 2025

Style Descriptors

How to extract style from images? Model, dataset, and the paper

Measuring Style Similarity in Diffusion Models

Paper • 2404.01292 • Published Apr 1, 2024 • 17
tomg-group-umd/CSD-ViT-L

Image Feature Extraction • Updated Sep 4, 2024 • 16 • 5
tomg-group-umd/ContraStyles

Viewer • Updated Jul 31, 2024 • 498k • 26 • 5

GenQA

tomg-group-umd/GenQA

Viewer • Updated Jun 21, 2024 • 11.1M • 286 • 54
tomg-group-umd/GenQA_raw

Viewer • Updated Jun 13, 2024 • 11.1M • 41
tomg-group-umd/GenQA_rebalanced

Viewer • Updated Jun 13, 2024 • 6.47M • 12 • 3
tomg-group-umd/GenQA-Subset-llama-3

Text Generation • 8B • Updated Jun 17, 2024 • 1

CLRS-Text

Hugging Face collection for all things CLRS-Text

The CLRS-Text Algorithmic Reasoning Language Benchmark

Paper • 2406.04229 • Published Jun 6, 2024 • 4
tomg-group-umd/CLRS-Text-train

Viewer • Updated Jul 14, 2024 • 2.15M • 328 • 2
tomg-group-umd/CLRS-Text-test

Viewer • Updated Jul 10, 2024 • 503k • 403

Zero-Shot Grafting

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Paper • 2505.22664 • Published May 28, 2025 • 7
tomg-group-umd/zero-model-checkpoints

Image-Text-to-Text • Updated Aug 5, 2025 • 2

PixelProse

From Pixels to Prose: A Large Dataset of Dense Image Captions

Paper • 2406.10328 • Published Jun 14, 2024 • 18
tomg-group-umd/pixelprose

Viewer • Updated Dec 13, 2025 • 15.6M • 473 • 164
pixelprose/pixelprose-shards

Viewer • Updated Dec 14, 2025 • 7.66M • 1.72k • 1
pixelprose/pixelprose-jsons

Preview • Updated Jul 3, 2025 • 49

Goldfish Loss: Mitigating Memorization in LLMs

This collection contains artifacts from our paper titled: "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Paper • 2406.10209 • Published Jun 14, 2024 • 8
tomg-group-umd/3-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 1
tomg-group-umd/4-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024
tomg-group-umd/8-goldfish-loss-llama-1B

Text Generation • 1B • Updated Aug 19, 2024 • 1

AI & ML interests

Recent Activity

Papers

Team members 31

tomg-group-umd 's collections 15