AI & ML interests

AI security & privacy, algorithmic bias, foundations of ML

Recent Activity

kaiyuyue  updated a collection 19 days ago
Sphere Encoder
kaiyuyue  submitted a paper 19 days ago
Image Generation with a Sphere Encoder
View all activity

tomg-group-umd 's collections 15

LoRI Adapters
LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.
Recurrent Models
These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.
Goldfish Loss: Mitigating Memorization in LLMs
This collection contains artifacts from our paper titled: "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."
Gemstone Models
Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80.
LoRI Adapters
LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.
Gemstone Models
Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80.
Recurrent Models
These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.
Goldfish Loss: Mitigating Memorization in LLMs
This collection contains artifacts from our paper titled: "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."