aayush garg's picture

In a Training Loop 🔄

aayush garg

garg-aayush

·

https://aayushgarg.dev/

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

published an article about 2 months ago

FlashAttention: Making Attention I/O-Aware

liked a model about 2 months ago

ggml-org/GLM-OCR-GGUF

View all activity

Organizations

upvoted an article 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 292

upvoted an article 5 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

lysandre, ArthurZ, cyrilvallez, reach-vb

•

Dec 1, 2025

• 310

upvoted an article 6 months ago

Article

Continuous batching from first principles

+1

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 380

upvoted a collection 6 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 169

upvoted a paper 7 months ago

FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published Oct 20, 2025 • 80

upvoted a collection almost 2 years ago

multilingual

76 items • Updated Jan 5 • 8

upvoted an article about 2 years ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

+1

Leyo, HugoLaurencon, VictorSanh

•

Apr 15, 2024

• 191

upvoted a collection about 2 years ago

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated Mar 2 • 152

upvoted a collection over 2 years ago

Optimizing diffusion models

Provides a list of papers focusing on optimizing T2I diffusion models, targeting fewer timesteps, architecture optimization, and more. • 21 items • Updated Aug 22, 2024 • 21