joaogante (Joao Gante)

liked a model 3 months ago

deepseek-ai/DeepSeek-V3.2-Speciale

Text Generation • Updated Dec 1, 2025 • 173k • 679

liked a Space 4 months ago

The Smol Training Playbook

📚

3.01k

The secrets to building world-class LLMs

liked a Space 5 months ago

Maintain the unmaintainable

📚

77

Explore the complex relationships between 400+ machine learning models

liked 2 models 7 months ago

openai/gpt-oss-20b

Text Generation • Updated Aug 26, 2025 • 5.87M • • 4.4k

transformers-community/sep_cache

8B • Updated Aug 4, 2025 • 5 • 9

liked a model 8 months ago

mistralai/Voxtral-Mini-3B-2507

Updated Jul 28, 2025 • 412k • 625

liked a model 10 months ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26, 2025 • 10.2M • 1.11k

liked a model about 1 year ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Feb 24, 2025 • 720k • • 792

liked a model over 1 year ago

Qwen/Qwen2.5-0.5B-Instruct

Text Generation • 0.5B • Updated Sep 25, 2024 • 6.59M • 466

liked a Space over 1 year ago

SynthID Text

🏃

68

Watermarking LLM-generated text with SynthID Text

liked a model over 1 year ago

meta-llama/Llama-3.2-1B

Text Generation • 1B • Updated Oct 24, 2024 • 1.24M • 2.3k

liked a Space over 1 year ago

Repository statistics

📊

15

liked 2 models over 1 year ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 6.27M • • 5.5k

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 269 • 1.71k

liked a Space over 1 year ago

FLUX.1 [dev]

🖥

9.4k

Generate images from your text prompt

liked a model over 1 year ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27, 2024 • 395k • • 1.29k

liked 4 Spaces over 1 year ago

Hf Co Docs Chat

🚀

8

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

125

Explore and compare advanced language models on a new leaderboard

Omni-Zero

🧛

462

Restylize & repose person ID

FineWeb: decanting the web for the finest text data at scale

🍷

1.3k

Generate a curated web‑text dataset for LLM training

Joao Gante

AI & ML interests

Organizations

deepseek-ai/DeepSeek-V3.2-Speciale

The Smol Training Playbook

Maintain the unmaintainable

openai/gpt-oss-20b

transformers-community/sep_cache

mistralai/Voxtral-Mini-3B-2507

Qwen/Qwen3-0.6B

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Qwen/Qwen2.5-0.5B-Instruct

SynthID Text

meta-llama/Llama-3.2-1B

Repository statistics

meta-llama/Llama-3.1-8B-Instruct

mattshumer/Reflection-Llama-3.1-70B

FLUX.1 [dev]

google/gemma-2-2b-it

Hf Co Docs Chat

Open-LLM performances are plateauing, let’s make the leaderboard steep again

Omni-Zero

FineWeb: decanting the web for the finest text data at scale

Joao Gante

AI & ML interests

Organizations

joaogante's activity

The Smol Training Playbook

Maintain the unmaintainable

SynthID Text

Repository statistics

FLUX.1 [dev]

Hf Co Docs Chat

Open-LLM performances are plateauing, let’s make the leaderboard steep again

Omni-Zero

FineWeb: decanting the web for the finest text data at scale