Sergio Paniego's picture

Building on HF

Sergio Paniego PRO

sergiopaniego

huggingface

·

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

posted an update about 7 hours ago

Great experience yesterday at PyTorch Conf Europe in Paris 🇫🇷 We (w/ @kashif) talked about training LLMs through interaction, using trajectories across games, browsers, or simulators Room was packed, a clear sign of interest in where RL post-training is heading. sharing the slides! 🤓 https://drive.google.com/file/d/16k7YRnf5EJEo0XjXGlRJ_hVeLoFWKyNP/view?usp=sharing

updated a Space about 10 hours ago

sergiopaniego/browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-09_14-43-35

published a Space about 10 hours ago

sergiopaniego/browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-09_14-43-35

View all activity

Organizations

Posts 84

Post

51

Great experience yesterday at PyTorch Conf Europe in Paris 🇫🇷

We (w/ @kashif ) talked about training LLMs through interaction, using trajectories across games, browsers, or simulators

Room was packed, a clear sign of interest in where RL post-training is heading.

sharing the slides! 🤓
https://drive.google.com/file/d/16k7YRnf5EJEo0XjXGlRJ_hVeLoFWKyNP/view?usp=sharing

Articles 16

Article

804

Welcome Gemma 4: Frontier multimodal intelligence on device

View all Articles

Collections 9

View 9 collections

spaces 122

VLM Object Understanding

Explore object detection, visual grounding, keypoint Detecti

Qwen2-VL-7B

Ask questions about charts in images

SmolVLM-trl-dpo-rlaif-v

Generate text from an image and question

SmolVLM-trl-sft-ChartQA

Ask questions about charts in images

BrowserGym Environment Server

Interact with a browser simulation environment

Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-09 14-43-35

Show I/O tracking dashboard

View 122 Spaces

models 119

sergiopaniego/carla-vlm-qwen35

Updated 8 days ago

sergiopaniego/nemotron-3-sft

Updated about 1 month ago

sergiopaniego/Qwen3-0.6B-carla-trolley-escape

0.8B • Updated Feb 26 • 55

sergiopaniego/tiny-aya-global-SFT

sergiopaniego/nemo3-sft-bnb

sergiopaniego/rloo_tldr_test

sergiopaniego/wordle-grpo-Qwen3-1.7B-test

sergiopaniego/wordle-grpo-Qwen3-1.7B

Text Generation • 2B • Updated Feb 2 • 66

sergiopaniego/browsergym-grpo-functiongemma-270m-it-test

sergiopaniego/sudoku-grpo-qwen3

Text Generation • 2B • Updated Jan 2 • 5

View 119 models

datasets 6

sergiopaniego/browsergym-grpo-functiongemma-270m-it-dataset

Viewer • Updated 11 days ago • 105 • 7.22k • 1

sergiopaniego/sample_videos

Viewer • Updated Jun 30, 2025 • 2 • 25

sergiopaniego/difficult_prompts

Viewer • Updated Jun 20, 2025 • 38 • 7

sergiopaniego/ourworldindata_example

Viewer • Updated Dec 2, 2024 • 13 • 261 • 1

sergiopaniego/faiss_embeddings

Updated Oct 3, 2024 • 16

sergiopaniego/CarlaFollowLanePreviousV

Viewer • Updated Sep 6, 2023 • 59.6k • 29