dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

upvoted an article about 5 hours ago

Transformers v5: Simple model definitions powering the AI ecosystem

liked a dataset about 8 hours ago

neulab/agent-data-collection

upvoted a paper 1 day ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

View all activity

Organizations

None yet

upvoted an article about 5 hours ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

301

liked a dataset about 8 hours ago

neulab/agent-data-collection

Preview • Updated 2 days ago • 2.12k • 107

upvoted a paper 1 day ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published 9 days ago • 26

liked a model 1 day ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated about 22 hours ago • 154k • • 713

upvoted a paper 2 days ago

Small Language Models are the Future of Agentic AI

Paper • 2506.02153 • Published Jun 2, 2025 • 24

upvoted a paper 5 days ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published 8 days ago • 63

upvoted 2 papers 6 days ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published 10 days ago • 56

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 12 days ago • 221

liked a model 7 days ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 3 days ago • 218k • • 884

upvoted an article 9 days ago

Article

Forge: Scalable Agent RL Framework and Algorithm

10 days ago

•

126

liked a model 9 days ago

MiniMaxAI/MiniMax-M2.5

Text Generation • Updated 7 days ago • 191k • • 855

liked a model 11 days ago

zai-org/GLM-5

Text Generation • 754B • Updated 9 days ago • 178k • • 1.43k

upvoted 3 papers 12 days ago

upvoted a paper 14 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published 16 days ago • 71

upvoted 3 papers 15 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 18 days ago • 93

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published 17 days ago • 28

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published 17 days ago • 27

liked a dataset 15 days ago

internlm/Lean-Github

Viewer • Updated Jul 25, 2024 • 219k • 126 • 37

dfuhoiysOHSVFh82934gfjklb

AI & ML interests

Recent Activity

Organizations

huba-buba's activity

Transformers v5: Simple model definitions powering the AI ecosystem

Forge: Scalable Agent RL Framework and Algorithm