26 3

AI Papers Academy

aipapersacademy

AI & ML interests

None yet

Recent Activity

commented on a paper about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

commented on a paper about 2 months ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

commented on a paper about 2 months ago

mHC: Manifold-Constrained Hyper-Connections

View all activity

Organizations

None yet

commented a paper about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 228 •

commented 2 papers about 2 months ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Paper • 2509.03646 • Published Sep 3, 2025 • 33 •

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 313 •

commented a paper 4 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 509 •

commented a paper 5 months ago

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 298 •

commented a paper 6 months ago

Hierarchical Reasoning Model

Paper • 2506.21734 • Published Jun 26, 2025 • 48 •

commented a paper 8 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

commented a paper 10 months ago

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

Paper • 2504.13180 • Published Apr 17, 2025 • 20 •

commented a paper 11 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144 •

commented a paper 12 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113 •

commented 8 papers about 1 year ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75 •

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 441 •

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 288 •

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108 •

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 94 •

commented 2 papers over 1 year ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 47 •

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 77 •

AI Papers Academy

AI & ML interests

Recent Activity

Organizations

aipapersacademy's activity