RLFR - a JingHaoZ Collection

RLFR

updated Oct 14, 2025

Extending Reinforcement Learning for LLMs with Flow Environment

JingHaoZ/RLFR-Qwen2.5-Math-7B

Text Generation • 8B • Updated Oct 14, 2025 • 6

JingHaoZ/RLFR-Qwen2.5-VL-7B-Instruct

Image-to-Text • 8B • Updated Oct 14, 2025 • 9 • 1

JingHaoZ/RLFR-Dataset-LM

Viewer • Updated Nov 14, 2025 • 102k • 85

JingHaoZ/RLFR-Dataset-VLM

Preview • Updated Oct 14, 2025 • 24

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11, 2025 • 35