dumbequation/Qwen2.5-3B-reasoning-medical-symptoms-GRPO-f16 Text Generation • Updated Feb 19, 2025 • 2
dumbequation/Qwen2.5-7B-GRPO-1M-Context-Medical-Reasoning-f16 Text Generation • 8B • Updated Mar 4, 2025 • 3 • 1