- jaygala24/Qwen3-4B-GRPO-KL-math-reasoning
  Text Generation • 4B • Updated • 1.07k
- jaygala24/Qwen3-4B-GRPO-math-reasoning
  Text Generation • 4B • Updated • 890
- jaygala24/Qwen3-4B-ReMax-math-reasoning
  Text Generation • 4B • Updated • 853
- jaygala24/Qwen3-4B-RLOO-math-reasoning
  Text Generation • 4B • Updated • 286
Jay Gala
jaygala24
AI & ML interests
Machine Learning, Natural Language Processing, Language and Vision Intersection, Fairness and Biases
Recent Activity
- updated a dataset about 3 hours ago: jaygala24/reasoning-geometry
- published a dataset about 23 hours ago: jaygala24/reasoning-geometry
- updated a collection 1 day ago: RL post-training