arxiv:2407.10490
YI REN
Joshua-Ren
AI & ML interests
LLM, Cognitive science
Recent Activity
upvoted
a
paper
14 days ago
On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral
upvoted
a
paper
2 months ago
SimKO: Simple Pass@K Policy Optimization
Organizations
None yet