Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted
a
paper
3 days ago
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling
upvoted
a
paper
15 days ago
Training AI Co-Scientists Using Rubric Rewards