-
GenEx: Generating an Explorable World
Paper • 2412.09624 • Published • 98 -
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Paper • 2501.02790 • Published • 8 -
Who's Your Judge? On the Detectability of LLM-Generated Judgments
Paper • 2509.25154 • Published • 30 -
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
Paper • 2509.25760 • Published • 55
seba
sdonoso
AI & ML interests
None yet
Recent Activity
updated a collection 12 days ago
papers updated a collection 18 days ago
papers updated a collection about 1 month ago
papers