arxiv:2508.06905
Sinan Wang
wsnHowest
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
Are We on the Right Way to Assessing LLM-as-a-Judge?
new activity 10 days ago
ONE-Lab/MultiRef-benchmark: Missing image files in images folder of MultiRef-benchmark dataset
updated a dataset 10 days ago
ONE-Lab/MultiRef-benchmark