arxiv:2410.16090
Xiaotang Du
acDante
AI & ML interests
None yet
Recent Activity
Liked a dataset 1 day ago: edinburgh-dawg/mmlu-redux-2.0
Authored a paper about 2 months ago: The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models
Authored a paper about 2 months ago: Are We Done with MMLU?
Organizations
Models: None public yet
Datasets: None public yet