narimannr2x/confidence_scoring

Self-reported confidence scoring of LLMs on gastroenterology board exam self-assessment questions

This repo contains the code for extracting and analyzing the self-reported confidence scores of LLMs. The code for generating answers and measuring their accuracy is available at VLM-LLM-in-Gastroenterology.

The preprint of the paper is available on arXiv: https://arxiv.org/abs/2503.18562

Team:

Nariman Naderi, Seyed Amir Ahmad Safavi-Naini, Thomas Savage, Ali Soroush

If you use this code or data in your research, please cite our paper:

@misc{naderi2025selfreportedconfidencelargelanguage,
      title={Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models}, 
      author={Nariman Naderi and Seyed Amir Ahmad Safavi-Naini and Thomas Savage and Zahra Atf and Peter Lewis and Girish Nadkarni and Ali Soroush},
      year={2025},
      eprint={2503.18562},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2503.18562}, 
}
