Skip to content

Commit

Permalink
Merge branch 'main' of https://github.com/zhuohaoyu/FreeEval into main
Browse files Browse the repository at this point in the history
  • Loading branch information
zhuohaoyu committed Mar 19, 2024
2 parents 5aad782 + 4ae75a5 commit 06af7cd
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

<div align="center">

<img src="https://github.com/zhuohaoyu/FreeEval-Private/assets/8074086/887aba98-c1b6-4750-aeb6-f154ba54c7a7" width="400px">
<img src="./assets/images/freeeval-logo.png" width="400px">


**FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models**
Expand All @@ -25,7 +25,7 @@
FreeEval is a modular and extensible framework for conducting trustworthy and efficient automatic evaluations of large language models (LLMs). The toolkit unifies various evaluation approaches, including dataset-based evaluators, reference-based metrics, and LLM-based evaluators, within a transparent and reproducible framework. FreeEval incorporates meta-evaluation techniques such as human evaluation and data contamination detection to enhance the reliability of evaluation results. The framework is built on a high-performance infrastructure that enables efficient large-scale evaluations across multi-node, multi-GPU clusters, supporting both open-source and proprietary LLMs. With its focus on modularity, trustworthiness, and efficiency, FreeEval aims to provide researchers with a standardized and comprehensive platform for gaining deeper insights into the capabilities and limitations of LLMs.

<div align="center">
<img width="1173" alt="FreeEval Pipeline" src="https://github.com/zhuohaoyu/FreeEval-Private/assets/8074086/a7b42428-d7cf-4095-bbb4-e8dc7d08b9d7">
<img width="1173" alt="FreeEval Pipeline" src="./assets/images/freeeval-pipeline-github.png">
</div>

## Quick Start
Expand Down

0 comments on commit 06af7cd

Please sign in to comment.