AGIHouse
Popular repositories Loading
-
-
evals
evals PublicForked from openai/evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
Python
-
haltt4llm
haltt4llm PublicForked from manyoso/haltt4llm
This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious current problem in widespread adoption of LLM's for many real…
Python
-
Repositories
- openscience Public
AGIHouse/openscience’s past year of commit activity - evals Public Forked from openai/evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
AGIHouse/evals’s past year of commit activity - haltt4llm Public Forked from manyoso/haltt4llm
This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious current problem in widespread adoption of LLM's for many real purposes.
AGIHouse/haltt4llm’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…