Popular repositories Loading
-
TruthfulQA
TruthfulQA PublicTruthfulQA: Measuring How Models Imitate Human Falsehoods
-
CalibratedMath
CalibratedMath PublicTeaching Models to Express Their Uncertainty in Words
-
BIG-bench
BIG-bench PublicForked from google/BIG-bench
Beyond the Imitation Game collaborative benchmark for enormous language models
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.