sylinrl

TruthfulQA (Public)

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Jupyter Notebook, 621 stars, 71 forks

CalibratedMath (Public)

Teaching Models to Express Their Uncertainty in Words

Python, 36 stars, 5 forks

BIG-bench (Public)

Forked from google/BIG-bench

Beyond the Imitation Game collaborative benchmark for enormous language models

Python