Skip to content
Change the repository type filter

All

    Repositories list

    • HPC cluster code and configurations for running on OCI
      Python
      Universal Permissive License v1.0
      14700Updated Mar 7, 2025Mar 7, 2025
    • mask

      Public
      Code for evaluating AI systems on the MASK honesty benchmark.
      Python
      MIT License
      0100Updated Mar 6, 2025Mar 6, 2025
    • Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"
      Jupyter Notebook
      MIT License
      44300Updated Feb 27, 2025Feb 27, 2025
    • ccc-docs

      Public
      CAIS Compute Cluster (CCC) documentation
      MIT License
      0130Updated Feb 27, 2025Feb 27, 2025
    • hle

      Public
      Humanity's Last Exam
      Python
      MIT License
      2654100Updated Feb 26, 2025Feb 26, 2025
    • AISES

      Public
      CSS
      2001Updated Feb 13, 2025Feb 13, 2025
    • CSS
      MIT License
      2040Updated Jan 27, 2025Jan 27, 2025
    • Measuring correlations between safety benchmarks and general AI capabilities benchmarks.
      Python
      MIT License
      1700Updated Oct 2, 2024Oct 2, 2024
    • HTML
      MIT License
      0300Updated Sep 20, 2024Sep 20, 2024
    • Forecasting.
      TypeScript
      113210Updated Sep 11, 2024Sep 11, 2024
    • HarmBench

      Public
      HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
      Jupyter Notebook
      MIT License
      79566245Updated Aug 16, 2024Aug 16, 2024
    • This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
      Python
      MIT License
      288500Updated May 19, 2024May 19, 2024
    • wmdp

      Public
      WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
      Jupyter Notebook
      MIT License
      3010381Updated Apr 27, 2024Apr 27, 2024
    • HTML
      MIT License
      0000Updated Mar 28, 2024Mar 28, 2024
    • JavaScript
      MIT License
      0100Updated Mar 6, 2024Mar 6, 2024
    • Prometheus exporter for performance metrics from Slurm.
      Go
      GNU General Public License v3.0
      156251Updated Nov 1, 2023Nov 1, 2023
    • Jupyter Notebook
      0300Updated Oct 30, 2023Oct 30, 2023
    • reading

      Public
      1100Updated Oct 26, 2023Oct 26, 2023
    • Cost-effectiveness models, tools, and results for various AI safety field-building programs.
      Python
      MIT License
      4602Updated Aug 15, 2023Aug 15, 2023
    • Website for the Trojan Detection Challenge NeurIPS 2022 competition
      JavaScript
      MIT License
      0000Updated Jul 28, 2023Jul 28, 2023
    • GoSlurmMailer - drop in replacement for default slurm MailProg. Delivers slurm job messages to various destinations.
      Go
      7000Updated Jun 21, 2023Jun 21, 2023
    • 206700Updated May 31, 2023May 31, 2023