Skip to content
View loganriggs's full-sized avatar

Block or report loganriggs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. sae-rm sae-rm Public

    Using SAE's to interpret Reward Models (RM)

    Jupyter Notebook 4 2

  2. sparse_coding sparse_coding Public

    Forked from HoagyC/sparse_coding

    Jupyter Notebook 7 5

  3. Optimal-Policies-Tend-To-Seek-Power Optimal-Policies-Tend-To-Seek-Power Public

    Code for the paper "Optimal Policies Tend To Seek Power"

    Mathematica 1

  4. alignment-research-dataset alignment-research-dataset Public

    Forked from moirage/alignment-research-dataset

    A dataset of alignment research and code to reproduce it

    Python

  5. STFT_wifi_physical_fingerprint STFT_wifi_physical_fingerprint Public

    Python 1

  6. white-box white-box Public

    Forked from AlignmentResearch/tuned-lens

    Tools for understanding how transformer predictions are built layer-by-layer

    Jupyter Notebook