
Organizations

@EleutherAI

StellaAthena/README.md

Hi there 👋, my name is Stella Biderman

I'm an AI researcher seeking to better understand how large language models work.

  • 🔭 I’m currently working on language model interpretability with Pythia
  • 🤔 I’m looking for help with statistical models of learning dynamics and designing custom datasets to test theories about language models.
  • 💬 Ask me about training large language models
  • 😄 Pronouns: she/her

Catch me on:

Google Scholar, Twitter, Stack Exchange

Pinned

  1. EleutherAI/gpt-neox (Public)

    An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

    Python · 7k stars · 1k forks

  2. EleutherAI/gpt-neo (Public archive)

    An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

    Python · 8.2k stars · 955 forks

  3. EleutherAI/the-pile (Public)

    Python · 1.5k stars · 132 forks

  4. EleutherAI/pythia (Public)

    The hub for EleutherAI's work on interpretability and learning dynamics

    Jupyter Notebook · 2.3k stars · 176 forks