McGill NLP

llm2vec Public

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1.6k 130

webllama Public

Llama-3 agents that can browse the web by following instructions and talking to you

Python 1.4k 109

nano-aha-moment Public

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 535 51

VinePPO Public

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 174 21

weblinx Public

WebLINX is a benchmark for building web navigation agents with conversational capabilities

Python 158 17

bias-bench Public

ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.

Python 147 41

Provide feedback