-
openai-agents-python Public
Forked from openai/openai-agents-python
A lightweight, powerful framework for multi-agent workflows
Python MIT License Updated Mar 13, 2025
-
scrape-openai-code-interpreter Public
Forked from simonw/scrape-openai-code-interpreter
Scrape details about Code Interpreter to track any changes
-
native_llama Public
An experimental repo containing a native Llama 3 model and a generator with KV caches
Python Updated Jan 27, 2025
-
Cosmos Public
Forked from NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Python Apache License 2.0 Updated Jan 9, 2025
-
alpha_zero Public
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
-
leaked-system-prompts Public
Forked from jujumilk3/leaked-system-prompts
Collection of leaked system prompts
Updated Sep 11, 2024
-
Llama3-FunctionCalling Public
Fine-tune the Llama 3 model to support function calling
-
miniGPT Public archive
An attempt to implement pre-training and fine-tuning of the GPT-2 model for research and educational purposes.
-
RAG-LLaMA Public archive
A clean and simple implementation of Retrieval Augmented Generation (RAG) to enhance the LLaMA chat model so it can answer questions from a private knowledge base. We use Tesla user manuals to build the know…
-
DPO-LLaMA Public archive
A clean implementation of direct preference optimization (DPO) to train the LLaMA 2 model to align with human preferences.
-
InstructLLaMA Public
Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…
-
art-of-reinforcement-learning Public
Forked from Apress/art-of-reinforcement-learning
Original source code for The Art of Reinforcement Learning by Michael Hu
Python Other Updated Feb 28, 2024
-
MM-LLaMA Public archive
Bring multimodality to the LLaMA model by leveraging ImageBind as the modal encoder. This project supports vision input (both images and short videos) to the LLaMA model, with text output generated…
-
deep_rl_zoo Public archive
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
-
QLoRA-LLM Public archive
A simple custom QLoRA implementation for fine-tuning a language model (LLM) with basic tools such as PyTorch and Bitsandbytes, completely decoupled from Hugging Face.
-
muzero Public archive
A PyTorch implementation of DeepMind's MuZero agent
-
ReservoirComputing Public
Implementing Reservoir Computing Networks for Predicting Dynamic Systems
Jupyter Notebook MIT License Updated Sep 27, 2023
-
VisionTransformer Public archive
Implementing vision transformer for image classification
Python MIT License Updated Sep 27, 2023
-
SAP-UI5-Development-Re-Introduction Public archive
This is the official source code for the Udemy course SAP UI5 Development Re-Introduction