dpo
Here are 51 public repositories matching this topic...
ms-swift: Use PEFT or full-parameter training to fine-tune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4, Internlm2.5, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Updated Jul 7, 2024 - Python
Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon
Updated Jul 5, 2024 - Python
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Updated Jul 3, 2024 - Python
CodeUltraFeedback: aligning large language models to coding preferences
Updated Jun 25, 2024 - Python
Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
Updated Jun 21, 2024 - Python
This is the DPO Pay plugin for WooCommerce.
Updated May 28, 2024 - PHP
An open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.
Updated May 27, 2024 - Python
This is the DPO Group plugin for Gravity Forms.
Updated Apr 29, 2024 - PHP