Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
-
Updated
Jun 7, 2024 - Python
Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4, Internlm2.5, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
🐋MindChat(漫谈)——心理大模型:漫谈人生路, 笑对风霜途
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing"
🧘🏻♂️KarmaVLM (相生):A family of high efficiency and powerful visual language model.
Use your open source local model from the terminal
Visual Instruction Tuning for Qwen2 Base Model
A demo of expanding the vocabulary of the Llama3 model, applicable to other vocabularies that use TikToken as well.
The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. Utilizing advanced AI models and custom extraction strategies, this toolkit helps users efficiently gather data like titles, descriptions, and keywords, which are crucial for SEO and content strategy.
Add a description, image, and links to the qwen2 topic page so that developers can more easily learn about it.
To associate your repository with the qwen2 topic, visit your repo's landing page and select "manage topics."