qwen3

Here are 54 public repositories matching this topic...

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Updated Sep 26, 2025
Python

1Panel-dev / MaxKB

Star

🔥 MaxKB is an open-source platform for building enterprise-grade agents. MaxKB 是强大易用的开源企业级智能体平台。

agent chatbot knowledgebase rag llm langchain pgvector ollama maxkb llama3 agentic-ai mcp-server deepseek-r1 qwen3

Updated Sep 26, 2025
Python

sgl-project / sglang

Star

SGLang is a fast serving framework for large language models and vision language models.

Updated Sep 26, 2025
Python

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Updated Sep 25, 2025
Python

OpenPipe / ART

Star

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

agent reinforcement-learning rl lora llms qwen agentic-ai grpo qwen3

Updated Sep 26, 2025
Python

zilliztech / deep-searcher

Star

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

agent openai grok claude rag milvus vector-database llm zilliz deepseek agentic-rag grok3 reasoning-models deepseek-r1 deep-research qwen3 llama4

Updated Jul 10, 2025
Python

xlite-dev / Awesome-LLM-Inference

Star

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

mla vllm llm-inference awesome-llm flash-attention tensorrt-llm paged-attention deepseek flash-attention-3 deepseek-v3 minimax-01 deepseek-r1 flash-mla qwen3

Updated Aug 19, 2025
Python

EvolvingLMMs-Lab / LLaVA-OneVision-1.5

Star

Fully Open Framework for Democratized Multimodal Training

llm mllm vision-language-model llava qwen3

Updated Sep 26, 2025
Python

hud-evals / hud-python

Star

OSS RL environment + evals toolkit

reinforcement-learning rl lora reinforcement-learning-environments llm llms qwen grpo qwen3

Updated Sep 26, 2025
Python

Zeyi-Lin / Qwen3-Medical-SFT

Star

Qwen3 Fine-tuning: Medical R1 Style Chat

r1 fine-tuning sft qwen3

Updated May 31, 2025
Python

NetEase-Media / grps_trtllm

Star

Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.

Updated May 14, 2025
Python

NVIDIA-NeMo / Automodel

Star

DTensor-native pretraining and fine-tuning for LLMs/VLMs with day-0 Hugging Face support, GPU-acceleration, and memory efficiency.

python machine-learning ai pytorch openai llama mistral vlm finetuning huggingface llm llm-training finetuning-llms qwen llama3 gemma3 qwen3 gemma3n

Updated Sep 26, 2025
Python

aws-samples / easy-model-deployer

Star

Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.

Updated Aug 26, 2025
Python

AaronFeng753 / Better-Qwen3

Star

Auto Thinking Mode switch for Qwen3 in Open webui

qwen open-webui qwen3

Updated May 8, 2025
Python

bold84 / cot_proxy

Star

Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and <think> tag filtering. Perfect for using advanced models with apps that lack parameter customization.

llm qwen3