Skip to content

feature: adding nightly wheel workflow and indexer#14924

Merged
Kangyan-Zhou merged 11 commits intomainfrom
nightly-wheel
Dec 13, 2025
Merged

feature: adding nightly wheel workflow and indexer#14924
Kangyan-Zhou merged 11 commits intomainfrom
nightly-wheel

Conversation

@dougyster
Copy link
Collaborator

@dougyster dougyster commented Dec 11, 2025

Motivation

Adding workflow to build, version, and store nightly whl.

Modifications

Added a release-pypi-nightly file along with an index creation script.

Accuracy Tests

test nightly wheel successfully uploaded to: https://github.com/dougyster/test-whl/releases/tag/nightly-2025-12-13-53fb9ae8e

test nightly image can be downloaded via:

pip install sglang==0.5.6.post2.dev7788+g82c390d22 --extra-index-url https://dougyster.github.io/test-whl/nightly/

Benchmarking and Profiling

N/A.

Checklist

@gemini-code-assist
Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@Kangyan-Zhou Kangyan-Zhou merged commit b2e240b into main Dec 13, 2025
44 checks passed
@Kangyan-Zhou Kangyan-Zhou deleted the nightly-wheel branch December 13, 2025 08:26
Liwansi added a commit to iforgetmyname/sglang that referenced this pull request Dec 13, 2025
…n_eagle3_npu

* 'main' of https://github.com/sgl-project/sglang: (25 commits)
  [NPU] perf update with kvcache nz & w4a8 quant (sgl-project#14423)
  [PP Prefill][NIXL] Fix PP mode transfer completion tracking to wait for all ranks (sgl-project#15027)
  Fix GLM-4.6 tool calls don't support streaming output for arguments i… (sgl-project#13989)
  feature: adding nightly wheel workflow and indexer (sgl-project#14924)
  [diffusion] feat: Improve LoRA compatibility by adding unified format detection and diffusers-based normalization (sgl-project#14659)
  [Fix] Disable trtllm moe backend for draft model for a qucik fix (sgl-project#15002)
  [diffusion] fix: use NDRotaryEmbedding in flux_2   (sgl-project#15034)
  Mistral Large 3 NVFP4 support (sgl-project#14485)
  call check_quantized_moe_compatibility after initialize (sgl-project#13876)
  Add sgl_router_attempt_http_responses_total for single attempt information (sgl-project#15037)
  Add error code in prometheus metrics and add X-SMG-Error-Code header (sgl-project#15036)
  Provide more fine grained error reason for reqwest error (sgl-project#15032)
  Tiny change http router response format to unify (sgl-project#15031)
  Tiny unify grpc existing error responses into new format (sgl-project#15030)
  Add `code` field and unify error responses for router (sgl-project#15028)
  Super tiny remove unused log_request (sgl-project#15035)
  Fix decode OOM caused by retraction (sgl-project#14939)
  [CI]Add gb200 runner back (sgl-project#15024)
  Add a special label for b200 CI runner that can run kernel tests (sgl-project#15033)
  Fix regression caused by fa3 block_table (sgl-project#15009)
  ...

# Conflicts:
#	python/sglang/srt/hardware_backend/npu/attention/ascend_backend.py
Prozac614 pushed a commit to Prozac614/sglang that referenced this pull request Dec 17, 2025
YChange01 pushed a commit to YChange01/sglang that referenced this pull request Jan 13, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants