Pull requests: sgl-project/sglang
Pull requests list
[LoRA] DSA indexer targets + MoE-LoRA cuda-graph and RL adapter-reload fixes
lora
#29874
opened Jul 1, 2026 by
yushengsu-thu
Collaborator
docs(GLM-5.1): add --enable-aiter-allreduce-fusion to MXFP4 MI355X command
documentation
Improvements or additions to documentation
#29873
opened Jul 1, 2026 by
jiacao-amd
[Package] Move easydict (LGPL-3.0) to an optional remote-models extra
dependencies
Pull requests that update a dependency file
npu
#29869
opened Jul 1, 2026 by
gaurav0107
5 tasks done
[Spec][4/N] Decoupled speculative decoding: ignore_decode_budget for the drafter engine
run-ci
#29868
opened Jul 1, 2026 by
zhendonghua
Contributor
feat(short-conv): shared ShortConvAttnBackend for ZAYA1 CCA + LFM2 short conv
bypass-fastfail
run-ci
#29867
opened Jul 1, 2026 by
ch-wan
Collaborator
Fix capture-mode detection during breakable CUDA graph capture
run-ci
run-ci-extra
#29866
opened Jul 1, 2026 by
cctry
Collaborator
[diffusion] Refresh LTX HQ consistency GT
diffusion
SGLang Diffusion
#29863
opened Jul 1, 2026 by
mickqian
Collaborator
Fix SWA eviction tombstoning the last leaf
bypass-fastfail
run-ci
run-ci-extra
#29860
opened Jul 1, 2026 by
ispobock
Collaborator
[HiCache] Unified Mooncake Registration for Logical Anchors and Draft Pools
#29859
opened Jul 1, 2026 by
stmatengss
Collaborator
5 tasks
Build SWA window kv buffers for the EAGLE draft-extend cuda-graph path
#29858
opened Jul 1, 2026 by
reger-men
[AMD][DI][CI] 3/N Add Kimi K2.6 FP8 MI355X 1P1D nightly recipes
#29855
opened Jul 1, 2026 by
Lzy17
Contributor
docs: complete production metrics reference
documentation
Improvements or additions to documentation
#29854
opened Jul 1, 2026 by
anencore94
bugfix for npu Grok2 model --detokenizer without all special ids
#29853
opened Jul 1, 2026 by
McZyWu
Contributor
5 tasks
Support MOSS-Transcribe-Diarize model and adapter
#29850
opened Jul 1, 2026 by
CloudRipple
Contributor
2 of 5 tasks
[Feature] Add DSA CP shared KV cache
documentation
Improvements or additions to documentation
#29847
opened Jul 1, 2026 by
taoyuanyuan
•
Draft
5 tasks done
[CI] Restore B200 NVFP4 thresholds and sanitize CUDA env
dependencies
Pull requests that update a dependency file
diffusion
SGLang Diffusion
run-ci
run-ci-extra
#29844
opened Jul 1, 2026 by
BBuf
Collaborator
[trtllm_mha] Fuse cuda-graph metadata rebuild into one triton kernel
blackwell
SM100/SM120
run-ci
#29843
opened Jul 1, 2026 by
pranjalssh
Contributor
pad customized_info for mixed output batches
bypass-fastfail
run-ci
run-ci-extra
#29842
opened Jul 1, 2026 by
yaof20
Contributor
feat: integrate hpc-ops attention and MoE backends
#29839
opened Jul 1, 2026 by
kyang669
4 of 5 tasks