Pull requests: sgl-project/sglang
[session] Fix reaper closing sessions mid-decode; forward session_params in /v1/completions
#28140
opened Jun 13, 2026 by
discobot
3 of 5 tasks
[CI] Kernel benchmark regression gate (draft / RFC)
documentation
Improvements or additions to documentation
sgl-kernel
[Kernel] Add SM120/SM121 dispatch for int8_scaled_mm
sgl-kernel
#28137
opened Jun 13, 2026 by
waynehacking8
3 tasks done
[Core] Reduce RoPE cache size for shorter context length
#28136
opened Jun 13, 2026 by
labAxiaoming
5 tasks
fix(openai): reject request-supplied chat_template by default
#28135
opened Jun 13, 2026 by
Sunt-ing
Contributor
5 tasks done
[Spec] Clear dead DRAFT_EXTEND objects left after EAGLE v1 removal
bypass-fastfail
run-ci
run-ci-extra
#28133
opened Jun 13, 2026 by
ch-wan
Collaborator
fix(quant): support block FP8 strategy for CompressedTensorsW8A16Fp8
quant
LLM Quantization
#28132
opened Jun 13, 2026 by
Sunt-ing
Contributor
4 tasks done
fix(multimodal): return 400 for corrupt image inputs
#28131
opened Jun 13, 2026 by
Sunt-ing
Contributor
3 tasks done
fix(quant): dispatch compressed-tensors ParallelLMHead
quant
LLM Quantization
#28130
opened Jun 13, 2026 by
Sunt-ing
Contributor
4 tasks done
[Spec] Remove deprecated EAGLE v1 DRAFT_EXTEND forward mode
blackwell
SM100/SM120
bypass-fastfail
deepseek
npu
run-ci
run-ci-extra
speculative-decoding
#28129
opened Jun 13, 2026 by
ch-wan
Collaborator
feat(cookbook): generic config-declared flagSelects playground axis
documentation
Improvements or additions to documentation
#28128
opened Jun 13, 2026 by
zijiexia
Collaborator
[diffusion] Improve server warmup coverage
diffusion
SGLang Diffusion
run-ci
#28127
opened Jun 13, 2026 by
mickqian
Collaborator
feat(moe): add SM120/SM121 dispatch for fp8_blockwise_scaled_grouped_mm
sgl-kernel
#28125
opened Jun 13, 2026 by
waynehacking8
3 tasks done
[AMD] GQA-grouped split-K verify kernel for EAGLE3/MTP extend attention
#28124
opened Jun 13, 2026 by
JohnQinAMD
5 tasks
[diffusion] test: tighten perf baselines
diffusion
SGLang Diffusion
run-ci
#28123
opened Jun 13, 2026 by
mickqian
Collaborator
[MLX] Add Metal profiling hooks to bench_offline_throughput and server profiler
#28122
opened Jun 13, 2026 by
LijuanTang94
Contributor
[HiSparse] fix: populate req_pool_indices_cpu in staging-to-decode batch
run-ci
#28121
opened Jun 13, 2026 by
alphabetc1
Collaborator
【bugfix】The NPU's forward_dsa_prepare_npu also needs special handling for is_nextn
deepseek
npu
#28118
opened Jun 13, 2026 by
littleyellowbicycle
Contributor
4 tasks
fix: preserve GLM string ids with underscores
#28114
opened Jun 13, 2026 by
he-yufeng
Contributor
[Platform] Route pin memory availability through current_platform
documentation
Improvements or additions to documentation
#28113
opened Jun 13, 2026 by
N3u0ns
5 tasks done
[LoRA] Support DSA indexer LoRA targets for GLM-5.1 / DeepSeek-V3.2-family models
lora
run-ci
#28110
opened Jun 13, 2026 by
jybsuper
Collaborator
Relax apache-tvm-ffi dependency constraints
dependencies
Pull requests that update a dependency file
#28109
opened Jun 13, 2026 by
sladyn98
5 tasks done
[attn backend] Make seq_lens_cpu optional in trtllm_mha backend
blackwell
SM100/SM120
#28106
opened Jun 12, 2026 by
JonnyKong
Contributor
4 of 5 tasks