Skip to content

Pull requests: GeeeekExplorer/nano-vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add fp8 (e4m3) weight-only quantization
#241 opened Jun 10, 2026 by HaozheZhang6 Loading…
Change return type from list[str] to list[dict]
#238 opened Jun 4, 2026 by ZhengWentong Loading…
add qwen3-moe
#236 opened May 29, 2026 by shhyQAQ Loading…
set up uv env
#235 opened May 26, 2026 by hchen549 Loading…
Add Qwen3.5 (text + multimodal) support
#232 opened May 13, 2026 by 86MaxCao Loading…
Add SDPA attention fallback and design docs
#231 opened May 12, 2026 by Lebhoryi Loading…
Fix RMSNorm fp32 input mutation
#205 opened Apr 14, 2026 by JxKim Loading…
Fix CUDA graph block_tables shape mismatch
#191 opened Mar 24, 2026 by ilrewrite Loading…
Feature/support llama3
#188 opened Mar 21, 2026 by wudong5 Loading…
docs: add Chinese README and language links
#183 opened Mar 8, 2026 by LJS1124 Loading…
ProTip! Filter pull requests by the default branch with base:main.