- [2026-06-10] DefTruth, Butterfingrz (2026). FFPA: Efficient Flash Prefill Attention for Large Head Dimensions via Split-D. Zenodo, 2026.
xlite-dev
Repositories
- Awesome-DiT-Inference Public
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
- cache-dit Public Forked from vipshop/cache-dit
A PyTorch-native inference engine with cache, parallelism, quantization for Diffusion Transformers.
- cllms-for-copilot Public Forked from appledragon/cllms-for-copilot
Pick Qwen, GLM, MiniMax, Xiaomi MiMo, Moonshot Kimi & Tencent Hunyuan models from the Copilot Chat model picker. Vision, thinking, BYOK.
- ffpa-attn Public
🤖FFPA: Extends FlashAttention-2 via Split-D for large head dimensions, 1.5×~3×↑🎉 vs SDPA, up to 430T🎉 on H200.
- .github Public
- sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
- deepseek-v4-for-copilot Public Forked from Vizards/deepseek-v4-for-copilot
Pick DeepSeek V4 from the Copilot Chat model picker — and keep everything else Copilot already gives you.
- LeetCUDA Public
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
- diffusers Public Forked from huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
- svdquant-kernels Public Forked from ultism/svdquant-kernels
Cross-architecture CUDA kernels for SVDQuant (W4A4 with low-rank correction)
