Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119 Loading…
4 of 5 tasks
Fix for moe on sm110
#2190 opened Dec 9, 2025 by jhalabi-nv Draft
5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz Loading…
5 tasks
Fix/moe_sm110 (to be tested)
#2183 opened Dec 6, 2025 by aleozlx Draft
5 tasks
Enable Hopper FA3 FP8 attention
#2148 opened Nov 28, 2025 by nvpohanh Draft
5 tasks
perf: using multi-cta optimization for top-k/top-p
#2119 opened Nov 20, 2025 by yzh119 Loading…
4 of 5 tasks
Refactor trtllm_mnnvl_allreduce
#2118 opened Nov 20, 2025 by timlee0212 Loading…
5 tasks done
feat: support more head dim in RoPE kernel
#2109 opened Nov 19, 2025 by raayandhar Loading…
5 tasks done
Port TRT-LLM communication kernels to flashinfer
#2102 opened Nov 18, 2025 by djns99 Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2101 opened Nov 17, 2025 by xuanzic Loading…
5 tasks
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss Loading…
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
Blockwise GEMM with all reduce overlapping
#2007 opened Oct 30, 2025 by Amir-19 Draft
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119 Loading…
5 tasks
add blockwise gemm cute dsl
#1922 opened Oct 13, 2025 by Amir-19 Loading…
5 tasks
Sampling non contiguous
#1916 opened Oct 12, 2025 by zcin Loading…
5 tasks done
ProTip! Exclude everything labeled bug with -label:bug.