Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Feature/moe bf16 pr
#1859 opened Oct 4, 2025 by aleozlx Draft
4 of 5 tasks
ci: add release workflow for flashinfer-jit-cache package
#1858 opened Oct 4, 2025 by yzh119 Loading…
3 of 5 tasks
[wip] test: add coverage for all cli commands
#1848 opened Oct 2, 2025 by sricketts Draft
2 of 5 tasks
[DO NOT MERGE][WIP] lint: Add clang-tidy to pre-commits
#1845 opened Oct 2, 2025 by yzh119 Loading…
5 tasks
use ffi::TensorView instead of ffi::Tensor
#1844 opened Oct 2, 2025 by cyx-6 Loading…
5 tasks
Update the routing for TRTLLMGEN to support kimi k2 and qwen
#1831 opened Oct 1, 2025 by ChristinaZ Loading…
3 of 5 tasks
feat: trtrllm-gen global scaled FP8 GEMMs
#1829 opened Oct 1, 2025 by hypdeb Loading…
chore: allow custom paths for external dependencies like CUTLASS
#1827 opened Oct 1, 2025 by yzh119 Loading…
4 of 5 tasks
Fix sm120 fp8 groupwise gemm
#1820 opened Sep 30, 2025 by yongwww Draft
5 tasks done
feat:enable fp8 blockscale moe for fused cultass for sm90
#1819 opened Sep 30, 2025 by djmmoss Loading…
5 tasks done
Support checks PoC
#1809 opened Sep 29, 2025 by nvmbreughe Draft
1 of 5 tasks
Add CUDA arch 12.0 to installation guide
#1785 opened Sep 26, 2025 by mgoin Loading…
5 tasks
add xqa fp8 mha and fp8 kv cache
#1769 opened Sep 25, 2025 by qsang-nv Loading…
5 tasks done
misc: checksum check when downloading artifacts
#1761 opened Sep 23, 2025 by jimmyzho Loading…
3 of 5 tasks
fix the dequantize_block in the trtllm_cutlass fuse moe test
#1721 opened Sep 18, 2025 by rainj-me Loading…
5 tasks done
Tiny optimizations for moe
#1717 opened Sep 18, 2025 by fzyzcjy Loading…
5 tasks
misc: Add cuda graph tests for invariant FA2
#1704 opened Sep 17, 2025 by Edenzzzz Loading…
5 tasks
misc: fix vector size calculation for fp4
#1702 opened Sep 17, 2025 by yzh119 Loading…
5 tasks
refactor: refactor xqa interface
#1701 opened Sep 17, 2025 by yzh119 Loading…
5 tasks
[wip] refactor: remove csrc/nv_internal
#1655 opened Sep 9, 2025 by yzh119 Loading…
5 tasks
feat: trtllm-gen attention with dynamic scale
#1630 opened Sep 3, 2025 by yyihuang Draft
5 tasks
re-enable torch.compile
#1576 opened Aug 26, 2025 by guilhermeleobas Loading…
4 of 5 tasks
ProTip! Add no:assignee to see everything that’s not assigned.