Pull requests: vllm-project/compressed-tensors
#521 [Bugfix] Forward quantize better wrapping (opened Nov 18, 2025 by kylesayrs)
#518 support wInt4aFp8 for moe (opened Nov 12, 2025 by Wangzheee)
#514 [WIP] fix qparams decompression bug (opened Nov 10, 2025 by shanjiaz; label: Something isn't working)
#509 [MXFP4] Add calibration support (opened Nov 4, 2025 by dsikka)
#491 [Attention] Support FP4 attention quantization (opened Oct 14, 2025 by kylesayrs)
#398 [KV Cache] support kv cache int8 per channel quant (opened Jul 19, 2025 by Eviannn)
#358 Optimize sparse 2:4 compression performance (opened Jun 16, 2025 by rahul-tuli; draft, 8 tasks done)
#343 relax setuptools_scm version requirement (opened Jun 6, 2025 by envolution)