Skip to content
#

iclr-2026

Here are 14 public repositories matching this topic...

Near-optimal vector quantization from Google's ICLR 2026 paper — 95% recall, 5x compression, zero preprocessing, pure Python FAISS replacement

  • Updated Mar 28, 2026
  • Python

Near-optimal vector quantization for LLM KV cache compression. Python implementation of TurboQuant (ICLR 2026) — PolarQuant + QJL for 3-bit quantization with minimal accuracy loss and up to 8x memory reduction.

  • Updated Mar 28, 2026
  • Python

AI agent skill implementing Google's TurboQuant compression algorithm (ICLR 2026) — 6x KV cache memory reduction, 8x speedup, zero accuracy loss. Compatible with Claude Code, Codex CLI, and all Agent Skills-compatible tools.

  • Updated Mar 28, 2026
  • Python

Improve this page

Add a description, image, and links to the iclr-2026 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the iclr-2026 topic, visit your repo's landing page and select "manage topics."

Learn more