Experimental TurboQuant implementation and llama.cpp-style integration path for long-context inference
-
Updated
Mar 29, 2026 - C++
Experimental TurboQuant implementation and llama.cpp-style integration path for long-context inference
Add a description, image, and links to the cude topic page so that developers can more easily learn about it.
To associate your repository with the cude topic, visit your repo's landing page and select "manage topics."