Popular repositories
- turboquant-mlx (Public): TurboQuant KV cache compression for MLX with fused Metal kernels. 4.6x compression at 98% of FP16 speed.
- vllm-mlx (Public, forked from waybarrios/vllm-mlx; Python): OpenAI- and Anthropic-compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX …


