Neural Magic
Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM
Pinned Loading
Repositories
Showing 10 of 81 repositories
- compressed-tensors Public
A safetensors extension to efficiently store sparse quantized tensors on disk
neuralmagic/compressed-tensors’s past year of commit activity - DeepEP-test Public Forked from smarterclayton/DeepEP
DeepEP: an efficient expert-parallel communication library
neuralmagic/DeepEP-test’s past year of commit activity - arena-hard-auto Public Forked from lmarena/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
neuralmagic/arena-hard-auto’s past year of commit activity - model-validation-configs Public
neuralmagic/model-validation-configs’s past year of commit activity - pytorch Public Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
neuralmagic/pytorch’s past year of commit activity - pydantic-regmix Public
Common mixins, registries, and utilities with native support for Pydantic used across popular repos such as GuideLLM and Speculators
neuralmagic/pydantic-regmix’s past year of commit activity - upstream-transformers Public Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
neuralmagic/upstream-transformers’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…