@neuralmagic

Neural Magic

Neural Magic (acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM.

Pinned

  1. nm-vllm-certs Public

    General Information, model certifications, and benchmarks for nm-vllm enterprise distributions

    11 2

  2. deepsparse Public

    Sparsity-aware deep learning inference runtime for CPUs

    Python 3.1k 186

  3. sparseml Public

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python 2.1k 155

  4. docs Public

    Top-level directory for documentation and general content

    MDX 121 7

  5. sparsezoo Public

    Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

    Python 384 28

  6. guidellm Public

    Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

    Python 304 39

Repositories

Showing 10 of 72 repositories
  • guidellm Public

    Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

    Python 304 Apache-2.0 39 35 (2 issues need help) 16 Updated May 22, 2025
  • compressed-tensors Public

    A safetensors extension to efficiently store sparse quantized tensors on disk

    Python 113 Apache-2.0 11 5 15 Updated May 22, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 12 Apache-2.0 7,641 0 9 Updated May 22, 2025
  • speculators Public
    Python 2 Apache-2.0 0 19 4 Updated May 21, 2025
  • model-validation-configs
    0 0 0 8 Updated May 21, 2025
  • research Public

    Repository to enable research flows

    Python 0 0 0 1 Updated May 21, 2025
  • yolov5 Public Forked from ultralytics/yolov5

    YOLOv5 in PyTorch > ONNX > CoreML > TFLite

    Python 19 GPL-3.0 17,127 0 3 Updated May 21, 2025
  • sparseml Public

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python 2,136 Apache-2.0 155 1 3 Updated May 20, 2025
  • lmms-eval Public Forked from EvolvingLMMs-Lab/lmms-eval

    Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

    Python 0 281 0 7 Updated May 19, 2025
  • axolotl Public Forked from axolotl-ai-cloud/axolotl

    Go ahead and axolotl questions

    Python 0 Apache-2.0 1,027 0 3 Updated May 18, 2025
