Skip to content
Change the repository type filter

All

    Repositories list

    • .github

      Public
      0000Updated Jan 31, 2025Jan 31, 2025
    • nndeploy

      Public
      nndeploy is an end-to-end model deployment framework. Based on multi-terminal inference and directed acyclic graph model deployment, it is committed to providing users with a cross-platform, easy-to-use, and high-performance model deployment experience.
      C++
      Apache License 2.0
      10168270Updated Jan 29, 2025Jan 29, 2025
    • Header-only safetensors loader and saver in C++
      C++
      MIT License
      11000Updated Nov 19, 2024Nov 19, 2024
    • onnx-llm

      Public
      llm deploy project based onnx.
      C++
      Apache License 2.0
      7000Updated Oct 9, 2024Oct 9, 2024
    • Universal cross-platform tokenizers binding to HF and sentencepiece
      C++
      Apache License 2.0
      69100Updated Jun 3, 2024Jun 3, 2024
    • 💻A small Collection for Awesome LLM Inference [Papers|Blogs|Docs] with codes, contains TensorRT-LLM, streaming-llm, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
      GNU General Public License v3.0
      226200Updated Dec 3, 2023Dec 3, 2023
    • Simplify your onnx model
      Python
      Apache License 2.0
      388100Updated Apr 27, 2022Apr 27, 2022
    pFad - Phonifier reborn

    Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

    Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


    Alternative Proxies:

    Alternative Proxy

    pFad Proxy

    pFad v3 Proxy

    pFad v4 Proxy