Popular repositories

  1. lorax Public

    Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

    Python 3k 215

  2. llm_distillation_playbook Public

    Best practices for distilling large language models.

    Jupyter Notebook 548 43

  3. lora_bakeoff Public

    Python 19 2

  4. json-mode-benchmark Public

    Jupyter Notebook 6 1

  5. neuropod Public

    Forked from uber/neuropod

    A uniform interface to run deep learning models from multiple frameworks

    C++ 3 2

  6. punica Public

    Forked from punica-ai/punica

    Serving multiple LoRA fine-tuned LLMs as one

    Cuda 2 4

Repositories

Showing 10 of 18 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.
