184 downloads per month
Used in 2 crates

MIT license

70KB
1.5K SLoC

llm_devices: Device management and build system for LLM inference

The llm_devices crate is a workspace member of the llm_client project. It is used as a dependency by the llm_interface crate for building llama.cpp.

Features

Automated building of llama.cpp with appropriate platform-specific optimizations
Device detection and configuration for CPU, RAM, CUDA (Linux/Windows), and Metal (macOS)
Memory management: detects available VRAM/RAM, estimates whether a model fits, and distributes layers across devices
Logging tools
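
To make the memory-management feature concrete, here is a minimal sketch of how layers might be spread across devices once the available memory is known. It is an illustration only, not the llm_devices API: the `Device` struct, the `distribute_layers` function, and the simplification that every layer needs the same number of bytes are all assumptions made for this example.

```rust
/// Hypothetical description of one compute device (not a type from llm_devices).
#[derive(Debug, Clone, PartialEq)]
pub struct Device {
    pub name: String,
    /// Memory currently free on the device, in bytes.
    pub available_bytes: u64,
}

/// Greedily assigns whole layers to devices in the order given.
///
/// Assumes every layer needs `bytes_per_layer` bytes, which is a
/// simplification made for this sketch. Returns the number of layers placed
/// on each device, or `None` if the model does not fit in the combined memory.
pub fn distribute_layers(
    devices: &[Device],
    layer_count: u64,
    bytes_per_layer: u64,
) -> Option<Vec<u64>> {
    if bytes_per_layer == 0 {
        // A zero-sized layer makes the capacity calculation meaningless.
        return None;
    }

    let mut remaining = layer_count;
    let mut plan = Vec::with_capacity(devices.len());

    for device in devices {
        // How many whole layers fit in this device's free memory.
        let capacity = device.available_bytes / bytes_per_layer;
        let assigned = capacity.min(remaining);
        plan.push(assigned);
        remaining -= assigned;
    }

    if remaining == 0 {
        Some(plan)
    } else {
        None
    }
}
```

With an 8 GiB device followed by a 4 GiB device and 20 layers of 512 MiB each, this sketch places 16 layers on the first device and 4 on the second. A real implementation would also need to budget for the context cache and other buffers, which are ignored here.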

