3 releases

| Version | Date |
|---|---|
| 0.0.3 | Jan 29, 2025 |
| 0.0.2 | Oct 4, 2024 |
| 0.0.1 | Oct 4, 2024 |
#639 in Machine learning
184 downloads per month
Used in 2 crates
70KB, 1.5K SLoC
llm_devices: Device management and build system for LLM inference
The llm_devices crate is a workspace member of the llm_client project. The llm_interface crate depends on it to build llama.cpp.
Features
- Automated building of llama.cpp with appropriate platform-specific optimizations
- Device detection and configuration for CPU, RAM, CUDA (Linux/Windows), and Metal (macOS)
- Memory management: detects available VRAM/RAM, estimates whether a model fits, and distributes layers across devices
- Logging tools
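
The layer-distribution idea in the memory management feature can be illustrated with a minimal sketch. This is not the llm_devices API: the `DeviceMemory` struct and the `distribute_layers` function are hypothetical names invented for this example, and it assumes every layer needs the same number of bytes, which real models only approximate. It fills devices greedily in the order given and reports when the model does not fit.

```rust
/// Hypothetical description of one device's free memory (not an llm_devices type).
#[derive(Debug, Clone, PartialEq)]
pub struct DeviceMemory {
    pub name: String,
    pub free_bytes: u64,
}

/// Greedily assigns layers to devices in the order given.
///
/// Assumption: every layer costs `bytes_per_layer` bytes.
/// Returns the number of layers placed on each device, or `None` if the
/// layers do not all fit (or if `bytes_per_layer` is zero).
pub fn distribute_layers(
    devices: &[DeviceMemory],
    layer_count: u32,
    bytes_per_layer: u64,
) -> Option<Vec<u32>> {
    if bytes_per_layer == 0 {
        return None;
    }
    let mut remaining = layer_count;
    let mut plan = Vec::with_capacity(devices.len());
    for device in devices {
        // How many whole layers fit in this device's free memory.
        let capacity = device.free_bytes / bytes_per_layer;
        // Never assign more layers than are still unplaced.
        let assigned = capacity.min(u64::from(remaining)) as u32;
        plan.push(assigned);
        remaining -= assigned;
    }
    if remaining == 0 {
        Some(plan)
    } else {
        None
    }
}
```

Listing GPUs before system RAM in `devices` makes the greedy pass prefer VRAM and spill only the leftover layers to the CPU. A real implementation would also need to reserve memory for the context and other buffers.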
Dependencies: ~6–32MB, ~454K SLoC