Open
Description
quantizer_hqq.py requires a CUDA device:
transformers/src/transformers/quantizers/quantizer_hqq.py
Lines 74 to 75 in badc71b
However, the original HQQ library also runs on CPU by falling back to the default ATen operators: https://github.com/mobiusml/hqq?tab=readme-ov-file#usage-with-models
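To illustrate the kind of relaxed check being asked for, here is a minimal sketch. It is not the current transformers code: `pick_hqq_device` is a hypothetical helper, and the `cuda_available` flag is passed in explicitly so the logic can be read on its own. The idea is to fail only when CUDA is explicitly requested but missing, and otherwise fall back to CPU with a warning.

```python
import warnings
from typing import Optional


def pick_hqq_device(cuda_available: bool, requested: Optional[str] = None) -> str:
    """Hypothetical helper: choose a device without hard-requiring CUDA."""
    if requested is not None:
        # Only fail when the user explicitly asked for CUDA and it is missing.
        if requested.startswith("cuda") and not cuda_available:
            raise ValueError("CUDA was requested but no CUDA device is available.")
        return requested
    if cuda_available:
        return "cuda"
    # Assumption: CPU execution works but is likely slower, so warn instead of raising.
    warnings.warn("No CUDA device found; falling back to CPU.")
    return "cpu"
```

In practice the flag would presumably come from `torch.cuda.is_available()`, e.g. `pick_hqq_device(torch.cuda.is_available())`.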