Models prequantized with [HIGGS](https://arxiv.org/abs/2411.17525) zero-shot quantization. Requires the latest `transformers` to run.
-
ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-GPTQ-4bit
Updated • 23 • 2 -
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-GPTQ-4bit
Text Generation • Updated • 80 -
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-GPTQ-3bit
Text Generation • Updated • 13 -
ISTA-DASLab/Llama-3.1-8B-HIGGS-GPTQ-4bit
Text Generation • Updated • 81