demo space
2
#4 opened over 1 year ago
by
matthoffner
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6424f28ef1d18f46decd414c/ytq223nXqxJB1gI2CoF81.png)
Looks like the starchat-alpha-ggml-q4_1.bin is broken
8
#3 opened over 1 year ago
by
xhyi
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1630460859893-612b2b983394ed91a3c6ea2a.jpeg)
Which inference repo is this quantized for?
3
#2 opened over 1 year ago
by
xhyi
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1630460859893-612b2b983394ed91a3c6ea2a.jpeg)
Can the quantized model be loaded in gpu to have faster inference ?
6
#1 opened over 1 year ago
by
MohamedRashad
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1628885133347-6116d0584ef9fdfbf45dc4d9.jpeg)