Please convert these models to GGUF format...

#12
by Moodym - opened

Please convert to GGUF format...

Others already have. Just do a search.

@bartowski has already made quants for this.
It may be helpful for the model card to include links to those.
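For anyone who wants to produce their own quants instead, the usual route is llama.cpp's conversion script followed by its quantize tool. A minimal sketch, assuming llama.cpp is cloned and built locally; the model path and the Q4_K_M quant type are just illustrative examples:

```shell
# Assumes llama.cpp is cloned/built and the HF checkpoint is downloaded locally.
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
pip install -r requirements.txt

# 1) Convert the Hugging Face checkpoint to a full-precision GGUF file.
python convert_hf_to_gguf.py /path/to/model --outtype f16 --outfile model-f16.gguf

# 2) Quantize it down; Q4_K_M is a common size/quality trade-off.
./llama-quantize model-f16.gguf model-Q4_K_M.gguf Q4_K_M
```

The quant type in step 2 is where the precision/size trade-off discussed below is decided.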

Is the quantized version still that competitive against o1-mini?

There is obviously some degradation from the reduced precision, but there are no performance metrics showing exactly how much is lost at each quant level.
IMHO a non-quantized 32B can perform as well as a quantized full R1. In any case, the 32B model is extraordinarily good.


To me it's a lot like a WAV-to-MP3 decision when you don't have much storage on your music device. Sure, you lose some quality, but the file is smaller. It's a trade-off of necessity; you just have to decide how low a bit rate you're willing to go.
