Does desklib support quantization?

by sexyOG - opened about 24 hours ago

I would like to run desklib locally on a GPU with 8 GB of memory. Since FP16 is not supported, is it okay to use quantization, for example with ONNX Runtime? What about bitsandbytes?
