Quantization

#1
by shadow123 - opened

Can you make a GPTQ Int8 for it? Thx

Maybe you are looking for this: https://huggingface.co/bartowski/huihui-ai_DeepSeek-R1-Distill-Llama-70B-abliterated-GGUF

The Q8_0 version takes 74.98GB, and the page also provides quants ranging down to 16.75GB.
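Those file sizes line up with a back-of-the-envelope estimate: GGUF quant size is roughly parameter count times bits per weight. A sketch, assuming ~70.6B parameters for this 70B model and ~8.5 bits/weight for Q8_0 (8-bit blocks plus per-block fp16 scales); the exact bits-per-weight figures vary by quant type:

```python
def gguf_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Rough on-disk size of a GGUF quant: params * bits / 8 bits-per-byte, in GB (1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9

# DeepSeek-R1-Distill-Llama-70B has roughly 70.6B parameters (assumed figure).
n = 70.6e9

# Q8_0 stores 8-bit weights in blocks of 32 with an fp16 scale: ~8.5 bits/weight.
print(f"Q8_0  (~8.5 bpw): {gguf_size_gb(n, 8.5):.1f} GB")  # close to the listed 74.98GB
# The smallest listed quant (16.75GB) works out to under 2 bits/weight.
print(f"~1.9 bpw quant:  {gguf_size_gb(n, 1.9):.1f} GB")
```

This doesn't count tokenizer metadata or non-quantized tensors (embeddings and output layers are often kept at higher precision), so real files run slightly larger than the estimate.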
