# Exl2 Quantization 6.0BPW

This model fits comfortably within 72 GB of VRAM with 32k context. It was created after the inference/quant bug was repaired.
- 6 head bits
- 6.0 bpw target
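As a rough sketch, a quant with the settings above could be produced with the ExLlamaV2 `convert.py` script; the paths below are hypothetical placeholders, and `-b` / `-hb` set the bpw target and head bits respectively:

```shell
# Sketch of an ExLlamaV2 quantization run (paths are placeholders).
# -b sets the target bits per weight; -hb sets the head-layer bits,
# matching the "6.0 bpw target" and "6 head bits" settings above.
python convert.py \
    -i /models/dolphin-2.9.2-qwen2-72b \
    -o /tmp/exl2-workdir \
    -cf /models/dolphin-2.9.2-qwen2-72b-exl2-6.0bpw \
    -b 6.0 \
    -hb 6
```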
Enjoy! Feel free to reach out for other quants or BPW levels.

# Dolphin 2.9.2 Qwen2 72B 🐬