shenzhi-wang

mradermacher commited on 21 days ago

Commit

4d8f821

verified ·

0 Parent(s):

Duplicate from mradermacher/Xwen-7B-Chat-i1-GGUF

Browse files

Co-authored-by: team mradermacher <[email protected]>

Files changed (27) hide show

.gitattributes +60 -0
README.md +77 -0
Xwen-7B-Chat.i1-IQ1_M.gguf +3 -0
Xwen-7B-Chat.i1-IQ1_S.gguf +3 -0
Xwen-7B-Chat.i1-IQ2_M.gguf +3 -0
Xwen-7B-Chat.i1-IQ2_S.gguf +3 -0
Xwen-7B-Chat.i1-IQ2_XS.gguf +3 -0
Xwen-7B-Chat.i1-IQ2_XXS.gguf +3 -0
Xwen-7B-Chat.i1-IQ3_M.gguf +3 -0
Xwen-7B-Chat.i1-IQ3_S.gguf +3 -0
Xwen-7B-Chat.i1-IQ3_XS.gguf +3 -0
Xwen-7B-Chat.i1-IQ3_XXS.gguf +3 -0
Xwen-7B-Chat.i1-IQ4_NL.gguf +3 -0
Xwen-7B-Chat.i1-IQ4_XS.gguf +3 -0
Xwen-7B-Chat.i1-Q2_K.gguf +3 -0
Xwen-7B-Chat.i1-Q2_K_S.gguf +3 -0
Xwen-7B-Chat.i1-Q3_K_L.gguf +3 -0
Xwen-7B-Chat.i1-Q3_K_M.gguf +3 -0
Xwen-7B-Chat.i1-Q3_K_S.gguf +3 -0
Xwen-7B-Chat.i1-Q4_0.gguf +3 -0
Xwen-7B-Chat.i1-Q4_1.gguf +3 -0
Xwen-7B-Chat.i1-Q4_K_M.gguf +3 -0
Xwen-7B-Chat.i1-Q4_K_S.gguf +3 -0
Xwen-7B-Chat.i1-Q5_K_M.gguf +3 -0
Xwen-7B-Chat.i1-Q5_K_S.gguf +3 -0
Xwen-7B-Chat.i1-Q6_K.gguf +3 -0
imatrix.dat +3 -0

.gitattributes ADDED Viewed

	@@ -0,0 +1,60 @@

+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+imatrix.dat filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
+Xwen-7B-Chat.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,77 @@

+---
+base_model: xwen-team/Xwen-7B-Chat
+language:
+- en
+- zh
+library_name: transformers
+license: apache-2.0
+quantized_by: mradermacher
+---
+## About
+<!-- ### quantize_version: 2 -->
+<!-- ### output_tensor_quantised: 1 -->
+<!-- ### convert_type: hf -->
+<!-- ### vocab_type:  -->
+<!-- ### tags: nicoboss -->
+weighted/imatrix quants of https://huggingface.co/xwen-team/Xwen-7B-Chat
+<!-- provided-files -->
+static quants are available at https://huggingface.co/mradermacher/Xwen-7B-Chat-GGUF
+## Usage
+If you are unsure how to use GGUF files, refer to one of [TheBloke's
+READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
+more details, including on how to concatenate multi-part files.
+## Provided Quants
+(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
+| Link | Type | Size/GB | Notes |
+|:-----|:-----|--------:|:------|
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ1_S.gguf) | i1-IQ1_S | 2.0 | for the desperate |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ1_M.gguf) | i1-IQ1_M | 2.1 | mostly desperate |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.6 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ2_S.gguf) | i1-IQ2_S | 2.7 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ2_M.gguf) | i1-IQ2_M | 2.9 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q2_K_S.gguf) | i1-Q2_K_S | 2.9 | very low quality |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q2_K.gguf) | i1-Q2_K | 3.1 | IQ3_XXS probably better |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.2 | lower quality |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.6 | IQ3_XS probably better |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ3_S.gguf) | i1-IQ3_S | 3.6 | beats Q3_K* |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ3_M.gguf) | i1-IQ3_M | 3.7 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q3_K_M.gguf) | i1-Q3_K_M | 3.9 | IQ3_S probably better |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.2 | IQ3_M probably better |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.3 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.5 | prefer IQ4_XS |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q4_0.gguf) | i1-Q4_0 | 4.5 | fast, low quality |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.6 | optimal size/speed/quality |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q4_K_M.gguf) | i1-Q4_K_M | 4.8 | fast, recommended |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q4_1.gguf) | i1-Q4_1 | 5.0 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q6_K.gguf) | i1-Q6_K | 6.4 | practically like static Q6_K |
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
+And here are Artefact2's thoughts on the matter:
+https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
+## FAQ / Model Request
+See https://huggingface.co/mradermacher/model_requests for some answers to
+questions you might have and/or if you want some other model quantized.
+## Thanks
+I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
+me use its servers and providing upgrades to my workstation to enable
+this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
+<!-- end -->

Xwen-7B-Chat.i1-IQ1_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:942b3c6b0a7671a0568e57f97afd17497a2004b80e3d4d3529e6804d1a4425ae
+size 2042196928

Xwen-7B-Chat.i1-IQ1_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:05b443d87a9b1226ab062d6cb2cc8836c160c7d340fa2fe3816242d3ebc39130
+size 1903668160

Xwen-7B-Chat.i1-IQ2_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5acc997901fb8d3ee73e38a00da2612066133b7ba7fedfecf545419b380eb105
+size 2780343232

Xwen-7B-Chat.i1-IQ2_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fc5328306b74d5e6ae9bb78bf2a8d5a3568c4cce2f83fd7eb0bf55c403abae71
+size 2595638208

Xwen-7B-Chat.i1-IQ2_XS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c1b5dfe4ee50169c5fa1315e1ad1548ab574e9f5c08093ca9450ec645d21f492
+size 2469022656

Xwen-7B-Chat.i1-IQ2_XXS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b68ced2d76532c4bf99c61918c455d24c6dabe23df79ae0c7a29eb201d0225a5
+size 2273078208

Xwen-7B-Chat.i1-IQ3_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1a93d4add73bb3bf3f7cf7422e83becf28b21a6e25b7cdd208ab9408b1b02947
+size 3574012864

Xwen-7B-Chat.i1-IQ3_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1b853c9255e54141a2113cd586ea8896048159fa5ea13a7020fa1a755e494bfe
+size 3499193280

Xwen-7B-Chat.i1-IQ3_XS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a63d8a3fe9ee9a28bd075cc21ee6398a92ecf5011d845695378b8f8858d18db1
+size 3346256832

Xwen-7B-Chat.i1-IQ3_XXS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5879aba14b56d5e4bf33e3a6600aee5b317a32e2f7a0ea49704be4a866a01033
+size 3114515392

Xwen-7B-Chat.i1-IQ4_NL.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:756ee38d0738e834c017296a98c5440951f9a456aab92a77f93542cfd4b32b01
+size 4437814208

Xwen-7B-Chat.i1-IQ4_XS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:88e322d7597e9027a09c1be8ef1132569bb85b64f0fa619f7f5ceae8c187be6c
+size 4218473408

Xwen-7B-Chat.i1-Q2_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d4bd20de1c009caa329026d8bac3622b63b4086b360044c8a0bb228771598c29
+size 3015941056

Xwen-7B-Chat.i1-Q2_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:97bfea26602970c2e5cf18b4af2f539fb1d0275b004498c5478f2bebf18efa63
+size 2834074560

Xwen-7B-Chat.i1-Q3_K_L.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:29ecb6ec567edb487af357ae52a69b517b633e25821ff077f455a25367d33b46
+size 4088460224

Xwen-7B-Chat.i1-Q3_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cea72e55cf324c08edd2c5d5e22cf6dc3f4b07760c864aefdf859e8c4e5d62f5
+size 3808392128

Xwen-7B-Chat.i1-Q3_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:067de320190179ed3b518be01978e3e358f0a8f917b2926418ac8754f9a931d8
+size 3492369344

Xwen-7B-Chat.i1-Q4_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ff6425d8d79575a9da17500dd12823bcb2b750b0826bed209c6a626fc9ac1d9c
+size 4444122048

Xwen-7B-Chat.i1-Q4_1.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f7f29b90efac7f876133d11bc5dd020faad9be776a7daa92db4b4eae10344210
+size 4873284544

Xwen-7B-Chat.i1-Q4_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:adec1d5c5ee8a728da194c0d7ea9202665191166a7710652dbeb37b10abd84dd
+size 4683074496

Xwen-7B-Chat.i1-Q4_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f3d617e07fede2cdf73c5dbf990318f52d67e7337856785960eeb06ae3592f6c
+size 4457769920

Xwen-7B-Chat.i1-Q5_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:52b8ef3f11ca12f286dddc76a2de9cc3c33145494fa9f7c1f01ad3db34e39b92
+size 5444832192

Xwen-7B-Chat.i1-Q5_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4da9a320ab2892d2ac59ff5572ce45242b33a3051ca7def31d6eede34bc972d1
+size 5315177408

Xwen-7B-Chat.i1-Q6_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5a60ea40cbe1320e4cae1dd760f91187cfbd70a2a609ef714e1daa18814656dc
+size 6254199744

imatrix.dat ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:89680180a884bd90332d7f6127a24c48704228db6b269345013d4c3974c152c4
+size 4536665