shenzhi-wang mradermacher commited on
Commit
4d8f821
·
verified ·
0 Parent(s):

Duplicate from mradermacher/Xwen-7B-Chat-i1-GGUF

Browse files

Co-authored-by: team mradermacher <[email protected]>

.gitattributes ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ imatrix.dat filter=lfs diff=lfs merge=lfs -text
37
+ Xwen-7B-Chat.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
38
+ Xwen-7B-Chat.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ Xwen-7B-Chat.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ Xwen-7B-Chat.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
41
+ Xwen-7B-Chat.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ Xwen-7B-Chat.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
43
+ Xwen-7B-Chat.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
44
+ Xwen-7B-Chat.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ Xwen-7B-Chat.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
46
+ Xwen-7B-Chat.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
47
+ Xwen-7B-Chat.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
48
+ Xwen-7B-Chat.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
49
+ Xwen-7B-Chat.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
50
+ Xwen-7B-Chat.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
51
+ Xwen-7B-Chat.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
52
+ Xwen-7B-Chat.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
53
+ Xwen-7B-Chat.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
54
+ Xwen-7B-Chat.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
55
+ Xwen-7B-Chat.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
56
+ Xwen-7B-Chat.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
57
+ Xwen-7B-Chat.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
58
+ Xwen-7B-Chat.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
59
+ Xwen-7B-Chat.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
60
+ Xwen-7B-Chat.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,77 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: xwen-team/Xwen-7B-Chat
3
+ language:
4
+ - en
5
+ - zh
6
+ library_name: transformers
7
+ license: apache-2.0
8
+ quantized_by: mradermacher
9
+ ---
10
+ ## About
11
+
12
+ <!-- ### quantize_version: 2 -->
13
+ <!-- ### output_tensor_quantised: 1 -->
14
+ <!-- ### convert_type: hf -->
15
+ <!-- ### vocab_type: -->
16
+ <!-- ### tags: nicoboss -->
17
+ weighted/imatrix quants of https://huggingface.co/xwen-team/Xwen-7B-Chat
18
+
19
+ <!-- provided-files -->
20
+ static quants are available at https://huggingface.co/mradermacher/Xwen-7B-Chat-GGUF
21
+ ## Usage
22
+
23
+ If you are unsure how to use GGUF files, refer to one of [TheBloke's
24
+ READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
25
+ more details, including on how to concatenate multi-part files.
26
+
27
+ ## Provided Quants
28
+
29
+ (sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
30
+
31
+ | Link | Type | Size/GB | Notes |
32
+ |:-----|:-----|--------:|:------|
33
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ1_S.gguf) | i1-IQ1_S | 2.0 | for the desperate |
34
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ1_M.gguf) | i1-IQ1_M | 2.1 | mostly desperate |
35
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.4 | |
36
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.6 | |
37
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ2_S.gguf) | i1-IQ2_S | 2.7 | |
38
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ2_M.gguf) | i1-IQ2_M | 2.9 | |
39
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q2_K_S.gguf) | i1-Q2_K_S | 2.9 | very low quality |
40
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q2_K.gguf) | i1-Q2_K | 3.1 | IQ3_XXS probably better |
41
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.2 | lower quality |
42
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.4 | |
43
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.6 | IQ3_XS probably better |
44
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ3_S.gguf) | i1-IQ3_S | 3.6 | beats Q3_K* |
45
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ3_M.gguf) | i1-IQ3_M | 3.7 | |
46
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q3_K_M.gguf) | i1-Q3_K_M | 3.9 | IQ3_S probably better |
47
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.2 | IQ3_M probably better |
48
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.3 | |
49
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.5 | prefer IQ4_XS |
50
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q4_0.gguf) | i1-Q4_0 | 4.5 | fast, low quality |
51
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.6 | optimal size/speed/quality |
52
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q4_K_M.gguf) | i1-Q4_K_M | 4.8 | fast, recommended |
53
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q4_1.gguf) | i1-Q4_1 | 5.0 | |
54
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.4 | |
55
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.5 | |
56
+ | [GGUF](https://huggingface.co/mradermacher/Xwen-7B-Chat-i1-GGUF/resolve/main/Xwen-7B-Chat.i1-Q6_K.gguf) | i1-Q6_K | 6.4 | practically like static Q6_K |
57
+
58
+ Here is a handy graph by ikawrakow comparing some lower-quality quant
59
+ types (lower is better):
60
+
61
+ ![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
62
+
63
+ And here are Artefact2's thoughts on the matter:
64
+ https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
65
+
66
+ ## FAQ / Model Request
67
+
68
+ See https://huggingface.co/mradermacher/model_requests for some answers to
69
+ questions you might have and/or if you want some other model quantized.
70
+
71
+ ## Thanks
72
+
73
+ I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
74
+ me use its servers and providing upgrades to my workstation to enable
75
+ this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
76
+
77
+ <!-- end -->
Xwen-7B-Chat.i1-IQ1_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:942b3c6b0a7671a0568e57f97afd17497a2004b80e3d4d3529e6804d1a4425ae
3
+ size 2042196928
Xwen-7B-Chat.i1-IQ1_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:05b443d87a9b1226ab062d6cb2cc8836c160c7d340fa2fe3816242d3ebc39130
3
+ size 1903668160
Xwen-7B-Chat.i1-IQ2_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5acc997901fb8d3ee73e38a00da2612066133b7ba7fedfecf545419b380eb105
3
+ size 2780343232
Xwen-7B-Chat.i1-IQ2_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fc5328306b74d5e6ae9bb78bf2a8d5a3568c4cce2f83fd7eb0bf55c403abae71
3
+ size 2595638208
Xwen-7B-Chat.i1-IQ2_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c1b5dfe4ee50169c5fa1315e1ad1548ab574e9f5c08093ca9450ec645d21f492
3
+ size 2469022656
Xwen-7B-Chat.i1-IQ2_XXS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b68ced2d76532c4bf99c61918c455d24c6dabe23df79ae0c7a29eb201d0225a5
3
+ size 2273078208
Xwen-7B-Chat.i1-IQ3_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1a93d4add73bb3bf3f7cf7422e83becf28b21a6e25b7cdd208ab9408b1b02947
3
+ size 3574012864
Xwen-7B-Chat.i1-IQ3_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1b853c9255e54141a2113cd586ea8896048159fa5ea13a7020fa1a755e494bfe
3
+ size 3499193280
Xwen-7B-Chat.i1-IQ3_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a63d8a3fe9ee9a28bd075cc21ee6398a92ecf5011d845695378b8f8858d18db1
3
+ size 3346256832
Xwen-7B-Chat.i1-IQ3_XXS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5879aba14b56d5e4bf33e3a6600aee5b317a32e2f7a0ea49704be4a866a01033
3
+ size 3114515392
Xwen-7B-Chat.i1-IQ4_NL.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:756ee38d0738e834c017296a98c5440951f9a456aab92a77f93542cfd4b32b01
3
+ size 4437814208
Xwen-7B-Chat.i1-IQ4_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:88e322d7597e9027a09c1be8ef1132569bb85b64f0fa619f7f5ceae8c187be6c
3
+ size 4218473408
Xwen-7B-Chat.i1-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4bd20de1c009caa329026d8bac3622b63b4086b360044c8a0bb228771598c29
3
+ size 3015941056
Xwen-7B-Chat.i1-Q2_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:97bfea26602970c2e5cf18b4af2f539fb1d0275b004498c5478f2bebf18efa63
3
+ size 2834074560
Xwen-7B-Chat.i1-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:29ecb6ec567edb487af357ae52a69b517b633e25821ff077f455a25367d33b46
3
+ size 4088460224
Xwen-7B-Chat.i1-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cea72e55cf324c08edd2c5d5e22cf6dc3f4b07760c864aefdf859e8c4e5d62f5
3
+ size 3808392128
Xwen-7B-Chat.i1-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:067de320190179ed3b518be01978e3e358f0a8f917b2926418ac8754f9a931d8
3
+ size 3492369344
Xwen-7B-Chat.i1-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ff6425d8d79575a9da17500dd12823bcb2b750b0826bed209c6a626fc9ac1d9c
3
+ size 4444122048
Xwen-7B-Chat.i1-Q4_1.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f7f29b90efac7f876133d11bc5dd020faad9be776a7daa92db4b4eae10344210
3
+ size 4873284544
Xwen-7B-Chat.i1-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:adec1d5c5ee8a728da194c0d7ea9202665191166a7710652dbeb37b10abd84dd
3
+ size 4683074496
Xwen-7B-Chat.i1-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f3d617e07fede2cdf73c5dbf990318f52d67e7337856785960eeb06ae3592f6c
3
+ size 4457769920
Xwen-7B-Chat.i1-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:52b8ef3f11ca12f286dddc76a2de9cc3c33145494fa9f7c1f01ad3db34e39b92
3
+ size 5444832192
Xwen-7B-Chat.i1-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4da9a320ab2892d2ac59ff5572ce45242b33a3051ca7def31d6eede34bc972d1
3
+ size 5315177408
Xwen-7B-Chat.i1-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5a60ea40cbe1320e4cae1dd760f91187cfbd70a2a609ef714e1daa18814656dc
3
+ size 6254199744
imatrix.dat ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:89680180a884bd90332d7f6127a24c48704228db6b269345013d4c3974c152c4
3
+ size 4536665