Update README.md
Browse files
README.md
CHANGED
@@ -17,15 +17,12 @@ I did not create that model, only discovered it and wanted to try it for myself,
|
|
17 |
|
18 |
|
19 |
# Memory usage tests
|
20 |
-
|
21 |
### 2.65bpw
|
22 |
context 16k, cache 16: 46.9GiB (fits in 2x 3090)
|
23 |
context 32k, cache 8: 47GiB (fits in 2x 3090)
|
24 |
-
|
25 |
### 3bpw
|
26 |
-
context 8k, cache 16: 47.4GiB (fits in 2x 3090)
|
27 |
-
context 16k, cache 8: 47.4GiB (fits in 2x 3090)
|
28 |
-
|
29 |
### 4.35bpw
|
30 |
context 16k, cache 16: 70.1GiB (fits in 3x 3090)
|
31 |
context 32k, cache 8: 70.3GiB (fits in 3x 3090)
|
|
|
17 |
|
18 |
|
19 |
# Memory usage tests
|
|
|
20 |
### 2.65bpw
|
21 |
context 16k, cache 16: 46.9GiB (fits in 2x 3090)
|
22 |
context 32k, cache 8: 47GiB (fits in 2x 3090)
|
|
|
23 |
### 3bpw
|
24 |
+
context 8k, cache 16: 47.4GiB (fits in 2x 3090)
|
25 |
+
context 16k, cache 8: 47.4GiB (fits in 2x 3090)
|
|
|
26 |
### 4.35bpw
|
27 |
context 16k, cache 16: 70.1GiB (fits in 3x 3090)
|
28 |
context 32k, cache 8: 70.3GiB (fits in 3x 3090)
|