aikitoria/Goliath-longLORA-120b-rope8-32k-exl2

Text Generation


aikitoria committed on Jan 27, 2024

Commit cd3b412 · verified · 1 Parent(s): 0cf53a1

Update README.md

Files changed (1)

README.md +5 -3


@@ -25,6 +25,8 @@ context 32k, cache 8: 70.3GiB (fits in 3x 3090)
 context 32k, cache 16: 78.7GiB (fits in A100 80GB)
 # Super epic scientific test results
-The 2.65bpw version suffered greatly, it's not completely broken, but it's not good either.
-The 4.35bpw version is worse than normal 4k goliath but better than goliath with rope scale applied for 8k+ context.
-The version using the PIPPA dataset produces worse results than the one using the default dataset on any context length.
+- The 2.65bpw version suffered greatly, it's not completely broken, but it's not good either.
+- The 4.35bpw version is worse than normal 4k goliath but better than goliath with rope scale applied for 8k+ context.
+- The version using the PIPPA dataset produces worse results than the one using the default dataset on any context length.
+My current strategy is to use the original goliath until its context is full and then switch over to this one.
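
The switch-over strategy in the last added line can be sketched roughly as below. This is only an illustration, not something from the README: the function name `pick_model`, the model labels, and the `reserve_for_reply` budget are all hypothetical, and the context sizes are taken from the "normal 4k goliath" and "32k" figures mentioned above.

```python
# Minimal sketch of "use the original goliath until its context is full,
# then switch to the long-context version". All names here are hypothetical.

ORIGINAL_MODEL = "goliath-120b"                   # label only, not a verified repo id
LONG_MODEL = "goliath-longlora-120b-rope8-32k"    # label only, not a verified repo id

ORIGINAL_CONTEXT = 4096    # "normal 4k goliath"
LONG_CONTEXT = 32768       # "context 32k"


def pick_model(prompt_tokens: int, reserve_for_reply: int = 512) -> str:
    """Return which model to load for a prompt of the given token count."""
    needed = prompt_tokens + reserve_for_reply
    if needed <= ORIGINAL_CONTEXT:
        # Still fits in the original model, which tested better at short context.
        return ORIGINAL_MODEL
    if needed <= LONG_CONTEXT:
        # Past 4k: the long-context version tested better than rope-scaled goliath.
        return LONG_MODEL
    raise ValueError(f"{needed} tokens exceeds the {LONG_CONTEXT}-token context")


if __name__ == "__main__":
    for n in (1000, 4000, 20000):
        print(n, "->", pick_model(n))
```

The token count has to come from whatever tokenizer the loader uses, and the reply budget is a guess that should be tuned to the actual generation settings.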