GGML Version

#1
by s3nh - opened

:) https://huggingface.co/s3nh/mamba-gpt-3b-v3-GGML
did not check if there is significant drop of accuracy, but there exist a visible speed up in inference time.

All best,
Damian

Sign up or log in to comment