jondurbin/mpt-30b-qlora-compatible

jondurbin committed on Jun 26, 2023

Commit 5e4f4bc · 1 Parent(s): 6d5b10d

Update README.md

Files changed (1)

README.md +2 -0


@@ -12,6 +12,8 @@ Differences in the qlora scripts:
 __I think there's a bug in gradient accumulation, so if you try this, maybe set gradient accumulation steps to 1__
+*my first attempts used batch size 6, with gradient accumulation steps 16, but results of three epochs with gradient accumulation vs without were quite a bit worse*
 __5 epochs seemed to achieve the best results, but YMMV__
 Full example of tuning (used for airoboros-mpt-30b-gpt4-1.4):