Continued pre-training with replay?

#4
by ostapeno - opened

Dear authors,
thank you for the great work.
Regarding the continued pre-training experiments in your work, did you use replay to prevent forgetting the knowledge the model acquired during stage 1 training?

Thank you in advance!

Thanks for the question. We did not use replay in our experiments. Instead, we mixed domain-specific instruction-augmented corpora with general instructions to help maintain the model's general capabilities while adapting to the target domain.
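
For anyone curious what such a mixture might look like in practice, here is a minimal sketch using the Hugging Face `datasets` library. The file names and the 80/20 mixing ratio are purely illustrative placeholders, not the actual data or proportions used in the paper.

```python
# Minimal sketch of mixing a domain-specific, instruction-augmented corpus
# with general instructions for continued pre-training.
# Dataset files and the mixing ratio below are hypothetical examples.
from datasets import load_dataset, interleave_datasets

# Hypothetical domain-specific, instruction-augmented corpus.
domain_ds = load_dataset("json", data_files="domain_instructions.jsonl", split="train")

# Hypothetical general instruction dataset.
general_ds = load_dataset("json", data_files="general_instructions.jsonl", split="train")

# Sample from both sources so the model keeps seeing general instructions
# while adapting to the target domain (the 80/20 split is illustrative only).
mixed_ds = interleave_datasets(
    [domain_ds, general_ds],
    probabilities=[0.8, 0.2],
    seed=42,
)
```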
