No description provided.

We have uploaded the trained weights for the 1B model using LayerNorm Scaling (CoD).

pengxiang changed pull request status to merged

Sign up or log in to comment