We have uploaded the trained weights for the 1B model using LayerNorm Scaling (CoD).