ymoslem
/

ModernBERT-base-long-context-qe-v1

Text Classification

quality-estimation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

ymoslem commited on 1 day ago

Commit

e4d84a3

·

verified ·

1 Parent(s): 4433978

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -90,6 +90,8 @@ The `raw` column represents a weighted average of scores of augmented sentences
 ## Training procedure
 - tokenizer.model_max_length: 8192 (full context length)
 - attn_implementation: flash_attention_2

 ## Training procedure
+The model is trained on 1x H200 SXM (143 GB VRAM) for approx. 26 hours.
 - tokenizer.model_max_length: 8192 (full context length)
 - attn_implementation: flash_attention_2