ymoslem
/

ModernBERT-base-long-context-qe-v1

Text Classification

quality-estimation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

ymoslem commited on 3 days ago

Commit

d6f0e9d

·

verified ·

1 Parent(s): 6e14fcf

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -74,14 +74,19 @@ It achieves the following results on the evaluation set:
 ## Model description
-This model is for reference-free quality estimation (QE) of machine translation (MT) systems.
 ## Training and evaluation data
 The model is trained on the long-context dataset [ymoslem/wmt-da-human-evaluation-long-context](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation-long-context).
-* Training: 7.65 million long-context texts
-* Test: 59,235 long-context texts
 ## Training procedure

 ## Model description
+This model is for reference-free, long-context quality estimation (QE) of machine translation (MT) systems.
+It trained on a dataset of texts of up to 32 sentences (64 sentences for the source and target).
+Hence, this model is suitable for document-level quality estimation.
 ## Training and evaluation data
 The model is trained on the long-context dataset [ymoslem/wmt-da-human-evaluation-long-context](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation-long-context).
+The used long-context / document-level dataset for Quality Estimation of Machine Translation is an augmented variant of the sentence-level WMT DA Human Evaluation dataset.
+In addition to individual sentences, it contains augmentations of 2, 4, 8, 16, and 32 sentences, among each language pair `lp` and `domain`.
+The `raw` column represents a weighted average of scores of augmented sentences using character lengths of `src` and `mt` as weights.
+* Training data: 7.65 million long-context texts
+* Test data: 59,235 long-context texts
 ## Training procedure