Update README.md
README.md CHANGED
@@ -37,34 +37,78 @@ datasets:
 - ymoslem/wmt-da-human-evaluation
 model-index:
 - name: Quality Estimation for Machine Translation
-  results: []
+  results:
+  - task:
+      type: regression
+    dataset:
+      name: ymoslem/wmt-da-human-evaluation-long-context
+      type: QE
+    metrics:
+    - name: Pearson
+      type: Pearson Correlation
+      value: 0.2055
+    - name: MAE
+      type: Mean Absolute Error
+      value: 0.2004
+    - name: RMSE
+      type: Root Mean Squared Error
+      value: 0.2767
+    - name: R-R2
+      type: R-Squared
+      value: -1.6745
+  - task:
+      type: regression
+    dataset:
+      name: ymoslem/wmt-da-human-evaluation
+      type: QE
+    metrics:
+    - name: Pearson
+      type: Pearson Correlation
+      value: null
+    - name: MAE
+      type: Mean Absolute Error
+      value: null
+    - name: RMSE
+      type: Root Mean Squared Error
+      value: null
+    - name: R-R2
+      type: R-Squared
+      value: null
+metrics:
+- pearsonr
+- mae
+- r_squared
+new_version: ymoslem/ModernBERT-base-qe-v1
 ---
 
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 
 # Quality Estimation for Machine Translation
 
-This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the ymoslem/wmt-da-human-evaluation dataset.
+This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the [ymoslem/wmt-da-human-evaluation](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.0561
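+
+The YAML metadata above also reports Pearson, MAE, RMSE, and R² values. The evaluation script itself is not part of this card; as a rough sketch, such regression metrics can be computed from predicted and gold DA scores like this (placeholder values, not the card's real predictions):
+
+```python
+import numpy as np
+from scipy.stats import pearsonr
+from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
+
+gold = np.array([0.90, 0.40, 0.75, 0.60])  # human DA scores (placeholders)
+pred = np.array([0.85, 0.50, 0.70, 0.55])  # model predictions (placeholders)
+
+print("Pearson:", pearsonr(gold, pred)[0])
+print("MAE:", mean_absolute_error(gold, pred))
+print("RMSE:", np.sqrt(mean_squared_error(gold, pred)))
+print("R2:", r2_score(gold, pred))
+```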
 
 ## Model description
 
-More information needed
-
-## Intended uses & limitations
-
-More information needed
+This model is for reference-free, sentence-level quality estimation (QE) of machine translation (MT) systems.
+The long-context / document-level model can be found at [ModernBERT-base-long-context-qe-v1](https://huggingface.co/ymoslem/ModernBERT-base-long-context-qe-v1),
+which is trained on the long-context / document-level QE dataset [ymoslem/wmt-da-human-evaluation-long-context](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation-long-context).
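+
+A minimal usage sketch follows. This card does not include an official inference snippet, so the sketch assumes the checkpoint loads as a single-logit `AutoModelForSequenceClassification` regression head and that the source sentence and its machine translation are joined by the tokenizer's separator token; treat both as assumptions, not documented behavior.
+
+```python
+import torch
+from transformers import AutoModelForSequenceClassification, AutoTokenizer
+
+# Repo id taken from the metadata's new_version field; substitute this
+# checkpoint's actual id if it differs.
+model_name = "ymoslem/ModernBERT-base-qe-v1"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSequenceClassification.from_pretrained(model_name)
+model.eval()
+
+source = "Das Wetter ist heute schön."
+translation = "The weather is nice today."
+
+# "source [SEP] translation" input format is an assumption of this sketch.
+inputs = tokenizer(
+    source + tokenizer.sep_token + translation,
+    truncation=True,
+    return_tensors="pt",
+)
+with torch.no_grad():
+    score = model(**inputs).logits.squeeze().item()
+print(f"Predicted quality score: {score:.4f}")
+```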
 
 ## Training and evaluation data
 
-More information needed
+This model is trained on the sentence-level quality estimation dataset [ymoslem/wmt-da-human-evaluation](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation).
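+
+Since the dataset is hosted on the Hugging Face Hub, it can be inspected directly with the `datasets` library (a small sketch; field names vary, so check the dataset card for the exact schema):
+
+```python
+from datasets import load_dataset
+
+ds = load_dataset("ymoslem/wmt-da-human-evaluation")
+print(ds)              # available splits and columns
+print(ds["train"][0])  # one example, e.g. source/target text and a DA score
+```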
 
 ## Training procedure
 
 ### Training hyperparameters
 
+This version of the model uses `tokenizer.model_max_length=512`; a sketch of this setting follows the paragraph below.
+The model trained with the full maximum length of 8192 tokens can be found at [ymoslem/ModernBERT-base-qe-v1](https://huggingface.co/ymoslem/ModernBERT-base-qe-v1),
+which is likewise trained on the sentence-level QE dataset [ymoslem/wmt-da-human-evaluation](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation).
+
+The long-context / document-level model can be found at [ModernBERT-base-long-context-qe-v1](https://huggingface.co/ymoslem/ModernBERT-base-long-context-qe-v1),
+which is trained on the long-context / document-level QE dataset [ymoslem/wmt-da-human-evaluation-long-context](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation-long-context).
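+
+The 512-token cap can be reproduced at load time as follows (a minimal sketch; the exact training setup is not part of this card):
+
+```python
+from transformers import AutoTokenizer
+
+# Base tokenizer; this QE variant caps sequences at 512 tokens.
+tokenizer = AutoTokenizer.from_pretrained("answerdotai/ModernBERT-base")
+tokenizer.model_max_length = 512
+
+batch = tokenizer(
+    "A source sentence and its machine translation.",
+    truncation=True,  # truncates to tokenizer.model_max_length (512)
+    return_tensors="pt",
+)
+print(batch["input_ids"].shape)
+```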
+
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
 - train_batch_size: 128
@@ -95,4 +139,4 @@ The following hyperparameters were used during training:
 - Transformers 4.48.0
 - Pytorch 2.4.1+cu124
 - Datasets 3.2.0
-- Tokenizers 0.21.0
+- Tokenizers 0.21.0