salbatarni commited on
Commit
fa24305
·
verified ·
1 Parent(s): eeb4f22

End of training

Browse files
Files changed (1) hide show
  1. README.md +38 -35
README.md CHANGED
@@ -3,20 +3,20 @@ base_model: aubmindlab/bert-base-arabertv02
3
  tags:
4
  - generated_from_trainer
5
  model-index:
6
- - name: arabert_cross_vocabulary_task1_fold0
7
  results: []
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
  should probably proofread and complete it, then remove this comment. -->
12
 
13
- # arabert_cross_vocabulary_task1_fold0
14
 
15
  This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 0.9107
18
- - Qwk: 0.3160
19
- - Mse: 0.9107
20
 
21
  ## Model description
22
 
@@ -45,36 +45,39 @@ The following hyperparameters were used during training:
45
 
46
  ### Training results
47
 
48
- | Training Loss | Epoch | Step | Validation Loss | Qwk | Mse |
49
- |:-------------:|:------:|:----:|:---------------:|:------:|:------:|
50
- | No log | 0.0351 | 2 | 3.6812 | 0.0124 | 3.6812 |
51
- | No log | 0.0702 | 4 | 2.2449 | 0.0807 | 2.2449 |
52
- | No log | 0.1053 | 6 | 1.7920 | 0.1291 | 1.7920 |
53
- | No log | 0.1404 | 8 | 1.1077 | 0.2184 | 1.1077 |
54
- | No log | 0.1754 | 10 | 1.6727 | 0.2157 | 1.6727 |
55
- | No log | 0.2105 | 12 | 2.3411 | 0.1852 | 2.3411 |
56
- | No log | 0.2456 | 14 | 1.4252 | 0.2951 | 1.4252 |
57
- | No log | 0.2807 | 16 | 0.8885 | 0.3981 | 0.8885 |
58
- | No log | 0.3158 | 18 | 0.6824 | 0.4387 | 0.6824 |
59
- | No log | 0.3509 | 20 | 0.6604 | 0.4473 | 0.6604 |
60
- | No log | 0.3860 | 22 | 0.7208 | 0.3880 | 0.7208 |
61
- | No log | 0.4211 | 24 | 1.1639 | 0.2846 | 1.1639 |
62
- | No log | 0.4561 | 26 | 2.0330 | 0.1689 | 2.0330 |
63
- | No log | 0.4912 | 28 | 2.2500 | 0.1485 | 2.2500 |
64
- | No log | 0.5263 | 30 | 1.8145 | 0.1758 | 1.8145 |
65
- | No log | 0.5614 | 32 | 1.1982 | 0.2547 | 1.1982 |
66
- | No log | 0.5965 | 34 | 0.8111 | 0.3192 | 0.8111 |
67
- | No log | 0.6316 | 36 | 0.7359 | 0.3443 | 0.7359 |
68
- | No log | 0.6667 | 38 | 0.8012 | 0.3164 | 0.8012 |
69
- | No log | 0.7018 | 40 | 0.9036 | 0.2985 | 0.9036 |
70
- | No log | 0.7368 | 42 | 1.0075 | 0.2804 | 1.0075 |
71
- | No log | 0.7719 | 44 | 1.0761 | 0.2855 | 1.0761 |
72
- | No log | 0.8070 | 46 | 1.0400 | 0.2883 | 1.0400 |
73
- | No log | 0.8421 | 48 | 1.0379 | 0.2963 | 1.0379 |
74
- | No log | 0.8772 | 50 | 1.0163 | 0.3002 | 1.0163 |
75
- | No log | 0.9123 | 52 | 0.9760 | 0.3168 | 0.9760 |
76
- | No log | 0.9474 | 54 | 0.9286 | 0.3206 | 0.9286 |
77
- | No log | 0.9825 | 56 | 0.9107 | 0.3160 | 0.9107 |
 
 
 
78
 
79
 
80
  ### Framework versions
 
3
  tags:
4
  - generated_from_trainer
5
  model-index:
6
+ - name: arabert_cross_vocabulary_task1_fold1
7
  results: []
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
  should probably proofread and complete it, then remove this comment. -->
12
 
13
+ # arabert_cross_vocabulary_task1_fold1
14
 
15
  This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 0.8684
18
+ - Qwk: 0.0761
19
+ - Mse: 0.8559
20
 
21
  ## Model description
22
 
 
45
 
46
  ### Training results
47
 
48
+ | Training Loss | Epoch | Step | Validation Loss | Qwk | Mse |
49
+ |:-------------:|:------:|:----:|:---------------:|:-------:|:------:|
50
+ | No log | 0.0323 | 2 | 3.6877 | 0.0063 | 3.7124 |
51
+ | No log | 0.0645 | 4 | 1.1155 | -0.0070 | 1.1203 |
52
+ | No log | 0.0968 | 6 | 0.6955 | -0.0332 | 0.6928 |
53
+ | No log | 0.1290 | 8 | 0.5695 | 0.0031 | 0.5642 |
54
+ | No log | 0.1613 | 10 | 0.5455 | 0.0857 | 0.5422 |
55
+ | No log | 0.1935 | 12 | 0.5525 | 0.0754 | 0.5480 |
56
+ | No log | 0.2258 | 14 | 0.6246 | 0.0259 | 0.6189 |
57
+ | No log | 0.2581 | 16 | 0.7583 | 0.0025 | 0.7531 |
58
+ | No log | 0.2903 | 18 | 0.7013 | -0.0225 | 0.6949 |
59
+ | No log | 0.3226 | 20 | 0.6437 | -0.0653 | 0.6352 |
60
+ | No log | 0.3548 | 22 | 0.7347 | 0.0 | 0.7259 |
61
+ | No log | 0.3871 | 24 | 0.7826 | 0.0 | 0.7744 |
62
+ | No log | 0.4194 | 26 | 0.7467 | 0.0 | 0.7394 |
63
+ | No log | 0.4516 | 28 | 0.7010 | 0.0202 | 0.6941 |
64
+ | No log | 0.4839 | 30 | 0.7123 | 0.0202 | 0.7047 |
65
+ | No log | 0.5161 | 32 | 0.8848 | -0.0091 | 0.8746 |
66
+ | No log | 0.5484 | 34 | 1.0222 | 0.0893 | 1.0101 |
67
+ | No log | 0.5806 | 36 | 0.9830 | 0.0479 | 0.9708 |
68
+ | No log | 0.6129 | 38 | 0.8572 | 0.0761 | 0.8457 |
69
+ | No log | 0.6452 | 40 | 0.7135 | -0.0100 | 0.7039 |
70
+ | No log | 0.6774 | 42 | 0.5802 | -0.0370 | 0.5737 |
71
+ | No log | 0.7097 | 44 | 0.5562 | 0.0091 | 0.5515 |
72
+ | No log | 0.7419 | 46 | 0.5554 | 0.0394 | 0.5513 |
73
+ | No log | 0.7742 | 48 | 0.5528 | 0.0182 | 0.5481 |
74
+ | No log | 0.8065 | 50 | 0.5621 | 0.0036 | 0.5560 |
75
+ | No log | 0.8387 | 52 | 0.6051 | 0.0 | 0.5970 |
76
+ | No log | 0.8710 | 54 | 0.6890 | -0.0100 | 0.6790 |
77
+ | No log | 0.9032 | 56 | 0.7681 | 0.0151 | 0.7569 |
78
+ | No log | 0.9355 | 58 | 0.8212 | 0.0051 | 0.8094 |
79
+ | No log | 0.9677 | 60 | 0.8557 | 0.0761 | 0.8434 |
80
+ | No log | 1.0 | 62 | 0.8684 | 0.0761 | 0.8559 |
81
 
82
 
83
  ### Framework versions