salbatarni commited on
Commit
a775175
·
verified ·
1 Parent(s): d3b613d

Training in progress, step 170

Browse files
Files changed (3) hide show
  1. README.md +98 -40
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -3,20 +3,20 @@ base_model: aubmindlab/bert-base-arabertv02
3
  tags:
4
  - generated_from_trainer
5
  model-index:
6
- - name: arabert_cross_organization_task4_fold5
7
  results: []
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
  should probably proofread and complete it, then remove this comment. -->
12
 
13
- # arabert_cross_organization_task4_fold5
14
 
15
  This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 0.6331
18
- - Qwk: 0.7553
19
- - Mse: 0.6331
20
 
21
  ## Model description
22
 
@@ -36,49 +36,107 @@ More information needed
36
 
37
  The following hyperparameters were used during training:
38
  - learning_rate: 2e-05
39
- - train_batch_size: 16
40
- - eval_batch_size: 16
41
  - seed: 42
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
- - num_epochs: 1
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss | Qwk | Mse |
49
  |:-------------:|:------:|:----:|:---------------:|:------:|:------:|
50
- | No log | 0.0308 | 2 | 2.0367 | 0.0234 | 2.0367 |
51
- | No log | 0.0615 | 4 | 1.1409 | 0.1280 | 1.1409 |
52
- | No log | 0.0923 | 6 | 0.8741 | 0.3274 | 0.8741 |
53
- | No log | 0.1231 | 8 | 0.8340 | 0.3904 | 0.8340 |
54
- | No log | 0.1538 | 10 | 0.8426 | 0.4000 | 0.8426 |
55
- | No log | 0.1846 | 12 | 0.9271 | 0.5826 | 0.9271 |
56
- | No log | 0.2154 | 14 | 1.4000 | 0.6024 | 1.4000 |
57
- | No log | 0.2462 | 16 | 1.0552 | 0.6589 | 1.0552 |
58
- | No log | 0.2769 | 18 | 0.6830 | 0.6273 | 0.6830 |
59
- | No log | 0.3077 | 20 | 0.6122 | 0.5963 | 0.6122 |
60
- | No log | 0.3385 | 22 | 0.5979 | 0.6462 | 0.5979 |
61
- | No log | 0.3692 | 24 | 0.7483 | 0.7283 | 0.7483 |
62
- | No log | 0.4 | 26 | 0.9502 | 0.6982 | 0.9502 |
63
- | No log | 0.4308 | 28 | 0.9174 | 0.6851 | 0.9174 |
64
- | No log | 0.4615 | 30 | 0.7212 | 0.7283 | 0.7212 |
65
- | No log | 0.4923 | 32 | 0.5677 | 0.6943 | 0.5677 |
66
- | No log | 0.5231 | 34 | 0.5433 | 0.6389 | 0.5433 |
67
- | No log | 0.5538 | 36 | 0.5355 | 0.6498 | 0.5355 |
68
- | No log | 0.5846 | 38 | 0.5456 | 0.6949 | 0.5456 |
69
- | No log | 0.6154 | 40 | 0.5947 | 0.7382 | 0.5947 |
70
- | No log | 0.6462 | 42 | 0.6160 | 0.7515 | 0.6160 |
71
- | No log | 0.6769 | 44 | 0.6052 | 0.7452 | 0.6052 |
72
- | No log | 0.7077 | 46 | 0.5692 | 0.7479 | 0.5692 |
73
- | No log | 0.7385 | 48 | 0.5419 | 0.7327 | 0.5419 |
74
- | No log | 0.7692 | 50 | 0.5354 | 0.7327 | 0.5354 |
75
- | No log | 0.8 | 52 | 0.5444 | 0.7418 | 0.5444 |
76
- | No log | 0.8308 | 54 | 0.5688 | 0.7560 | 0.5688 |
77
- | No log | 0.8615 | 56 | 0.5846 | 0.7618 | 0.5846 |
78
- | No log | 0.8923 | 58 | 0.5931 | 0.7569 | 0.5931 |
79
- | No log | 0.9231 | 60 | 0.6173 | 0.7592 | 0.6173 |
80
- | No log | 0.9538 | 62 | 0.6289 | 0.7564 | 0.6289 |
81
- | No log | 0.9846 | 64 | 0.6331 | 0.7553 | 0.6331 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
82
 
83
 
84
  ### Framework versions
 
3
  tags:
4
  - generated_from_trainer
5
  model-index:
6
+ - name: arabert_cross_organization_task4_fold4
7
  results: []
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
  should probably proofread and complete it, then remove this comment. -->
12
 
13
+ # arabert_cross_organization_task4_fold4
14
 
15
  This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 0.5141
18
+ - Qwk: 0.7810
19
+ - Mse: 0.5141
20
 
21
  ## Model description
22
 
 
36
 
37
  The following hyperparameters were used during training:
38
  - learning_rate: 2e-05
39
+ - train_batch_size: 64
40
+ - eval_batch_size: 64
41
  - seed: 42
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
+ - num_epochs: 10
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss | Qwk | Mse |
49
  |:-------------:|:------:|:----:|:---------------:|:------:|:------:|
50
+ | No log | 0.1111 | 2 | 1.8597 | 0.1122 | 1.8597 |
51
+ | No log | 0.2222 | 4 | 1.3020 | 0.0409 | 1.3020 |
52
+ | No log | 0.3333 | 6 | 1.0459 | 0.3503 | 1.0459 |
53
+ | No log | 0.4444 | 8 | 0.7138 | 0.4906 | 0.7138 |
54
+ | No log | 0.5556 | 10 | 0.5900 | 0.6224 | 0.5900 |
55
+ | No log | 0.6667 | 12 | 0.5708 | 0.6214 | 0.5708 |
56
+ | No log | 0.7778 | 14 | 0.4881 | 0.6661 | 0.4881 |
57
+ | No log | 0.8889 | 16 | 0.4843 | 0.6880 | 0.4843 |
58
+ | No log | 1.0 | 18 | 0.5772 | 0.7185 | 0.5772 |
59
+ | No log | 1.1111 | 20 | 0.4344 | 0.7515 | 0.4344 |
60
+ | No log | 1.2222 | 22 | 0.4173 | 0.6853 | 0.4173 |
61
+ | No log | 1.3333 | 24 | 0.4326 | 0.7470 | 0.4326 |
62
+ | No log | 1.4444 | 26 | 0.6327 | 0.7329 | 0.6327 |
63
+ | No log | 1.5556 | 28 | 0.6486 | 0.7640 | 0.6486 |
64
+ | No log | 1.6667 | 30 | 0.4510 | 0.7633 | 0.4510 |
65
+ | No log | 1.7778 | 32 | 0.3885 | 0.7569 | 0.3885 |
66
+ | No log | 1.8889 | 34 | 0.4049 | 0.7722 | 0.4049 |
67
+ | No log | 2.0 | 36 | 0.5466 | 0.7900 | 0.5466 |
68
+ | No log | 2.1111 | 38 | 0.5445 | 0.7886 | 0.5445 |
69
+ | No log | 2.2222 | 40 | 0.4445 | 0.7553 | 0.4445 |
70
+ | No log | 2.3333 | 42 | 0.4182 | 0.7437 | 0.4182 |
71
+ | No log | 2.4444 | 44 | 0.4202 | 0.7536 | 0.4202 |
72
+ | No log | 2.5556 | 46 | 0.5364 | 0.7929 | 0.5364 |
73
+ | No log | 2.6667 | 48 | 0.6070 | 0.7880 | 0.6070 |
74
+ | No log | 2.7778 | 50 | 0.4960 | 0.7859 | 0.4960 |
75
+ | No log | 2.8889 | 52 | 0.4044 | 0.7719 | 0.4044 |
76
+ | No log | 3.0 | 54 | 0.3938 | 0.7606 | 0.3938 |
77
+ | No log | 3.1111 | 56 | 0.4669 | 0.7947 | 0.4669 |
78
+ | No log | 3.2222 | 58 | 0.5343 | 0.7820 | 0.5343 |
79
+ | No log | 3.3333 | 60 | 0.4763 | 0.7853 | 0.4763 |
80
+ | No log | 3.4444 | 62 | 0.4091 | 0.7835 | 0.4091 |
81
+ | No log | 3.5556 | 64 | 0.4119 | 0.7882 | 0.4119 |
82
+ | No log | 3.6667 | 66 | 0.4525 | 0.7813 | 0.4525 |
83
+ | No log | 3.7778 | 68 | 0.4761 | 0.7828 | 0.4761 |
84
+ | No log | 3.8889 | 70 | 0.4893 | 0.7931 | 0.4893 |
85
+ | No log | 4.0 | 72 | 0.4435 | 0.7862 | 0.4435 |
86
+ | No log | 4.1111 | 74 | 0.4754 | 0.7918 | 0.4754 |
87
+ | No log | 4.2222 | 76 | 0.5004 | 0.7931 | 0.5004 |
88
+ | No log | 4.3333 | 78 | 0.5554 | 0.8090 | 0.5554 |
89
+ | No log | 4.4444 | 80 | 0.5319 | 0.7947 | 0.5319 |
90
+ | No log | 4.5556 | 82 | 0.4459 | 0.7781 | 0.4459 |
91
+ | No log | 4.6667 | 84 | 0.4355 | 0.7725 | 0.4355 |
92
+ | No log | 4.7778 | 86 | 0.4699 | 0.7823 | 0.4699 |
93
+ | No log | 4.8889 | 88 | 0.4860 | 0.7900 | 0.4860 |
94
+ | No log | 5.0 | 90 | 0.4400 | 0.7892 | 0.4400 |
95
+ | No log | 5.1111 | 92 | 0.4221 | 0.7376 | 0.4221 |
96
+ | No log | 5.2222 | 94 | 0.4264 | 0.7879 | 0.4264 |
97
+ | No log | 5.3333 | 96 | 0.4728 | 0.8106 | 0.4728 |
98
+ | No log | 5.4444 | 98 | 0.5254 | 0.8043 | 0.5254 |
99
+ | No log | 5.5556 | 100 | 0.4876 | 0.8048 | 0.4876 |
100
+ | No log | 5.6667 | 102 | 0.4400 | 0.7767 | 0.4400 |
101
+ | No log | 5.7778 | 104 | 0.4191 | 0.7487 | 0.4191 |
102
+ | No log | 5.8889 | 106 | 0.4281 | 0.7643 | 0.4281 |
103
+ | No log | 6.0 | 108 | 0.4819 | 0.7888 | 0.4819 |
104
+ | No log | 6.1111 | 110 | 0.5487 | 0.8063 | 0.5487 |
105
+ | No log | 6.2222 | 112 | 0.6060 | 0.7903 | 0.6060 |
106
+ | No log | 6.3333 | 114 | 0.5618 | 0.7847 | 0.5618 |
107
+ | No log | 6.4444 | 116 | 0.5080 | 0.7689 | 0.5080 |
108
+ | No log | 6.5556 | 118 | 0.4883 | 0.7543 | 0.4883 |
109
+ | No log | 6.6667 | 120 | 0.4979 | 0.7597 | 0.4979 |
110
+ | No log | 6.7778 | 122 | 0.5155 | 0.7757 | 0.5155 |
111
+ | No log | 6.8889 | 124 | 0.5239 | 0.7883 | 0.5239 |
112
+ | No log | 7.0 | 126 | 0.5025 | 0.7973 | 0.5025 |
113
+ | No log | 7.1111 | 128 | 0.4784 | 0.7894 | 0.4784 |
114
+ | No log | 7.2222 | 130 | 0.4608 | 0.7714 | 0.4608 |
115
+ | No log | 7.3333 | 132 | 0.4592 | 0.7608 | 0.4592 |
116
+ | No log | 7.4444 | 134 | 0.4736 | 0.7898 | 0.4736 |
117
+ | No log | 7.5556 | 136 | 0.5099 | 0.7905 | 0.5099 |
118
+ | No log | 7.6667 | 138 | 0.5575 | 0.8010 | 0.5575 |
119
+ | No log | 7.7778 | 140 | 0.5556 | 0.8167 | 0.5556 |
120
+ | No log | 7.8889 | 142 | 0.5181 | 0.7957 | 0.5181 |
121
+ | No log | 8.0 | 144 | 0.4691 | 0.7885 | 0.4691 |
122
+ | No log | 8.1111 | 146 | 0.4424 | 0.7890 | 0.4424 |
123
+ | No log | 8.2222 | 148 | 0.4411 | 0.7803 | 0.4411 |
124
+ | No log | 8.3333 | 150 | 0.4646 | 0.7859 | 0.4646 |
125
+ | No log | 8.4444 | 152 | 0.4939 | 0.8070 | 0.4939 |
126
+ | No log | 8.5556 | 154 | 0.5186 | 0.8172 | 0.5186 |
127
+ | No log | 8.6667 | 156 | 0.5307 | 0.8195 | 0.5307 |
128
+ | No log | 8.7778 | 158 | 0.5184 | 0.8146 | 0.5184 |
129
+ | No log | 8.8889 | 160 | 0.4898 | 0.8099 | 0.4898 |
130
+ | No log | 9.0 | 162 | 0.4741 | 0.7957 | 0.4741 |
131
+ | No log | 9.1111 | 164 | 0.4724 | 0.7948 | 0.4724 |
132
+ | No log | 9.2222 | 166 | 0.4828 | 0.7937 | 0.4828 |
133
+ | No log | 9.3333 | 168 | 0.4861 | 0.7937 | 0.4861 |
134
+ | No log | 9.4444 | 170 | 0.4940 | 0.7848 | 0.4940 |
135
+ | No log | 9.5556 | 172 | 0.5042 | 0.7810 | 0.5042 |
136
+ | No log | 9.6667 | 174 | 0.5129 | 0.7810 | 0.5129 |
137
+ | No log | 9.7778 | 176 | 0.5142 | 0.7810 | 0.5142 |
138
+ | No log | 9.8889 | 178 | 0.5135 | 0.7810 | 0.5135 |
139
+ | No log | 10.0 | 180 | 0.5141 | 0.7810 | 0.5141 |
140
 
141
 
142
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7afc4ccbba2984ca11dd6e5d07c6db12bce445851c498e5af279d0e85908dd71
3
  size 540799996
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b85a047ef72edad62a72a59c64fc31081b67e0469e1909f30e0d829e7728d9f7
3
  size 540799996
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4db11486a0311db824747a9287e17c8743cb18d46743f6c19e50ac03ae117469
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:33ea61851834233822e42431eb54f3bce7147e6573a715798eb2c1ee1710f9a1
3
  size 5240