---
library_name: transformers
base_model: aubmindlab/bert-base-arabertv02
tags:
- generated_from_trainer
model-index:
- name: ArabicNewSplits7_B_usingWellWrittenEssays_FineTuningAraBERT_run999_AugV5_k20_task7_organization
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# ArabicNewSplits7_B_usingWellWrittenEssays_FineTuningAraBERT_run999_AugV5_k20_task7_organization

This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on an unspecified dataset.
It achieves the following results on the evaluation set:
- Loss: 0.6785
- Qwk: 0.2883
- Mse: 0.6785
- Rmse: 0.8237
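
The reported numbers are internally consistent: Loss equals Mse, which suggests (an assumption, the card does not say so) that the model was trained as a regressor with a mean squared error loss, and Rmse is the square root of Mse. Assuming Qwk stands for quadratic weighted kappa, a common agreement metric for ordinal scores, a minimal pure-Python sketch of these metrics might look like the following. The function names and the toy labels are illustrative and are not taken from the training code.

```python
import math
from collections import Counter


def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def quadratic_weighted_kappa(y_true, y_pred):
    """Cohen's kappa with quadratic weights for integer labels.

    Weights use the label values directly, which matches the usual
    index-based definition when the labels are contiguous integers.
    """
    n = len(y_true)
    # Observed weighted disagreement: squared distance of each (true, pred) pair.
    observed = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Expected weighted disagreement if the two ratings were independent.
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    expected = sum(
        true_counts[a] * pred_counts[b] * (a - b) ** 2
        for a in true_counts
        for b in pred_counts
    ) / n
    if expected == 0:
        return 1.0  # convention chosen here: a single shared label counts as agreement
    return 1.0 - observed / expected


# Toy labels, purely illustrative.
y_true = [0, 1, 2, 2]
y_pred = [0, 2, 2, 1]
print("MSE:", mse(y_true, y_pred))
print("RMSE:", math.sqrt(mse(y_true, y_pred)))
print("QWK:", quadratic_weighted_kappa(y_true, y_pred))

# Consistency check on the reported evaluation numbers.
print(round(math.sqrt(0.6785), 4))  # 0.8237, matching the reported Rmse
```

If the model outputs continuous scores, they would need to be rounded to integer labels before computing Qwk; whether the original evaluation did this is not stated.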

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 100
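
To make the schedule concrete, here is a rough sketch of how a linear scheduler scales the learning rate over training. It rests on several assumptions not stated in the card: no warmup steps (none are listed), a single device with no gradient accumulation, and 108 optimizer steps per epoch (read off the training results table, where step 108 corresponds to epoch 1.0). `linear_lr` is a hypothetical helper written for illustration, not a Transformers function.

```python
BASE_LR = 2e-05        # learning_rate from the list above
NUM_EPOCHS = 100       # num_epochs from the list above
STEPS_PER_EPOCH = 108  # assumption: inferred from the results table (step 108 = epoch 1.0)
TOTAL_STEPS = NUM_EPOCHS * STEPS_PER_EPOCH


def linear_lr(step, base_lr=BASE_LR, total_steps=TOTAL_STEPS, warmup_steps=0):
    """Learning rate after `step` optimizer steps: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the start, at the last logged step, halfway, and at the end.
for step in (0, 532, 5400, 10800):
    print(step, linear_lr(step))
```

Under these assumptions the learning rate at the last logged step (532) is still close to its initial value, since only about 5% of the scheduled steps have elapsed.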

### Training results

| Training Loss | Epoch  | Step | Validation Loss | Qwk     | Mse    | Rmse   |
|:-------------:|:------:|:----:|:---------------:|:-------:|:------:|:------:|
| No log        | 0.0185 | 2    | 2.6272          | -0.0729 | 2.6272 | 1.6209 |
| No log        | 0.0370 | 4    | 1.2463          | 0.0983  | 1.2463 | 1.1164 |
| No log        | 0.0556 | 6    | 0.7915          | 0.0441  | 0.7915 | 0.8897 |
| No log        | 0.0741 | 8    | 0.7705          | 0.1368  | 0.7705 | 0.8778 |
| No log        | 0.0926 | 10   | 0.6891          | 0.2955  | 0.6891 | 0.8301 |
| No log        | 0.1111 | 12   | 0.6805          | 0.3141  | 0.6805 | 0.8249 |
| No log        | 0.1296 | 14   | 0.7554          | 0.2223  | 0.7554 | 0.8691 |
| No log        | 0.1481 | 16   | 0.8799          | 0.2259  | 0.8799 | 0.9380 |
| No log        | 0.1667 | 18   | 0.7391          | 0.3590  | 0.7391 | 0.8597 |
| No log        | 0.1852 | 20   | 0.6912          | 0.3348  | 0.6912 | 0.8314 |
| No log        | 0.2037 | 22   | 0.8114          | 0.2772  | 0.8114 | 0.9008 |
| No log        | 0.2222 | 24   | 0.7259          | 0.2813  | 0.7259 | 0.8520 |
| No log        | 0.2407 | 26   | 0.6871          | 0.3050  | 0.6871 | 0.8289 |
| No log        | 0.2593 | 28   | 1.3581          | 0.2590  | 1.3581 | 1.1654 |
| No log        | 0.2778 | 30   | 1.7250          | 0.1895  | 1.7250 | 1.3134 |
| No log        | 0.2963 | 32   | 1.2685          | 0.1895  | 1.2685 | 1.1263 |
| No log        | 0.3148 | 34   | 0.7739          | 0.3606  | 0.7739 | 0.8797 |
| No log        | 0.3333 | 36   | 0.6666          | 0.1983  | 0.6666 | 0.8165 |
| No log        | 0.3519 | 38   | 0.6695          | 0.2046  | 0.6695 | 0.8182 |
| No log        | 0.3704 | 40   | 0.7431          | 0.3564  | 0.7431 | 0.8620 |
| No log        | 0.3889 | 42   | 0.9422          | 0.3579  | 0.9422 | 0.9707 |
| No log        | 0.4074 | 44   | 1.0279          | 0.3516  | 1.0279 | 1.0138 |
| No log        | 0.4259 | 46   | 0.9828          | 0.3516  | 0.9828 | 0.9914 |
| No log        | 0.4444 | 48   | 0.8631          | 0.3777  | 0.8631 | 0.9290 |
| No log        | 0.4630 | 50   | 0.7154          | 0.3746  | 0.7154 | 0.8458 |
| No log        | 0.4815 | 52   | 0.6521          | 0.4219  | 0.6521 | 0.8075 |
| No log        | 0.5    | 54   | 0.6224          | 0.3092  | 0.6224 | 0.7889 |
| No log        | 0.5185 | 56   | 0.6890          | 0.3819  | 0.6890 | 0.8301 |
| No log        | 0.5370 | 58   | 1.0277          | 0.3166  | 1.0277 | 1.0138 |
| No log        | 0.5556 | 60   | 1.2795          | 0.2772  | 1.2795 | 1.1312 |
| No log        | 0.5741 | 62   | 1.2126          | 0.2909  | 1.2126 | 1.1012 |
| No log        | 0.5926 | 64   | 0.8438          | 0.4255  | 0.8438 | 0.9186 |
| No log        | 0.6111 | 66   | 0.5983          | 0.4463  | 0.5983 | 0.7735 |
| No log        | 0.6296 | 68   | 0.6445          | 0.4674  | 0.6445 | 0.8028 |
| No log        | 0.6481 | 70   | 0.6404          | 0.4737  | 0.6404 | 0.8003 |
| No log        | 0.6667 | 72   | 0.6235          | 0.4419  | 0.6235 | 0.7897 |
| No log        | 0.6852 | 74   | 0.9033          | 0.4096  | 0.9033 | 0.9504 |
| No log        | 0.7037 | 76   | 1.0313          | 0.2910  | 1.0313 | 1.0155 |
| No log        | 0.7222 | 78   | 0.8396          | 0.4568  | 0.8396 | 0.9163 |
| No log        | 0.7407 | 80   | 0.6278          | 0.3945  | 0.6278 | 0.7923 |
| No log        | 0.7593 | 82   | 0.6544          | 0.4345  | 0.6544 | 0.8090 |
| No log        | 0.7778 | 84   | 0.6348          | 0.4322  | 0.6348 | 0.7968 |
| No log        | 0.7963 | 86   | 0.6784          | 0.2995  | 0.6784 | 0.8236 |
| No log        | 0.8148 | 88   | 0.9486          | 0.4092  | 0.9486 | 0.9740 |
| No log        | 0.8333 | 90   | 1.1878          | 0.2206  | 1.1878 | 1.0899 |
| No log        | 0.8519 | 92   | 1.1619          | 0.2191  | 1.1619 | 1.0779 |
| No log        | 0.8704 | 94   | 0.9051          | 0.4347  | 0.9051 | 0.9514 |
| No log        | 0.8889 | 96   | 0.7585          | 0.3494  | 0.7585 | 0.8709 |
| No log        | 0.9074 | 98   | 0.6845          | 0.3196  | 0.6845 | 0.8273 |
| No log        | 0.9259 | 100  | 0.7034          | 0.2467  | 0.7034 | 0.8387 |
| No log        | 0.9444 | 102  | 0.7146          | 0.3302  | 0.7146 | 0.8453 |
| No log        | 0.9630 | 104  | 0.8031          | 0.3918  | 0.8031 | 0.8962 |
| No log        | 0.9815 | 106  | 0.9954          | 0.3849  | 0.9954 | 0.9977 |
| No log        | 1.0    | 108  | 1.0793          | 0.3269  | 1.0793 | 1.0389 |
| No log        | 1.0185 | 110  | 1.0460          | 0.3697  | 1.0460 | 1.0227 |
| No log        | 1.0370 | 112  | 0.8320          | 0.3560  | 0.8320 | 0.9121 |
| No log        | 1.0556 | 114  | 0.7203          | 0.3069  | 0.7203 | 0.8487 |
| No log        | 1.0741 | 116  | 0.6927          | 0.3060  | 0.6927 | 0.8323 |
| No log        | 1.0926 | 118  | 0.7416          | 0.2518  | 0.7416 | 0.8612 |
| No log        | 1.1111 | 120  | 0.8737          | 0.3892  | 0.8737 | 0.9347 |
| No log        | 1.1296 | 122  | 1.1036          | 0.3088  | 1.1036 | 1.0505 |
| No log        | 1.1481 | 124  | 1.0979          | 0.3404  | 1.0979 | 1.0478 |
| No log        | 1.1667 | 126  | 0.9128          | 0.3709  | 0.9128 | 0.9554 |
| No log        | 1.1852 | 128  | 0.8296          | 0.2843  | 0.8296 | 0.9108 |
| No log        | 1.2037 | 130  | 0.7985          | 0.2904  | 0.7985 | 0.8936 |
| No log        | 1.2222 | 132  | 0.8440          | 0.4080  | 0.8440 | 0.9187 |
| No log        | 1.2407 | 134  | 0.9444          | 0.3676  | 0.9444 | 0.9718 |
| No log        | 1.2593 | 136  | 1.0034          | 0.3337  | 1.0034 | 1.0017 |
| No log        | 1.2778 | 138  | 0.8877          | 0.4092  | 0.8877 | 0.9422 |
| No log        | 1.2963 | 140  | 0.7385          | 0.3637  | 0.7385 | 0.8593 |
| No log        | 1.3148 | 142  | 0.6943          | 0.2498  | 0.6943 | 0.8333 |
| No log        | 1.3333 | 144  | 0.6994          | 0.2471  | 0.6994 | 0.8363 |
| No log        | 1.3519 | 146  | 0.7091          | 0.2784  | 0.7091 | 0.8421 |
| No log        | 1.3704 | 148  | 0.7587          | 0.3234  | 0.7587 | 0.8710 |
| No log        | 1.3889 | 150  | 0.8943          | 0.3538  | 0.8943 | 0.9457 |
| No log        | 1.4074 | 152  | 0.9652          | 0.3029  | 0.9652 | 0.9824 |
| No log        | 1.4259 | 154  | 0.8352          | 0.4404  | 0.8352 | 0.9139 |
| No log        | 1.4444 | 156  | 0.6769          | 0.2558  | 0.6769 | 0.8228 |
| No log        | 1.4630 | 158  | 0.6655          | 0.3141  | 0.6655 | 0.8158 |
| No log        | 1.4815 | 160  | 0.6565          | 0.3426  | 0.6565 | 0.8102 |
| No log        | 1.5    | 162  | 0.7265          | 0.3817  | 0.7265 | 0.8523 |
| No log        | 1.5185 | 164  | 0.8765          | 0.3499  | 0.8765 | 0.9362 |
| No log        | 1.5370 | 166  | 1.0127          | 0.2898  | 1.0127 | 1.0064 |
| No log        | 1.5556 | 168  | 0.9417          | 0.3052  | 0.9417 | 0.9704 |
| No log        | 1.5741 | 170  | 0.7469          | 0.3562  | 0.7469 | 0.8642 |
| No log        | 1.5926 | 172  | 0.6349          | 0.3763  | 0.6349 | 0.7968 |
| No log        | 1.6111 | 174  | 0.6206          | 0.2877  | 0.6206 | 0.7878 |
| No log        | 1.6296 | 176  | 0.6285          | 0.3399  | 0.6285 | 0.7928 |
| No log        | 1.6481 | 178  | 0.6664          | 0.3099  | 0.6664 | 0.8163 |
| No log        | 1.6667 | 180  | 0.7391          | 0.3746  | 0.7391 | 0.8597 |
| No log        | 1.6852 | 182  | 0.7609          | 0.3746  | 0.7609 | 0.8723 |
| No log        | 1.7037 | 184  | 0.7273          | 0.3372  | 0.7273 | 0.8528 |
| No log        | 1.7222 | 186  | 0.6796          | 0.2227  | 0.6796 | 0.8244 |
| No log        | 1.7407 | 188  | 0.7217          | 0.2383  | 0.7217 | 0.8495 |
| No log        | 1.7593 | 190  | 0.8368          | 0.3456  | 0.8368 | 0.9148 |
| No log        | 1.7778 | 192  | 0.8586          | 0.3688  | 0.8586 | 0.9266 |
| No log        | 1.7963 | 194  | 0.7750          | 0.2871  | 0.7750 | 0.8803 |
| No log        | 1.8148 | 196  | 0.7648          | 0.2871  | 0.7648 | 0.8746 |
| No log        | 1.8333 | 198  | 0.7913          | 0.3095  | 0.7913 | 0.8896 |
| No log        | 1.8519 | 200  | 0.7951          | 0.2926  | 0.7951 | 0.8917 |
| No log        | 1.8704 | 202  | 0.8246          | 0.2471  | 0.8246 | 0.9081 |
| No log        | 1.8889 | 204  | 0.8560          | 0.2364  | 0.8560 | 0.9252 |
| No log        | 1.9074 | 206  | 0.9938          | 0.3052  | 0.9938 | 0.9969 |
| No log        | 1.9259 | 208  | 1.1704          | 0.2643  | 1.1704 | 1.0818 |
| No log        | 1.9444 | 210  | 1.1412          | 0.2501  | 1.1412 | 1.0683 |
| No log        | 1.9630 | 212  | 0.9513          | 0.3601  | 0.9513 | 0.9754 |
| No log        | 1.9815 | 214  | 0.8096          | 0.2904  | 0.8096 | 0.8998 |
| No log        | 2.0    | 216  | 0.8180          | 0.2904  | 0.8180 | 0.9044 |
| No log        | 2.0185 | 218  | 0.9502          | 0.3439  | 0.9502 | 0.9748 |
| No log        | 2.0370 | 220  | 0.9671          | 0.3381  | 0.9671 | 0.9834 |
| No log        | 2.0556 | 222  | 0.9231          | 0.3439  | 0.9231 | 0.9608 |
| No log        | 2.0741 | 224  | 0.8631          | 0.3499  | 0.8631 | 0.9290 |
| No log        | 2.0926 | 226  | 0.7739          | 0.4239  | 0.7739 | 0.8797 |
| No log        | 2.1111 | 228  | 0.7480          | 0.2749  | 0.7480 | 0.8648 |
| No log        | 2.1296 | 230  | 0.7852          | 0.4114  | 0.7852 | 0.8861 |
| No log        | 2.1481 | 232  | 0.8783          | 0.3560  | 0.8783 | 0.9372 |
| No log        | 2.1667 | 234  | 0.8716          | 0.3678  | 0.8716 | 0.9336 |
| No log        | 2.1852 | 236  | 0.8379          | 0.4366  | 0.8379 | 0.9154 |
| No log        | 2.2037 | 238  | 0.7586          | 0.3700  | 0.7586 | 0.8710 |
| No log        | 2.2222 | 240  | 0.7216          | 0.3340  | 0.7216 | 0.8495 |
| No log        | 2.2407 | 242  | 0.7426          | 0.3569  | 0.7426 | 0.8617 |
| No log        | 2.2593 | 244  | 0.8270          | 0.4153  | 0.8270 | 0.9094 |
| No log        | 2.2778 | 246  | 0.9176          | 0.3381  | 0.9176 | 0.9579 |
| No log        | 2.2963 | 248  | 0.8500          | 0.3799  | 0.8500 | 0.9219 |
| No log        | 2.3148 | 250  | 0.6978          | 0.3544  | 0.6978 | 0.8354 |
| No log        | 2.3333 | 252  | 0.6435          | 0.3144  | 0.6435 | 0.8022 |
| No log        | 2.3519 | 254  | 0.6297          | 0.3625  | 0.6297 | 0.7935 |
| No log        | 2.3704 | 256  | 0.6371          | 0.3840  | 0.6371 | 0.7982 |
| No log        | 2.3889 | 258  | 0.6757          | 0.3942  | 0.6757 | 0.8220 |
| No log        | 2.4074 | 260  | 0.6659          | 0.3942  | 0.6659 | 0.8160 |
| No log        | 2.4259 | 262  | 0.6379          | 0.3976  | 0.6379 | 0.7987 |
| No log        | 2.4444 | 264  | 0.6425          | 0.3197  | 0.6425 | 0.8016 |
| No log        | 2.4630 | 266  | 0.6550          | 0.2537  | 0.6550 | 0.8093 |
| No log        | 2.4815 | 268  | 0.6578          | 0.2787  | 0.6578 | 0.8110 |
| No log        | 2.5    | 270  | 0.7050          | 0.3195  | 0.7050 | 0.8396 |
| No log        | 2.5185 | 272  | 0.7764          | 0.4272  | 0.7764 | 0.8811 |
| No log        | 2.5370 | 274  | 0.7354          | 0.4745  | 0.7354 | 0.8576 |
| No log        | 2.5556 | 276  | 0.6619          | 0.3656  | 0.6619 | 0.8136 |
| No log        | 2.5741 | 278  | 0.6357          | 0.4207  | 0.6357 | 0.7973 |
| No log        | 2.5926 | 280  | 0.6774          | 0.4404  | 0.6774 | 0.8231 |
| No log        | 2.6111 | 282  | 0.7805          | 0.4721  | 0.7805 | 0.8835 |
| No log        | 2.6296 | 284  | 0.8090          | 0.4705  | 0.8090 | 0.8995 |
| No log        | 2.6481 | 286  | 0.6898          | 0.4144  | 0.6898 | 0.8305 |
| No log        | 2.6667 | 288  | 0.5588          | 0.4243  | 0.5588 | 0.7475 |
| No log        | 2.6852 | 290  | 0.5194          | 0.4147  | 0.5194 | 0.7207 |
| No log        | 2.7037 | 292  | 0.5186          | 0.4722  | 0.5186 | 0.7201 |
| No log        | 2.7222 | 294  | 0.5234          | 0.4722  | 0.5234 | 0.7235 |
| No log        | 2.7407 | 296  | 0.5440          | 0.4819  | 0.5440 | 0.7376 |
| No log        | 2.7593 | 298  | 0.5435          | 0.4642  | 0.5435 | 0.7373 |
| No log        | 2.7778 | 300  | 0.5318          | 0.3702  | 0.5318 | 0.7293 |
| No log        | 2.7963 | 302  | 0.5482          | 0.4384  | 0.5482 | 0.7404 |
| No log        | 2.8148 | 304  | 0.5548          | 0.3947  | 0.5548 | 0.7448 |
| No log        | 2.8333 | 306  | 0.5691          | 0.3494  | 0.5691 | 0.7544 |
| No log        | 2.8519 | 308  | 0.6289          | 0.4035  | 0.6289 | 0.7931 |
| No log        | 2.8704 | 310  | 0.6465          | 0.4035  | 0.6465 | 0.8041 |
| No log        | 2.8889 | 312  | 0.6420          | 0.3755  | 0.6420 | 0.8013 |
| No log        | 2.9074 | 314  | 0.6189          | 0.3092  | 0.6189 | 0.7867 |
| No log        | 2.9259 | 316  | 0.6213          | 0.3092  | 0.6213 | 0.7883 |
| No log        | 2.9444 | 318  | 0.6413          | 0.3092  | 0.6413 | 0.8008 |
| No log        | 2.9630 | 320  | 0.6483          | 0.3092  | 0.6483 | 0.8052 |
| No log        | 2.9815 | 322  | 0.6706          | 0.3387  | 0.6706 | 0.8189 |
| No log        | 3.0    | 324  | 0.7129          | 0.2883  | 0.7129 | 0.8444 |
| No log        | 3.0185 | 326  | 0.7934          | 0.4224  | 0.7934 | 0.8907 |
| No log        | 3.0370 | 328  | 0.8775          | 0.3473  | 0.8775 | 0.9368 |
| No log        | 3.0556 | 330  | 0.8439          | 0.4624  | 0.8439 | 0.9187 |
| No log        | 3.0741 | 332  | 0.7766          | 0.3099  | 0.7766 | 0.8813 |
| No log        | 3.0926 | 334  | 0.6686          | 0.2981  | 0.6686 | 0.8177 |
| No log        | 3.1111 | 336  | 0.6458          | 0.3123  | 0.6458 | 0.8036 |
| No log        | 3.1296 | 338  | 0.6396          | 0.3166  | 0.6396 | 0.7998 |
| No log        | 3.1481 | 340  | 0.6458          | 0.3092  | 0.6458 | 0.8036 |
| No log        | 3.1667 | 342  | 0.6672          | 0.3312  | 0.6672 | 0.8168 |
| No log        | 3.1852 | 344  | 0.7297          | 0.3099  | 0.7297 | 0.8542 |
| No log        | 3.2037 | 346  | 0.7574          | 0.4197  | 0.7574 | 0.8703 |
| No log        | 3.2222 | 348  | 0.6859          | 0.3261  | 0.6859 | 0.8282 |
| No log        | 3.2407 | 350  | 0.6214          | 0.3312  | 0.6214 | 0.7883 |
| No log        | 3.2593 | 352  | 0.5847          | 0.3166  | 0.5847 | 0.7646 |
| No log        | 3.2778 | 354  | 0.5664          | 0.3354  | 0.5664 | 0.7526 |
| No log        | 3.2963 | 356  | 0.5628          | 0.3354  | 0.5628 | 0.7502 |
| No log        | 3.3148 | 358  | 0.5628          | 0.3354  | 0.5628 | 0.7502 |
| No log        | 3.3333 | 360  | 0.5712          | 0.3006  | 0.5712 | 0.7558 |
| No log        | 3.3519 | 362  | 0.5911          | 0.3323  | 0.5911 | 0.7689 |
| No log        | 3.3704 | 364  | 0.5943          | 0.3243  | 0.5943 | 0.7709 |
| No log        | 3.3889 | 366  | 0.5777          | 0.3039  | 0.5777 | 0.7600 |
| No log        | 3.4074 | 368  | 0.5638          | 0.3354  | 0.5638 | 0.7509 |
| No log        | 3.4259 | 370  | 0.5524          | 0.3889  | 0.5524 | 0.7432 |
| No log        | 3.4444 | 372  | 0.5468          | 0.3369  | 0.5468 | 0.7395 |
| No log        | 3.4630 | 374  | 0.5588          | 0.4845  | 0.5588 | 0.7476 |
| No log        | 3.4815 | 376  | 0.5484          | 0.4060  | 0.5484 | 0.7406 |
| No log        | 3.5    | 378  | 0.5375          | 0.3274  | 0.5375 | 0.7332 |
| No log        | 3.5185 | 380  | 0.5438          | 0.2987  | 0.5438 | 0.7375 |
| No log        | 3.5370 | 382  | 0.5484          | 0.3273  | 0.5484 | 0.7405 |
| No log        | 3.5556 | 384  | 0.5405          | 0.2987  | 0.5405 | 0.7352 |
| No log        | 3.5741 | 386  | 0.5429          | 0.2996  | 0.5429 | 0.7368 |
| No log        | 3.5926 | 388  | 0.5399          | 0.2641  | 0.5399 | 0.7348 |
| No log        | 3.6111 | 390  | 0.5373          | 0.2641  | 0.5373 | 0.7330 |
| No log        | 3.6296 | 392  | 0.5325          | 0.2996  | 0.5325 | 0.7297 |
| No log        | 3.6481 | 394  | 0.5277          | 0.3953  | 0.5277 | 0.7264 |
| No log        | 3.6667 | 396  | 0.5433          | 0.3416  | 0.5433 | 0.7371 |
| No log        | 3.6852 | 398  | 0.5704          | 0.3341  | 0.5704 | 0.7553 |
| No log        | 3.7037 | 400  | 0.5767          | 0.3341  | 0.5767 | 0.7594 |
| No log        | 3.7222 | 402  | 0.5726          | 0.3341  | 0.5726 | 0.7567 |
| No log        | 3.7407 | 404  | 0.5866          | 0.3341  | 0.5866 | 0.7659 |
| No log        | 3.7593 | 406  | 0.5951          | 0.3312  | 0.5951 | 0.7714 |
| No log        | 3.7778 | 408  | 0.6172          | 0.3312  | 0.6172 | 0.7856 |
| No log        | 3.7963 | 410  | 0.6595          | 0.3843  | 0.6595 | 0.8121 |
| No log        | 3.8148 | 412  | 0.6781          | 0.3843  | 0.6781 | 0.8235 |
| No log        | 3.8333 | 414  | 0.6525          | 0.4190  | 0.6525 | 0.8078 |
| No log        | 3.8519 | 416  | 0.6357          | 0.4020  | 0.6357 | 0.7973 |
| No log        | 3.8704 | 418  | 0.6030          | 0.3622  | 0.6030 | 0.7765 |
| No log        | 3.8889 | 420  | 0.5870          | 0.3341  | 0.5870 | 0.7662 |
| No log        | 3.9074 | 422  | 0.5679          | 0.3675  | 0.5679 | 0.7536 |
| No log        | 3.9259 | 424  | 0.5573          | 0.3995  | 0.5573 | 0.7465 |
| No log        | 3.9444 | 426  | 0.5627          | 0.4194  | 0.5627 | 0.7501 |
| No log        | 3.9630 | 428  | 0.5972          | 0.4292  | 0.5972 | 0.7728 |
| No log        | 3.9815 | 430  | 0.6792          | 0.4815  | 0.6792 | 0.8241 |
| No log        | 4.0    | 432  | 0.7062          | 0.4644  | 0.7062 | 0.8404 |
| No log        | 4.0185 | 434  | 0.6888          | 0.4644  | 0.6888 | 0.8299 |
| No log        | 4.0370 | 436  | 0.6759          | 0.4409  | 0.6759 | 0.8221 |
| No log        | 4.0556 | 438  | 0.6074          | 0.4044  | 0.6074 | 0.7793 |
| No log        | 4.0741 | 440  | 0.5911          | 0.4027  | 0.5911 | 0.7689 |
| No log        | 4.0926 | 442  | 0.5959          | 0.3782  | 0.5959 | 0.7719 |
| No log        | 4.1111 | 444  | 0.5990          | 0.3494  | 0.5990 | 0.7740 |
| No log        | 4.1296 | 446  | 0.6249          | 0.3465  | 0.6249 | 0.7905 |
| No log        | 4.1481 | 448  | 0.6833          | 0.3789  | 0.6833 | 0.8266 |
| No log        | 4.1667 | 450  | 0.6998          | 0.3789  | 0.6998 | 0.8365 |
| No log        | 4.1852 | 452  | 0.6573          | 0.3465  | 0.6573 | 0.8108 |
| No log        | 4.2037 | 454  | 0.6596          | 0.3465  | 0.6596 | 0.8122 |
| No log        | 4.2222 | 456  | 0.6712          | 0.3387  | 0.6712 | 0.8193 |
| No log        | 4.2407 | 458  | 0.6840          | 0.4052  | 0.6840 | 0.8270 |
| No log        | 4.2593 | 460  | 0.6763          | 0.3444  | 0.6763 | 0.8224 |
| No log        | 4.2778 | 462  | 0.6450          | 0.3387  | 0.6450 | 0.8031 |
| No log        | 4.2963 | 464  | 0.6399          | 0.3387  | 0.6399 | 0.8000 |
| No log        | 4.3148 | 466  | 0.6431          | 0.3387  | 0.6431 | 0.8019 |
| No log        | 4.3333 | 468  | 0.6471          | 0.3167  | 0.6471 | 0.8044 |
| No log        | 4.3519 | 470  | 0.6554          | 0.3789  | 0.6554 | 0.8096 |
| No log        | 4.3704 | 472  | 0.6469          | 0.3471  | 0.6469 | 0.8043 |
| No log        | 4.3889 | 474  | 0.6061          | 0.3976  | 0.6061 | 0.7785 |
| No log        | 4.4074 | 476  | 0.5654          | 0.3754  | 0.5654 | 0.7519 |
| No log        | 4.4259 | 478  | 0.5624          | 0.3258  | 0.5624 | 0.7499 |
| No log        | 4.4444 | 480  | 0.5691          | 0.2923  | 0.5691 | 0.7544 |
| No log        | 4.4630 | 482  | 0.5774          | 0.2963  | 0.5774 | 0.7599 |
| No log        | 4.4815 | 484  | 0.5919          | 0.3575  | 0.5919 | 0.7693 |
| No log        | 4.5    | 486  | 0.6617          | 0.3673  | 0.6617 | 0.8134 |
| No log        | 4.5185 | 488  | 0.7257          | 0.3444  | 0.7257 | 0.8519 |
| No log        | 4.5370 | 490  | 0.7068          | 0.3444  | 0.7068 | 0.8407 |
| No log        | 4.5556 | 492  | 0.6779          | 0.3167  | 0.6779 | 0.8233 |
| No log        | 4.5741 | 494  | 0.6681          | 0.3594  | 0.6681 | 0.8174 |
| No log        | 4.5926 | 496  | 0.6884          | 0.3444  | 0.6884 | 0.8297 |
| No log        | 4.6111 | 498  | 0.7111          | 0.3444  | 0.7111 | 0.8432 |
| 0.2421        | 4.6296 | 500  | 0.7278          | 0.3444  | 0.7278 | 0.8531 |
| 0.2421        | 4.6481 | 502  | 0.6692          | 0.3312  | 0.6692 | 0.8181 |
| 0.2421        | 4.6667 | 504  | 0.6313          | 0.3166  | 0.6313 | 0.7946 |
| 0.2421        | 4.6852 | 506  | 0.6121          | 0.3445  | 0.6121 | 0.7824 |
| 0.2421        | 4.7037 | 508  | 0.6076          | 0.3445  | 0.6076 | 0.7795 |
| 0.2421        | 4.7222 | 510  | 0.6405          | 0.3572  | 0.6405 | 0.8003 |
| 0.2421        | 4.7407 | 512  | 0.7083          | 0.4052  | 0.7083 | 0.8416 |
| 0.2421        | 4.7593 | 514  | 0.7393          | 0.4554  | 0.7393 | 0.8598 |
| 0.2421        | 4.7778 | 516  | 0.7042          | 0.4642  | 0.7042 | 0.8392 |
| 0.2421        | 4.7963 | 518  | 0.6464          | 0.3594  | 0.6464 | 0.8040 |
| 0.2421        | 4.8148 | 520  | 0.6130          | 0.3649  | 0.6130 | 0.7830 |
| 0.2421        | 4.8333 | 522  | 0.6089          | 0.3599  | 0.6089 | 0.7803 |
| 0.2421        | 4.8519 | 524  | 0.6280          | 0.3183  | 0.6280 | 0.7924 |
| 0.2421        | 4.8704 | 526  | 0.6584          | 0.3425  | 0.6584 | 0.8114 |
| 0.2421        | 4.8889 | 528  | 0.6523          | 0.3155  | 0.6523 | 0.8077 |
| 0.2421        | 4.9074 | 530  | 0.6549          | 0.2950  | 0.6549 | 0.8092 |
| 0.2421        | 4.9259 | 532  | 0.6785          | 0.2883  | 0.6785 | 0.8237 |
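
Validation loss and Qwk fluctuate considerably between evaluations, so the last row is not necessarily the best one. As an illustration, the sketch below parses three rows copied from the table above and picks the lowest-loss and highest-Qwk rows among them. `parse_rows` is a hypothetical helper, and the same approach extends to the full table.

```python
# Three rows copied verbatim from the training results table.
ROWS = """\
| No log        | 2.7037 | 292  | 0.5186          | 0.4722  | 0.5186 | 0.7201 |
| No log        | 3.4630 | 374  | 0.5588          | 0.4845  | 0.5588 | 0.7476 |
| 0.2421        | 4.9259 | 532  | 0.6785          | 0.2883  | 0.6785 | 0.8237 |
"""


def parse_rows(text):
    """Turn Markdown table rows into dictionaries with numeric fields."""
    rows = []
    for line in text.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        train_loss, epoch, step, val_loss, qwk, mse, rmse = cells
        rows.append({
            # "No log" means no training loss had been logged yet at that step.
            "train_loss": None if train_loss == "No log" else float(train_loss),
            "epoch": float(epoch),
            "step": int(step),
            "val_loss": float(val_loss),
            "qwk": float(qwk),
            "mse": float(mse),
            "rmse": float(rmse),
        })
    return rows


rows = parse_rows(ROWS)
best_loss = min(rows, key=lambda r: r["val_loss"])
best_qwk = max(rows, key=lambda r: r["qwk"])
print("lowest validation loss:", best_loss["step"], best_loss["val_loss"])
print("highest Qwk:", best_qwk["step"], best_qwk["qwk"])
```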


### Framework versions

- Transformers 4.44.2
- PyTorch 2.4.0+cu118
- Datasets 2.21.0
- Tokenizers 0.19.1