File size: 22,099 Bytes
3f3d6fa
 
 
 
 
 
e719793
3f3d6fa
 
 
 
 
 
e719793
3f3d6fa
 
 
e719793
 
 
 
3f3d6fa
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e719793
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3f3d6fa
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
---
library_name: transformers
base_model: aubmindlab/bert-base-arabertv02
tags:
- generated_from_trainer
model-index:
- name: ArabicNewSplits7_B_usingWellWrittenEssays_FineTuningAraBERT_run999_AugV5_k20_task2_organization
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# ArabicNewSplits7_B_usingWellWrittenEssays_FineTuningAraBERT_run999_AugV5_k20_task2_organization

This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on the None dataset.
It achieves the following results on the evaluation set:
- Loss: 0.8610
- Qwk: 0.3970
- Mse: 0.8610
- Rmse: 0.9279

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 100

### Training results

| Training Loss | Epoch  | Step | Validation Loss | Qwk     | Mse    | Rmse   |
|:-------------:|:------:|:----:|:---------------:|:-------:|:------:|:------:|
| No log        | 0.0185 | 2    | 4.8061          | 0.0010  | 4.8061 | 2.1923 |
| No log        | 0.0370 | 4    | 2.6276          | 0.0051  | 2.6276 | 1.6210 |
| No log        | 0.0556 | 6    | 1.6356          | 0.0682  | 1.6356 | 1.2789 |
| No log        | 0.0741 | 8    | 1.3581          | 0.0958  | 1.3581 | 1.1654 |
| No log        | 0.0926 | 10   | 1.4611          | -0.0494 | 1.4611 | 1.2088 |
| No log        | 0.1111 | 12   | 1.4770          | -0.1091 | 1.4770 | 1.2153 |
| No log        | 0.1296 | 14   | 1.3101          | 0.0847  | 1.3101 | 1.1446 |
| No log        | 0.1481 | 16   | 1.3902          | 0.0253  | 1.3902 | 1.1791 |
| No log        | 0.1667 | 18   | 1.4876          | 0.1288  | 1.4876 | 1.2197 |
| No log        | 0.1852 | 20   | 1.3910          | 0.1507  | 1.3910 | 1.1794 |
| No log        | 0.2037 | 22   | 1.2287          | 0.0788  | 1.2287 | 1.1085 |
| No log        | 0.2222 | 24   | 1.1899          | 0.1043  | 1.1899 | 1.0908 |
| No log        | 0.2407 | 26   | 1.1645          | 0.1443  | 1.1645 | 1.0791 |
| No log        | 0.2593 | 28   | 1.1629          | 0.1344  | 1.1629 | 1.0784 |
| No log        | 0.2778 | 30   | 1.1827          | 0.1344  | 1.1827 | 1.0875 |
| No log        | 0.2963 | 32   | 1.2156          | 0.0977  | 1.2156 | 1.1025 |
| No log        | 0.3148 | 34   | 1.2314          | 0.1232  | 1.2314 | 1.1097 |
| No log        | 0.3333 | 36   | 1.4842          | 0.0537  | 1.4842 | 1.2183 |
| No log        | 0.3519 | 38   | 1.6229          | 0.1032  | 1.6229 | 1.2739 |
| No log        | 0.3704 | 40   | 1.3818          | 0.1530  | 1.3818 | 1.1755 |
| No log        | 0.3889 | 42   | 1.2727          | 0.2446  | 1.2727 | 1.1281 |
| No log        | 0.4074 | 44   | 1.1019          | 0.2168  | 1.1019 | 1.0497 |
| No log        | 0.4259 | 46   | 1.0527          | 0.3066  | 1.0527 | 1.0260 |
| No log        | 0.4444 | 48   | 0.9995          | 0.3695  | 0.9995 | 0.9997 |
| No log        | 0.4630 | 50   | 0.9803          | 0.3596  | 0.9803 | 0.9901 |
| No log        | 0.4815 | 52   | 0.9745          | 0.3346  | 0.9745 | 0.9872 |
| No log        | 0.5    | 54   | 0.9892          | 0.3154  | 0.9892 | 0.9946 |
| No log        | 0.5185 | 56   | 0.9996          | 0.4318  | 0.9996 | 0.9998 |
| No log        | 0.5370 | 58   | 1.0600          | 0.2883  | 1.0600 | 1.0296 |
| No log        | 0.5556 | 60   | 1.0838          | 0.2877  | 1.0838 | 1.0411 |
| No log        | 0.5741 | 62   | 1.0717          | 0.3430  | 1.0717 | 1.0352 |
| No log        | 0.5926 | 64   | 1.0834          | 0.2431  | 1.0834 | 1.0409 |
| No log        | 0.6111 | 66   | 1.0516          | 0.2709  | 1.0516 | 1.0255 |
| No log        | 0.6296 | 68   | 1.0386          | 0.2871  | 1.0386 | 1.0191 |
| No log        | 0.6481 | 70   | 1.0151          | 0.3294  | 1.0151 | 1.0075 |
| No log        | 0.6667 | 72   | 1.0660          | 0.2938  | 1.0660 | 1.0325 |
| No log        | 0.6852 | 74   | 1.2035          | 0.4045  | 1.2035 | 1.0970 |
| No log        | 0.7037 | 76   | 1.2189          | 0.4033  | 1.2189 | 1.1040 |
| No log        | 0.7222 | 78   | 1.0613          | 0.4005  | 1.0613 | 1.0302 |
| No log        | 0.7407 | 80   | 0.9840          | 0.3457  | 0.9840 | 0.9920 |
| No log        | 0.7593 | 82   | 0.9527          | 0.4260  | 0.9527 | 0.9761 |
| No log        | 0.7778 | 84   | 0.9423          | 0.4260  | 0.9423 | 0.9707 |
| No log        | 0.7963 | 86   | 0.9502          | 0.3814  | 0.9502 | 0.9748 |
| No log        | 0.8148 | 88   | 0.9627          | 0.3798  | 0.9627 | 0.9812 |
| No log        | 0.8333 | 90   | 0.9732          | 0.3798  | 0.9732 | 0.9865 |
| No log        | 0.8519 | 92   | 0.9781          | 0.3699  | 0.9781 | 0.9890 |
| No log        | 0.8704 | 94   | 0.9746          | 0.3559  | 0.9746 | 0.9872 |
| No log        | 0.8889 | 96   | 0.9998          | 0.3338  | 0.9998 | 0.9999 |
| No log        | 0.9074 | 98   | 1.0160          | 0.2891  | 1.0160 | 1.0080 |
| No log        | 0.9259 | 100  | 1.0355          | 0.2672  | 1.0355 | 1.0176 |
| No log        | 0.9444 | 102  | 1.0981          | 0.2482  | 1.0981 | 1.0479 |
| No log        | 0.9630 | 104  | 1.0951          | 0.2750  | 1.0951 | 1.0465 |
| No log        | 0.9815 | 106  | 1.0438          | 0.3173  | 1.0438 | 1.0217 |
| No log        | 1.0    | 108  | 1.0207          | 0.2796  | 1.0207 | 1.0103 |
| No log        | 1.0185 | 110  | 0.9698          | 0.3554  | 0.9698 | 0.9848 |
| No log        | 1.0370 | 112  | 0.9688          | 0.3351  | 0.9688 | 0.9843 |
| No log        | 1.0556 | 114  | 0.9859          | 0.3725  | 0.9859 | 0.9929 |
| No log        | 1.0741 | 116  | 0.9732          | 0.3303  | 0.9732 | 0.9865 |
| No log        | 1.0926 | 118  | 1.0109          | 0.3427  | 1.0109 | 1.0054 |
| No log        | 1.1111 | 120  | 1.0989          | 0.2203  | 1.0989 | 1.0483 |
| No log        | 1.1296 | 122  | 1.0715          | 0.2721  | 1.0715 | 1.0351 |
| No log        | 1.1481 | 124  | 0.9905          | 0.3276  | 0.9905 | 0.9952 |
| No log        | 1.1667 | 126  | 0.9455          | 0.3650  | 0.9455 | 0.9724 |
| No log        | 1.1852 | 128  | 0.9577          | 0.4736  | 0.9577 | 0.9786 |
| No log        | 1.2037 | 130  | 1.0176          | 0.3518  | 1.0176 | 1.0088 |
| No log        | 1.2222 | 132  | 0.9782          | 0.3725  | 0.9782 | 0.9890 |
| No log        | 1.2407 | 134  | 0.9128          | 0.4527  | 0.9128 | 0.9554 |
| No log        | 1.2593 | 136  | 0.8783          | 0.4197  | 0.8783 | 0.9372 |
| No log        | 1.2778 | 138  | 0.8656          | 0.4197  | 0.8656 | 0.9304 |
| No log        | 1.2963 | 140  | 0.9447          | 0.4631  | 0.9447 | 0.9720 |
| No log        | 1.3148 | 142  | 1.0511          | 0.3807  | 1.0511 | 1.0252 |
| No log        | 1.3333 | 144  | 0.9450          | 0.4565  | 0.9450 | 0.9721 |
| No log        | 1.3519 | 146  | 0.8753          | 0.4916  | 0.8753 | 0.9356 |
| No log        | 1.3704 | 148  | 0.8913          | 0.3965  | 0.8913 | 0.9441 |
| No log        | 1.3889 | 150  | 0.9184          | 0.4789  | 0.9184 | 0.9583 |
| No log        | 1.4074 | 152  | 0.9299          | 0.4454  | 0.9299 | 0.9643 |
| No log        | 1.4259 | 154  | 0.9219          | 0.4628  | 0.9219 | 0.9601 |
| No log        | 1.4444 | 156  | 0.9130          | 0.3814  | 0.9130 | 0.9555 |
| No log        | 1.4630 | 158  | 0.9167          | 0.4578  | 0.9167 | 0.9574 |
| No log        | 1.4815 | 160  | 0.9134          | 0.3382  | 0.9134 | 0.9557 |
| No log        | 1.5    | 162  | 0.9653          | 0.4074  | 0.9653 | 0.9825 |
| No log        | 1.5185 | 164  | 0.9814          | 0.3908  | 0.9814 | 0.9907 |
| No log        | 1.5370 | 166  | 0.9420          | 0.4074  | 0.9420 | 0.9706 |
| No log        | 1.5556 | 168  | 0.8930          | 0.4294  | 0.8930 | 0.9450 |
| No log        | 1.5741 | 170  | 0.8894          | 0.4661  | 0.8894 | 0.9431 |
| No log        | 1.5926 | 172  | 0.8838          | 0.4661  | 0.8838 | 0.9401 |
| No log        | 1.6111 | 174  | 0.8736          | 0.4004  | 0.8736 | 0.9347 |
| No log        | 1.6296 | 176  | 0.8568          | 0.4429  | 0.8568 | 0.9256 |
| No log        | 1.6481 | 178  | 0.8741          | 0.3991  | 0.8741 | 0.9349 |
| No log        | 1.6667 | 180  | 0.8583          | 0.3920  | 0.8583 | 0.9264 |
| No log        | 1.6852 | 182  | 0.8547          | 0.3920  | 0.8547 | 0.9245 |
| No log        | 1.7037 | 184  | 0.8589          | 0.3780  | 0.8589 | 0.9268 |
| No log        | 1.7222 | 186  | 0.8637          | 0.4197  | 0.8637 | 0.9293 |
| No log        | 1.7407 | 188  | 0.8782          | 0.4334  | 0.8782 | 0.9371 |
| No log        | 1.7593 | 190  | 0.8765          | 0.3627  | 0.8765 | 0.9362 |
| No log        | 1.7778 | 192  | 0.8782          | 0.3648  | 0.8782 | 0.9371 |
| No log        | 1.7963 | 194  | 0.8901          | 0.3648  | 0.8901 | 0.9434 |
| No log        | 1.8148 | 196  | 0.9284          | 0.3988  | 0.9284 | 0.9635 |
| No log        | 1.8333 | 198  | 0.8939          | 0.4093  | 0.8939 | 0.9455 |
| No log        | 1.8519 | 200  | 0.9117          | 0.3951  | 0.9117 | 0.9548 |
| No log        | 1.8704 | 202  | 0.9536          | 0.3988  | 0.9536 | 0.9765 |
| No log        | 1.8889 | 204  | 0.9097          | 0.4337  | 0.9097 | 0.9538 |
| No log        | 1.9074 | 206  | 0.9028          | 0.4337  | 0.9028 | 0.9502 |
| No log        | 1.9259 | 208  | 0.9348          | 0.4550  | 0.9348 | 0.9668 |
| No log        | 1.9444 | 210  | 0.9483          | 0.5163  | 0.9483 | 0.9738 |
| No log        | 1.9630 | 212  | 0.8748          | 0.4730  | 0.8748 | 0.9353 |
| No log        | 1.9815 | 214  | 0.8462          | 0.5024  | 0.8462 | 0.9199 |
| No log        | 2.0    | 216  | 0.8723          | 0.4563  | 0.8723 | 0.9340 |
| No log        | 2.0185 | 218  | 1.0110          | 0.4153  | 1.0110 | 1.0055 |
| No log        | 2.0370 | 220  | 1.0326          | 0.4214  | 1.0326 | 1.0161 |
| No log        | 2.0556 | 222  | 0.8998          | 0.4476  | 0.8998 | 0.9486 |
| No log        | 2.0741 | 224  | 0.8997          | 0.4144  | 0.8997 | 0.9485 |
| No log        | 2.0926 | 226  | 0.8919          | 0.3819  | 0.8919 | 0.9444 |
| No log        | 2.1111 | 228  | 0.8765          | 0.4563  | 0.8765 | 0.9362 |
| No log        | 2.1296 | 230  | 0.8990          | 0.4507  | 0.8990 | 0.9481 |
| No log        | 2.1481 | 232  | 0.8755          | 0.4841  | 0.8755 | 0.9357 |
| No log        | 2.1667 | 234  | 0.8642          | 0.4334  | 0.8642 | 0.9296 |
| No log        | 2.1852 | 236  | 0.8493          | 0.5216  | 0.8493 | 0.9216 |
| No log        | 2.2037 | 238  | 0.8464          | 0.4962  | 0.8464 | 0.9200 |
| No log        | 2.2222 | 240  | 0.8951          | 0.4848  | 0.8951 | 0.9461 |
| No log        | 2.2407 | 242  | 0.9781          | 0.4059  | 0.9781 | 0.9890 |
| No log        | 2.2593 | 244  | 1.0199          | 0.4056  | 1.0199 | 1.0099 |
| No log        | 2.2778 | 246  | 0.9495          | 0.3348  | 0.9495 | 0.9744 |
| No log        | 2.2963 | 248  | 0.9076          | 0.3992  | 0.9076 | 0.9527 |
| No log        | 2.3148 | 250  | 0.9068          | 0.4094  | 0.9068 | 0.9523 |
| No log        | 2.3333 | 252  | 0.9247          | 0.3992  | 0.9247 | 0.9616 |
| No log        | 2.3519 | 254  | 0.9014          | 0.3956  | 0.9014 | 0.9494 |
| No log        | 2.3704 | 256  | 0.9229          | 0.4136  | 0.9229 | 0.9607 |
| No log        | 2.3889 | 258  | 1.0185          | 0.4516  | 1.0185 | 1.0092 |
| No log        | 2.4074 | 260  | 0.9443          | 0.4991  | 0.9443 | 0.9717 |
| No log        | 2.4259 | 262  | 0.8616          | 0.3983  | 0.8616 | 0.9282 |
| No log        | 2.4444 | 264  | 0.8613          | 0.4757  | 0.8613 | 0.9280 |
| No log        | 2.4630 | 266  | 0.8595          | 0.4158  | 0.8595 | 0.9271 |
| No log        | 2.4815 | 268  | 0.9163          | 0.4763  | 0.9163 | 0.9572 |
| No log        | 2.5    | 270  | 0.9032          | 0.4861  | 0.9032 | 0.9504 |
| No log        | 2.5185 | 272  | 0.8801          | 0.4337  | 0.8801 | 0.9381 |
| No log        | 2.5370 | 274  | 0.8654          | 0.3596  | 0.8654 | 0.9302 |
| No log        | 2.5556 | 276  | 0.8752          | 0.4548  | 0.8752 | 0.9355 |
| No log        | 2.5741 | 278  | 0.8582          | 0.3483  | 0.8582 | 0.9264 |
| No log        | 2.5926 | 280  | 0.8549          | 0.4548  | 0.8549 | 0.9246 |
| No log        | 2.6111 | 282  | 0.8602          | 0.4646  | 0.8602 | 0.9275 |
| No log        | 2.6296 | 284  | 0.8500          | 0.3914  | 0.8500 | 0.9219 |
| No log        | 2.6481 | 286  | 0.8549          | 0.4056  | 0.8549 | 0.9246 |
| No log        | 2.6667 | 288  | 0.8686          | 0.4337  | 0.8686 | 0.9320 |
| No log        | 2.6852 | 290  | 0.8592          | 0.4297  | 0.8592 | 0.9269 |
| No log        | 2.7037 | 292  | 0.8533          | 0.4450  | 0.8533 | 0.9238 |
| No log        | 2.7222 | 294  | 0.8750          | 0.3943  | 0.8750 | 0.9354 |
| No log        | 2.7407 | 296  | 0.8459          | 0.4219  | 0.8459 | 0.9197 |
| No log        | 2.7593 | 298  | 0.8281          | 0.5042  | 0.8281 | 0.9100 |
| No log        | 2.7778 | 300  | 0.8731          | 0.3590  | 0.8731 | 0.9344 |
| No log        | 2.7963 | 302  | 0.8381          | 0.3946  | 0.8381 | 0.9155 |
| No log        | 2.8148 | 304  | 0.8299          | 0.4157  | 0.8299 | 0.9110 |
| No log        | 2.8333 | 306  | 0.8495          | 0.4470  | 0.8495 | 0.9217 |
| No log        | 2.8519 | 308  | 0.8499          | 0.4898  | 0.8499 | 0.9219 |
| No log        | 2.8704 | 310  | 0.8255          | 0.4012  | 0.8255 | 0.9086 |
| No log        | 2.8889 | 312  | 0.8458          | 0.3946  | 0.8458 | 0.9197 |
| No log        | 2.9074 | 314  | 0.8425          | 0.3951  | 0.8425 | 0.9179 |
| No log        | 2.9259 | 316  | 0.8074          | 0.3728  | 0.8074 | 0.8985 |
| No log        | 2.9444 | 318  | 0.8000          | 0.3583  | 0.8000 | 0.8944 |
| No log        | 2.9630 | 320  | 0.8083          | 0.4916  | 0.8083 | 0.8990 |
| No log        | 2.9815 | 322  | 0.8199          | 0.4998  | 0.8199 | 0.9055 |
| No log        | 3.0    | 324  | 0.7871          | 0.3787  | 0.7871 | 0.8872 |
| No log        | 3.0185 | 326  | 0.7799          | 0.4075  | 0.7799 | 0.8831 |
| No log        | 3.0370 | 328  | 0.7763          | 0.4075  | 0.7763 | 0.8811 |
| No log        | 3.0556 | 330  | 0.7751          | 0.4280  | 0.7751 | 0.8804 |
| No log        | 3.0741 | 332  | 0.7860          | 0.4611  | 0.7860 | 0.8866 |
| No log        | 3.0926 | 334  | 0.7832          | 0.4656  | 0.7832 | 0.8850 |
| No log        | 3.1111 | 336  | 0.8004          | 0.4075  | 0.8004 | 0.8946 |
| No log        | 3.1296 | 338  | 0.8624          | 0.3660  | 0.8624 | 0.9287 |
| No log        | 3.1481 | 340  | 0.8872          | 0.3866  | 0.8872 | 0.9419 |
| No log        | 3.1667 | 342  | 0.8758          | 0.3168  | 0.8758 | 0.9358 |
| No log        | 3.1852 | 344  | 0.8449          | 0.3437  | 0.8449 | 0.9192 |
| No log        | 3.2037 | 346  | 0.8313          | 0.3719  | 0.8313 | 0.9118 |
| No log        | 3.2222 | 348  | 0.8613          | 0.3946  | 0.8613 | 0.9281 |
| No log        | 3.2407 | 350  | 0.8908          | 0.3946  | 0.8908 | 0.9438 |
| No log        | 3.2593 | 352  | 0.8884          | 0.3356  | 0.8884 | 0.9426 |
| No log        | 3.2778 | 354  | 0.8856          | 0.3020  | 0.8856 | 0.9411 |
| No log        | 3.2963 | 356  | 0.8813          | 0.3229  | 0.8813 | 0.9388 |
| No log        | 3.3148 | 358  | 0.8314          | 0.3596  | 0.8314 | 0.9118 |
| No log        | 3.3333 | 360  | 0.7783          | 0.4466  | 0.7783 | 0.8822 |
| No log        | 3.3519 | 362  | 0.7897          | 0.4198  | 0.7897 | 0.8886 |
| No log        | 3.3704 | 364  | 0.7770          | 0.4587  | 0.7770 | 0.8815 |
| No log        | 3.3889 | 366  | 0.7246          | 0.4942  | 0.7246 | 0.8512 |
| No log        | 3.4074 | 368  | 0.7843          | 0.5567  | 0.7843 | 0.8856 |
| No log        | 3.4259 | 370  | 0.7833          | 0.5368  | 0.7833 | 0.8850 |
| No log        | 3.4444 | 372  | 0.7477          | 0.3933  | 0.7477 | 0.8647 |
| No log        | 3.4630 | 374  | 0.7421          | 0.4853  | 0.7421 | 0.8614 |
| No log        | 3.4815 | 376  | 0.7470          | 0.4853  | 0.7470 | 0.8643 |
| No log        | 3.5    | 378  | 0.7697          | 0.3933  | 0.7697 | 0.8773 |
| No log        | 3.5185 | 380  | 0.8245          | 0.3045  | 0.8245 | 0.9080 |
| No log        | 3.5370 | 382  | 0.8643          | 0.3519  | 0.8643 | 0.9297 |
| No log        | 3.5556 | 384  | 0.8671          | 0.4503  | 0.8671 | 0.9312 |
| No log        | 3.5741 | 386  | 0.8494          | 0.3147  | 0.8494 | 0.9216 |
| No log        | 3.5926 | 388  | 0.8145          | 0.4075  | 0.8145 | 0.9025 |
| No log        | 3.6111 | 390  | 0.8096          | 0.4054  | 0.8096 | 0.8998 |
| No log        | 3.6296 | 392  | 0.7907          | 0.3627  | 0.7907 | 0.8892 |
| No log        | 3.6481 | 394  | 0.8544          | 0.4949  | 0.8544 | 0.9243 |
| No log        | 3.6667 | 396  | 0.9670          | 0.4186  | 0.9670 | 0.9834 |
| No log        | 3.6852 | 398  | 0.9581          | 0.4186  | 0.9581 | 0.9788 |
| No log        | 3.7037 | 400  | 0.8559          | 0.3298  | 0.8559 | 0.9252 |
| No log        | 3.7222 | 402  | 0.8586          | 0.4483  | 0.8586 | 0.9266 |
| No log        | 3.7407 | 404  | 0.8696          | 0.4489  | 0.8696 | 0.9325 |
| No log        | 3.7593 | 406  | 0.8190          | 0.3951  | 0.8190 | 0.9050 |
| No log        | 3.7778 | 408  | 0.7880          | 0.3938  | 0.7880 | 0.8877 |
| No log        | 3.7963 | 410  | 0.8012          | 0.5467  | 0.8012 | 0.8951 |
| No log        | 3.8148 | 412  | 0.7806          | 0.5476  | 0.7806 | 0.8835 |
| No log        | 3.8333 | 414  | 0.7562          | 0.4019  | 0.7562 | 0.8696 |
| No log        | 3.8519 | 416  | 0.7573          | 0.4471  | 0.7573 | 0.8703 |
| No log        | 3.8704 | 418  | 0.7520          | 0.5057  | 0.7520 | 0.8672 |
| No log        | 3.8889 | 420  | 0.7460          | 0.5770  | 0.7460 | 0.8637 |
| No log        | 3.9074 | 422  | 0.7538          | 0.5450  | 0.7538 | 0.8682 |
| No log        | 3.9259 | 424  | 0.7739          | 0.3909  | 0.7739 | 0.8797 |
| No log        | 3.9444 | 426  | 0.8882          | 0.4594  | 0.8882 | 0.9424 |
| No log        | 3.9630 | 428  | 0.9200          | 0.4594  | 0.9200 | 0.9592 |
| No log        | 3.9815 | 430  | 0.8186          | 0.4315  | 0.8186 | 0.9048 |
| No log        | 4.0    | 432  | 0.6914          | 0.6059  | 0.6914 | 0.8315 |
| No log        | 4.0185 | 434  | 0.7329          | 0.6079  | 0.7329 | 0.8561 |
| No log        | 4.0370 | 436  | 0.7654          | 0.6079  | 0.7654 | 0.8749 |
| No log        | 4.0556 | 438  | 0.7051          | 0.5951  | 0.7051 | 0.8397 |
| No log        | 4.0741 | 440  | 0.7309          | 0.5503  | 0.7309 | 0.8549 |
| No log        | 4.0926 | 442  | 0.8199          | 0.5578  | 0.8199 | 0.9055 |
| No log        | 4.1111 | 444  | 0.8140          | 0.5578  | 0.8140 | 0.9022 |
| No log        | 4.1296 | 446  | 0.7557          | 0.5089  | 0.7557 | 0.8693 |
| No log        | 4.1481 | 448  | 0.7437          | 0.5125  | 0.7437 | 0.8624 |
| No log        | 4.1667 | 450  | 0.7631          | 0.5044  | 0.7631 | 0.8735 |
| No log        | 4.1852 | 452  | 0.7899          | 0.4792  | 0.7899 | 0.8888 |
| No log        | 4.2037 | 454  | 0.8066          | 0.4874  | 0.8066 | 0.8981 |
| No log        | 4.2222 | 456  | 0.8319          | 0.4197  | 0.8319 | 0.9121 |
| No log        | 4.2407 | 458  | 0.9779          | 0.3815  | 0.9779 | 0.9889 |
| No log        | 4.2593 | 460  | 1.0743          | 0.4040  | 1.0743 | 1.0365 |
| No log        | 4.2778 | 462  | 0.9684          | 0.4356  | 0.9684 | 0.9841 |
| No log        | 4.2963 | 464  | 0.8000          | 0.4197  | 0.8000 | 0.8944 |
| No log        | 4.3148 | 466  | 0.7748          | 0.4977  | 0.7748 | 0.8802 |
| No log        | 4.3333 | 468  | 0.7874          | 0.4715  | 0.7874 | 0.8874 |
| No log        | 4.3519 | 470  | 0.8109          | 0.3627  | 0.8109 | 0.9005 |
| No log        | 4.3704 | 472  | 0.8439          | 0.3771  | 0.8439 | 0.9187 |
| No log        | 4.3889 | 474  | 0.8567          | 0.3660  | 0.8567 | 0.9256 |
| No log        | 4.4074 | 476  | 0.8428          | 0.3483  | 0.8428 | 0.9180 |
| No log        | 4.4259 | 478  | 0.8335          | 0.3483  | 0.8335 | 0.9130 |
| No log        | 4.4444 | 480  | 0.8268          | 0.3483  | 0.8268 | 0.9093 |
| No log        | 4.4630 | 482  | 0.8385          | 0.3771  | 0.8385 | 0.9157 |
| No log        | 4.4815 | 484  | 0.8638          | 0.3806  | 0.8638 | 0.9294 |
| No log        | 4.5    | 486  | 0.8727          | 0.3513  | 0.8727 | 0.9342 |
| No log        | 4.5185 | 488  | 0.8904          | 0.3196  | 0.8904 | 0.9436 |
| No log        | 4.5370 | 490  | 0.9123          | 0.2470  | 0.9123 | 0.9551 |
| No log        | 4.5556 | 492  | 0.9144          | 0.2821  | 0.9144 | 0.9562 |
| No log        | 4.5741 | 494  | 0.8611          | 0.3744  | 0.8611 | 0.9279 |
| No log        | 4.5926 | 496  | 0.8303          | 0.4197  | 0.8303 | 0.9112 |
| No log        | 4.6111 | 498  | 0.8320          | 0.4337  | 0.8320 | 0.9122 |
| 0.2735        | 4.6296 | 500  | 0.8089          | 0.4197  | 0.8089 | 0.8994 |
| 0.2735        | 4.6481 | 502  | 0.8035          | 0.3879  | 0.8035 | 0.8964 |
| 0.2735        | 4.6667 | 504  | 0.8206          | 0.4912  | 0.8206 | 0.9059 |
| 0.2735        | 4.6852 | 506  | 0.8262          | 0.3583  | 0.8262 | 0.9090 |
| 0.2735        | 4.7037 | 508  | 0.8333          | 0.3974  | 0.8333 | 0.9129 |
| 0.2735        | 4.7222 | 510  | 0.8620          | 0.4012  | 0.8620 | 0.9284 |
| 0.2735        | 4.7407 | 512  | 0.8742          | 0.4012  | 0.8742 | 0.9350 |
| 0.2735        | 4.7593 | 514  | 0.8610          | 0.3970  | 0.8610 | 0.9279 |


### Framework versions

- Transformers 4.44.2
- Pytorch 2.4.0+cu118
- Datasets 2.21.0
- Tokenizers 0.19.1