Update README.md
Browse files
README.md
CHANGED
@@ -14,6 +14,7 @@ datasets:
|
|
14 |
metrics:
|
15 |
- bleu
|
16 |
- wer
|
|
|
17 |
model-index:
|
18 |
- name: Whisper Small GA-EN Speech Translation
|
19 |
results:
|
@@ -30,6 +31,7 @@ model-index:
|
|
30 |
- name: Wer
|
31 |
type: wer
|
32 |
value: 73.52543899144528
|
|
|
33 |
---
|
34 |
|
35 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -37,12 +39,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
37 |
|
38 |
# Whisper Small GA-EN Speech Translation
|
39 |
|
40 |
-
This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the IWSLT-2023, FLEURS, BiteSize, SpokenWords
|
41 |
-
|
42 |
-
- Loss: 1.
|
43 |
-
- Bleu:
|
44 |
-
- Chrf:
|
45 |
-
- Wer:
|
46 |
|
47 |
## Model description
|
48 |
|
@@ -58,6 +60,11 @@ More information needed
|
|
58 |
|
59 |
## Training procedure
|
60 |
|
|
|
|
|
|
|
|
|
|
|
61 |
### Training hyperparameters
|
62 |
|
63 |
The following hyperparameters were used during training:
|
@@ -110,11 +117,11 @@ The following hyperparameters were used during training:
|
|
110 |
| 0.032 | 3.56 | 3300 | 29.81 | 46.5 | 1.5823 | 66.7267 |
|
111 |
| 0.0348 | 3.67 | 3400 | 30.77 | 46.43 | 1.5752 | 64.6556 |
|
112 |
| 0.0277 | 3.78 | 3500 | 30.3 | 46.02 | 1.5791 | 64.6105 |
|
113 |
-
| 0.0364 | 3.88 | 3600 |
|
114 |
-
| 0.0398 | 3.99 | 3700 |
|
115 |
-
| 0.0152 | 4.1 | 3800 |
|
116 |
-
| 0.0201 | 4.21 | 3900 |
|
117 |
-
| 0.0168 | 4.31 | 4000 |
|
118 |
|
119 |
|
120 |
### Framework versions
|
@@ -122,4 +129,4 @@ The following hyperparameters were used during training:
|
|
122 |
- Transformers 4.39.3
|
123 |
- Pytorch 2.2.1+cu121
|
124 |
- Datasets 2.18.0
|
125 |
-
- Tokenizers 0.15.2
|
|
|
14 |
metrics:
|
15 |
- bleu
|
16 |
- wer
|
17 |
+
- chrf
|
18 |
model-index:
|
19 |
- name: Whisper Small GA-EN Speech Translation
|
20 |
results:
|
|
|
31 |
- name: Wer
|
32 |
type: wer
|
33 |
value: 73.52543899144528
|
34 |
+
library_name: transformers
|
35 |
---
|
36 |
|
37 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
39 |
|
40 |
# Whisper Small GA-EN Speech Translation
|
41 |
|
42 |
+
This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the IWSLT-2023, FLEURS, BiteSize, and SpokenWords datasets.
|
43 |
+
The best model (this version) is at checkpoint 3400, epoch 3.67, and it achieves the following results on the evaluation set:
|
44 |
+
- Loss: 1.5752
|
45 |
+
- Bleu: 30.77
|
46 |
+
- Chrf: 46.43
|
47 |
+
- Wer: 64.6556
|
48 |
|
49 |
## Model description
|
50 |
|
|
|
60 |
|
61 |
## Training procedure
|
62 |
|
63 |
+
### Experiment
|
64 |
+
|
65 |
+
- language=English
|
66 |
+
- +more steps
|
67 |
+
|
68 |
### Training hyperparameters
|
69 |
|
70 |
The following hyperparameters were used during training:
|
|
|
117 |
| 0.032 | 3.56 | 3300 | 29.81 | 46.5 | 1.5823 | 66.7267 |
|
118 |
| 0.0348 | 3.67 | 3400 | 30.77 | 46.43 | 1.5752 | 64.6556 |
|
119 |
| 0.0277 | 3.78 | 3500 | 30.3 | 46.02 | 1.5791 | 64.6105 |
|
120 |
+
| 0.0364 | 3.88 | 3600 | 29.92 | 45.38 | 1.5895 | 65.0608 |
|
121 |
+
| 0.0398 | 3.99 | 3700 | 27.79 | 44.59 | 1.6167 | 68.2575 |
|
122 |
+
| 0.0152 | 4.1 | 3800 | 28.42 | 44.83 | 1.6241 | 67.5822 |
|
123 |
+
| 0.0201 | 4.21 | 3900 | 29.02 | 45.11 | 1.6243 | 67.4921 |
|
124 |
+
| 0.0168 | 4.31 | 4000 | 26.85 | 44.41 | 1.6195 | 73.5254 |
|
125 |
|
126 |
|
127 |
### Framework versions
|
|
|
129 |
- Transformers 4.39.3
|
130 |
- Pytorch 2.2.1+cu121
|
131 |
- Datasets 2.18.0
|
132 |
+
- Tokenizers 0.15.2
|