GerwinVanGiessen
/

whisper-base-nl-1

@@ -1,7 +1,7 @@
 ---
 base_model: openai/whisper-base
 datasets:
-- mozilla-foundation/common_voice_13_0
 language:
 - nl
 license: apache-2.0
@@ -16,14 +16,14 @@ model-index:
       type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
-      name: Common Voice 13.0
-      type: mozilla-foundation/common_voice_13_0
       config: nl
       split: test
       args: 'config: nl, split: test'
     metrics:
     - type: wer
-      value: 19.29784
       name: Wer
 ---
@@ -34,8 +34,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-base](https://huggingface.co/openai/whisper-base) on the Common Voice 13.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.331696
-- Wer: 19.2978
 ## Model description
@@ -59,23 +59,28 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- training_steps: 5000
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Step | Validation Loss | Wer     |
 |:-------------:|:----:|:---------------:|:-------:|
-| 0.3695        |  500 | 0.397109        | 24.2911 |
-| 0.2612        | 1000 | 0.365275        | 22.6367 |
-| 0.2067        | 1500 | 0.348754        | 21.6366 |
-| 0.1473        | 2000 | 0.336159        | 20.4868 |
-| 0.1424        | 2500 | 0.326552        | 20.1531 |
-| 0.1042        | 3000 | 0.330685        | 19.9752 |
-| 0.0872        | 3500 | 0.326190        | 19.5249 |
-| 0.0872        | 4000 | 0.322959        | 19.4213 |
-| 0.0541        | 4500 | 0.331140        | 19.3702 |
-| 0.0529        | 5000 | 0.331696        | 19.2978 |
 ### Framework versions

 ---
 base_model: openai/whisper-base
 datasets:
+- mozilla-foundation/common_voice_17_0
 language:
 - nl
 license: apache-2.0
       type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
+      name: Common Voice 17.0
+      type: mozilla-foundation/common_voice_17_0
       config: nl
       split: test
       args: 'config: nl, split: test'
     metrics:
     - type: wer
+      value: 19.0031
       name: Wer
 ---
 This model is a fine-tuned version of [openai/whisper-base](https://huggingface.co/openai/whisper-base) on the Common Voice 13.0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.343928
+- Wer: 19.003155
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- training_steps: 7500
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Step | Validation Loss | Wer     |
 |:-------------:|:----:|:---------------:|:-------:|
+| 0.3639        |  500 | 0.396971        | 24.3028 |
+| 0.2625        | 1000 | 0.358340        | 22.5210 |
+| 0.2212        | 1500 | 0.341232        | 21.0322 |
+| 0.1455        | 2000 | 0.330033        | 20.2046 |
+| 0.1406        | 2500 | 0.324484        | 20.0508 |
+| 0.1244        | 3000 | 0.321562        | 19.5279 |
+| 0.0848        | 3500 | 0.321506        | 19.5114 |
+| 0.0844        | 4000 | 0.316492        | 19.1462 |
+| 0.0731        | 4500 | 0.321992        | 19.0167 |
+| 0.0515        | 5000 | 0.324720        | 19.1492 |
+| 0.0532        | 5500 | 0.324773        | 19.0148 |
+| 0.0426        | 6000 | 0.332404        | 19.0576 |
+| 0.0328        | 6500 | 0.334900        | 18.8249 |
+| 0.0327        | 7000 | 0.335876        | 19.0080 |
+| 0.0252        | 7500 | 0.343928        | 19.0031 |
 ### Framework versions