Update README.md
Browse files
README.md
CHANGED
@@ -73,21 +73,20 @@ wer_metric = load("wer")
|
|
73 |
| finetune (this model) | 94.837 | 1.429 | 2.951 | 67.659 | 7.951 | 18.313 |
|
74 |
|
75 |
|
76 |
-
|
|
|
|
|
77 |
|
|
|
78 |
|
79 |
-
|
80 |
-
|
81 |
-
ZerothDataset | CER after finetuning: 0.014290201005025125
|
82 |
-
|
83 |
-
ZerothDataset | WER after finetuning: 0.02951362746574311
|
84 |
-
|
85 |
-
FleursDataset | BLEU Score after finetuning: 67.6597801681411
|
86 |
-
|
87 |
-
FleursDataset | CER after finetuning: 0.07951429690560126
|
88 |
-
|
89 |
-
FleursDataset | WER after finetuning: 0.183130832893115
|
90 |
|
|
|
|
|
|
|
|
|
|
|
|
|
91 |
|
92 |
|
93 |
|
|
|
73 |
| finetune (this model) | 94.837 | 1.429 | 2.951 | 67.659 | 7.951 | 18.313 |
|
74 |
|
75 |
|
76 |
+
Evaluation was done on the following datasets:
|
77 |
+
- ASR (Automatic Speech Recognition): Evaluated with CER (Character Error Rate) on zeroth-test set (457 samples).
|
78 |
+
- AST (Automatic Speech Translation): Evaluated with BLEU score on fleurs ko <-> en speech translation result (270 samples).
|
79 |
|
80 |
+
Script is retrieved from [here](https://gist.github.com/seastar105/d1d8983b27611370528e3b194dcc5577#file-evaluate-py).
|
81 |
|
82 |
+
Compared to [Phi-4-mm-inst-zeroth-kor](https://huggingface.co/seastar105/Phi-4-mm-inst-zeroth-kor) and [Phi-4-multimodal-finetune-ko-speech](https://huggingface.co/daekeun-ml/Phi-4-multimodal-finetune-ko-speech), ASR is significantly improved.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
83 |
|
84 |
+
| Model | zeroth-test | fleurs-ko2en | fleurs-ko2en-cot | fleurs-en2ko | fleurs-en2ko-cot |
|
85 |
+
|----------------------|-------------|--------------|------------------|--------------|------------------|
|
86 |
+
| original | 198.32 | 5.63 | 2.42 | 6.86 | 4.17 |
|
87 |
+
| finetune (this model)| 1.31 | 7.46 | 6.24 | 12.15 | 8.91 |
|
88 |
+
| daekeun-ml/Phi-4-multimodal-finetune-ko-speech| 3.80 | 7.03 | 7.04 | 12.50 | 9.54 |
|
89 |
+
| seastar105/Phi-4-mm-inst-zeroth-kor | 7.02 | 7.07 | 9.19 | 13.08 | 9.35 |
|
90 |
|
91 |
|
92 |
|