Update README.md
Browse files
README.md
CHANGED
@@ -18,12 +18,12 @@ This document presents the evaluation results of `Llama-3.1-8B-Instruct-gptq-4bi
|
|
18 |
|
19 |
## π Evaluation Summary
|
20 |
|
21 |
-
| **Metric** | **Value** | **Description** |
|
22 |
-
|
23 |
-
| **Accuracy (acc,none)** | `47.1%` | Raw accuracy - percentage of correct answers. |
|
24 |
-
| **Standard Error (acc_stderr,none)** | `1.46%` | Uncertainty in the accuracy estimate. |
|
25 |
-
| **Normalized Accuracy (acc_norm,none)** | `49.9%` | Accuracy after dataset-specific normalization. |
|
26 |
-
| **Standard Error (acc_norm_stderr,none)** | `1.46%` | Uncertainty for normalized accuracy. |
|
27 |
|
28 |
π **Interpretation:**
|
29 |
- The model correctly answered **47.1% of the questions**.
|
|
|
18 |
|
19 |
## π Evaluation Summary
|
20 |
|
21 |
+
| **Metric** | **Value** | **Description** | **Llama-3.1-8B-Instruct** |
|
22 |
+
|----------------------|-----------|-----------------|-----------|
|
23 |
+
| **Accuracy (acc,none)** | `47.1%` | Raw accuracy - percentage of correct answers. | `53.1%` |
|
24 |
+
| **Standard Error (acc_stderr,none)** | `1.46%` | Uncertainty in the accuracy estimate. | `1.45%` |
|
25 |
+
| **Normalized Accuracy (acc_norm,none)** | `49.9%` | Accuracy after dataset-specific normalization. | `56.8%` |
|
26 |
+
| **Standard Error (acc_norm_stderr,none)** | `1.46%` | Uncertainty for normalized accuracy. | `1.45%` |
|
27 |
|
28 |
π **Interpretation:**
|
29 |
- The model correctly answered **47.1% of the questions**.
|