rwmasood committed on
Commit 92de51f · verified · 1 Parent(s): 1f864fb

Update README.md

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -18,12 +18,12 @@ This document presents the evaluation results of `Llama-3.1-8B-Instruct-gptq-4bi
 
  ## 📊 Evaluation Summary
 
- | **Metric** | **Value** | **Description** |
- |----------------------|-----------|-----------------|
- | **Accuracy (acc,none)** | `47.1%` | Raw accuracy - percentage of correct answers. |
- | **Standard Error (acc_stderr,none)** | `1.46%` | Uncertainty in the accuracy estimate. |
- | **Normalized Accuracy (acc_norm,none)** | `49.9%` | Accuracy after dataset-specific normalization. |
- | **Standard Error (acc_norm_stderr,none)** | `1.46%` | Uncertainty for normalized accuracy. |
+ | **Metric** | **Value** | **Description** | **Llama-3.1-8B-Instruct** |
+ |----------------------|-----------|-----------------|-----------|
+ | **Accuracy (acc,none)** | `47.1%` | Raw accuracy - percentage of correct answers. | `53.1%` |
+ | **Standard Error (acc_stderr,none)** | `1.46%` | Uncertainty in the accuracy estimate. | `1.45%` |
+ | **Normalized Accuracy (acc_norm,none)** | `49.9%` | Accuracy after dataset-specific normalization. | `56.8%` |
+ | **Standard Error (acc_norm_stderr,none)** | `1.46%` | Uncertainty for normalized accuracy. | `1.45%` |
 
  📌 **Interpretation:**
  - The model correctly answered **47.1% of the questions**.
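
The new `Llama-3.1-8B-Instruct` column makes it possible to compare the evaluated model against its baseline. As a rough sketch only, the snippet below computes the gap in percentage points and a z-score from the reported standard errors. It assumes the two estimates are independent, which is a simplification: if both models were scored on the same questions, a paired comparison on per-question results would be more appropriate, but those results are not part of this README. The dictionary keys and the `compare` helper are illustrative names, not part of any evaluation tool.

```python
import math

# Values copied from the evaluation table, in percent: (value, standard error).
# "evaluated" is the model this README reports on; "baseline" is the
# Llama-3.1-8B-Instruct column added by this commit.
RESULTS = {
    "acc": {"evaluated": (47.1, 1.46), "baseline": (53.1, 1.45)},
    "acc_norm": {"evaluated": (49.9, 1.46), "baseline": (56.8, 1.45)},
}


def compare(evaluated, baseline):
    """Return (gap in points, combined stderr, z-score).

    Assumption: the two accuracy estimates are treated as independent,
    so their standard errors are combined in quadrature.
    """
    e_val, e_se = evaluated
    b_val, b_se = baseline
    gap = b_val - e_val
    combined_se = math.sqrt(e_se ** 2 + b_se ** 2)
    return gap, combined_se, gap / combined_se


for metric, row in RESULTS.items():
    gap, se, z = compare(row["evaluated"], row["baseline"])
    print(f"{metric}: gap = {gap:.1f} points, combined stderr = {se:.2f}, z = {z:.2f}")
```

Under this independence assumption, a gap of several combined standard errors suggests the difference is unlikely to be sampling noise alone.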