Update README.md
Browse files
README.md
CHANGED
@@ -31,7 +31,7 @@ At the time of release, FIRST demonstrates superior performance across a variety
|
|
31 |
|---------------|----------------|-------|---------------|---------|-------|-------|-----------|----------|----------|-------|----------|----------|------------|
|
32 |
| Rank Vicuna | GPT 3.5 | 50.7 | **28.2** | 50.0 | 81.0 | 35.9 | 73.5 | 36.7 | 33.1 | 58.6 | 18.4 | 70.5 | 71.3 |
|
33 |
| Rank Zephyr | GPT 3.5 + 3.5 | 53.7 | 25.6 | 50.0 | 80.1 | **42.2** | 71.6 | 42.7 | **37.7** | 65.6 | **20.5** | **76.7** | 78.4 |
|
34 |
-
| **FIRST** | GPT-4 | **54.3** | 26.7 | **50.9**| **81.7**| 42.2 | **74.2** | **44.4** | 37.4 | **66.4**| 20.4 | 74.6 | **78.8** |
|
35 |
|
36 |
|
37 |
More details can be found in the paper.
|
|
|
31 |
|---------------|----------------|-------|---------------|---------|-------|-------|-----------|----------|----------|-------|----------|----------|------------|
|
32 |
| Rank Vicuna | GPT 3.5 | 50.7 | **28.2** | 50.0 | 81.0 | 35.9 | 73.5 | 36.7 | 33.1 | 58.6 | 18.4 | 70.5 | 71.3 |
|
33 |
| Rank Zephyr | GPT 3.5 + 3.5 | 53.7 | 25.6 | 50.0 | 80.1 | **42.2** | 71.6 | 42.7 | **37.7** | 65.6 | **20.5** | **76.7** | 78.4 |
|
34 |
+
| **FIRST** | GPT-4 | **54.3** | 26.7 | **50.9**| **81.7**| **42.2** | **74.2** | **44.4** | 37.4 | **66.4**| 20.4 | 74.6 | **78.8** |
|
35 |
|
36 |
|
37 |
More details can be found in the paper.
|