grammar (#11)
Browse files- grammar (6c5837b2f3d29df2f245e280c6e06183de9b468e)
Co-authored-by: David Forman <[email protected]>
README.md
CHANGED
@@ -23,7 +23,7 @@ widget:
|
|
23 |
The Phi-3-Medium-4K-Instruct is a 14B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense properties.
|
24 |
The model belongs to the Phi-3 family with the Medium version in two variants [4K](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct) and [128K](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) which is the context length (in tokens) that it can support.
|
25 |
|
26 |
-
The model has
|
27 |
When assessed against benchmarks testing common sense, language understanding, math, code, long context and logical reasoning, Phi-3-Medium-4K-Instruct showcased a robust and state-of-the-art performance among models of the same-size and next-size-up.
|
28 |
|
29 |
Resources and Technical Documentation:
|
|
|
23 |
The Phi-3-Medium-4K-Instruct is a 14B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense properties.
|
24 |
The model belongs to the Phi-3 family with the Medium version in two variants [4K](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct) and [128K](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) which is the context length (in tokens) that it can support.
|
25 |
|
26 |
+
The model has undergone a post-training process that incorporates both supervised fine-tuning and direct preference optimization for the instruction following and safety measures.
|
27 |
When assessed against benchmarks testing common sense, language understanding, math, code, long context and logical reasoning, Phi-3-Medium-4K-Instruct showcased a robust and state-of-the-art performance among models of the same-size and next-size-up.
|
28 |
|
29 |
Resources and Technical Documentation:
|