Update README.md
Browse files
README.md
CHANGED
@@ -29,7 +29,7 @@ Deep-Reason-SMALL-V0 is a fine-tuned version of llama-3.2-3b-instruct, designed
|
|
29 |
|
30 |
Built with Unsloth and Hugging Face’s TRL, this model is optimized for faster inference and superior logical performance.
|
31 |
|
32 |
-
The model is available in GGUF format and has been quantized to different levels to support various hardware configurations.
|
33 |
|
34 |
**Model Details**
|
35 |
- Base Model: LLaMA-3 3B
|
@@ -40,6 +40,7 @@ The model is available in GGUF format and has been quantized to different levels
|
|
40 |
- q4_k_m
|
41 |
- q5_k_m
|
42 |
- q8_0
|
|
|
43 |
|
44 |
**Key Features**
|
45 |
- Enhanced Reasoning: Fine-tuned using GRPO to improve problem-solving and structured thought processes.
|
|
|
29 |
|
30 |
Built with Unsloth and Hugging Face’s TRL, this model is optimized for faster inference and superior logical performance.
|
31 |
|
32 |
+
The model is available in GGUF and 16 Bit format and has been quantized to different levels to support various hardware configurations.
|
33 |
|
34 |
**Model Details**
|
35 |
- Base Model: LLaMA-3 3B
|
|
|
40 |
- q4_k_m
|
41 |
- q5_k_m
|
42 |
- q8_0
|
43 |
+
- 16 Bit (https://huggingface.co/alphaaico/Deep-Reason-SMALL-V0)
|
44 |
|
45 |
**Key Features**
|
46 |
- Enhanced Reasoning: Fine-tuned using GRPO to improve problem-solving and structured thought processes.
|