Daemontatox committed on
Commit 720ff8e · verified · 1 Parent(s): d7925fe

Update README.md

Files changed (1)
  1. README.md +5 -3
README.md CHANGED
@@ -26,7 +26,9 @@ datasets:
  # **Cogito-R1: An Advanced Reasoning and Chain-of-Thought Model**
 
  ## **Model Overview**
- **Cogito-R1** is a fine-tuned variant of [unsloth/qwen2.5-32b-instruct](https://huggingface.co/unsloth/qwen2.5-32b-instruct), specifically optimized for **complex reasoning, mathematical problem-solving, and chain-of-thought (CoT) inference**. Developed by **Daemontatox**, this model leverages state-of-the-art fine-tuning techniques to enhance its cognitive capabilities in structured reasoning tasks.
+ **Cogito-R1** is a fine-tuned variant of [unsloth/qwen2.5-32b-instruct](https://huggingface.co/unsloth/qwen2.5-32b-instruct),
+ specifically optimized for **complex reasoning, mathematical problem-solving, and chain-of-thought (CoT) inference**.
+ Developed by **Daemontatox**, this model leverages state-of-the-art fine-tuning techniques to enhance its cognitive capabilities in structured reasoning tasks.
 
  ### **Key Features**
  - **Efficient Fine-tuning:** Trained 2× faster using [Unsloth](https://github.com/unslothai/unsloth) and the Hugging Face TRL library.
@@ -35,7 +37,7 @@ datasets:
  -
  - **Mathematical Competency:** Performs strongly on mathematical and arithmetic tasks, rivaling and surpassing models such as **ChatGPT-o1 Mini** on specific benchmarks.
  -
- - **Compact and Efficient:** Utilizes a 4-bit quantized format, making it more efficient for deployment while retaining high-level reasoning performance.
+
 
  [![Unsloth Logo](https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png)](https://github.com/unslothai/unsloth)
 
@@ -44,7 +46,7 @@ datasets:
  ## **Technical Details**
 
  ### **Base Model**
- - **Architecture:** Qwen2.5-32B
+ - **Architecture:** Qwen2.5
  -
  - **Fine-tuning Frameworks:** [Unsloth](https://github.com/unslothai/unsloth), [Hugging Face TRL](https://huggingface.co/docs/trl)
  -