Pinkstack committed on
Commit
44333e4
·
verified ·
1 Parent(s): 578313c

Update README.md

Files changed (1)
  1. README.md +5 -2
README.md CHANGED
@@ -22,9 +22,11 @@ Demo: https://huggingface.co/spaces/Pinkstack/Chat-with-superthoughts-lite
22
  ![superthoughts lite](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/K5kYIHYj2aX2kB6MlcM9O.png)
23
 
24
  # Information
25
- Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
26
 
27
- Trained similarly to DeepSeek R1: we used SmolLM2 as the base model, SFT fine-tuned it on reasoning, and modified the tokenizer slightly. After the SFT fine-tuning, we used GRPO to further amplify its mathematics & problem-solving abilities.
 
 
28
 
29
  # Which quant is right for you?
30
 
@@ -56,6 +58,7 @@ So, i've counted all the letters correctly, meaning that I am sure that there ar
56
  <output>3
57
  </output><|im_end|>
58
  ```
 
59
  # system prompt
60
  (important to ensure it would always think, output).
61
  ```
 
22
  ![superthoughts lite](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/K5kYIHYj2aX2kB6MlcM9O.png)
23
 
24
  # Information
25
+ Advanced, high-quality and **lite** reasoning for a tiny size that you can run on your phone.
26
 
27
+ At original quality, it runs at ~400 tokens/second on a single NVIDIA H100 GPU from Friendli.
28
+
29
+ Trained similarly to DeepSeek R1: we used SmolLM2 as the base model, then SFT fine-tuned it on reasoning using our own private superthoughts instruct dataset, which includes a mix of code, website generation, day-to-day chats, math and counting problems. We then modified the tokenizer slightly, and after the SFT fine-tuning we used GRPO to further amplify its mathematics & problem-solving abilities.
30
 
31
  # Which quant is right for you?
32
 
 
58
  <output>3
59
  </output><|im_end|>
60
  ```
61
+ We recommend using a low temperature, as higher values may cause it to not think.
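To illustrate the temperature note above, here is a minimal sketch of generic softmax temperature scaling. It is not this model's inference code, and the logits are made-up values chosen only for illustration. It shows that a low temperature concentrates probability on the top-ranked token, while a higher temperature spreads it across alternatives, which makes off-pattern outputs more likely.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw logits into probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits for three candidate next tokens (illustrative only).
logits = [2.0, 1.0, 0.5]

low = softmax_with_temperature(logits, 0.2)   # sharply peaked distribution
high = softmax_with_temperature(logits, 1.5)  # flatter distribution

print("temperature 0.2:", [round(p, 3) for p in low])
print("temperature 1.5:", [round(p, 3) for p in high])
```

With most inference libraries, the corresponding knob is a `temperature` sampling parameter; its exact name and a good value for this model are not specified here.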
62
  # system prompt
63
  (important to ensure it would always think, output).
64
  ```