Pinkstack
/

Superthoughts-lite-v1-GGUF

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Pinkstack commited on 6 days ago

Commit

44333e4

·

verified ·

1 Parent(s): 578313c

Update README.md

Files changed (1) hide show

README.md +5 -2

README.md CHANGED Viewed

@@ -22,9 +22,11 @@ Demo: https://huggingface.co/spaces/Pinkstack/Chat-with-superthoughts-lite
 ![superthoughts lite](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/K5kYIHYj2aX2kB6MlcM9O.png)
 # Information
-Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
-Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned in on reasoning & modified the tokenizer slightly, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
 # Which quant is right for you?
@@ -56,6 +58,7 @@ So, i've counted all the letters correctly, meaning that I am sure that there ar
 <output>3
 </output><|im_end|>
 ```
 # system prompt
 (important to ensure it would always think, output).
 ```

 ![superthoughts lite](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/K5kYIHYj2aX2kB6MlcM9O.png)
 # Information
+Advanced, high-quality and **lite** reasoning for a tiny size that you can run on your phone.
+At original quality, it runs at ~400 tokens/second on a single H100 Nvidia GPU from Friendli.
+Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned on reasoning using our own private superthoughts instruct dataset which includes a mix of code, website generation, day-to-day chats, math and counting problems. And then we modified the tokenizer slightly, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
 # Which quant is right for you?
 <output>3
 </output><|im_end|>
 ```
+We reccomend to use a low temperatures as higher values may cause it to not think.
 # system prompt
 (important to ensure it would always think, output).
 ```