Satori-reasoning
/

Satori-7B-Round2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

chaoscodes commited on Feb 5

Commit

64e0325

·

verified ·

1 Parent(s): 3055553

Update README.md

Files changed (1) hide show

README.md +9 -6

README.md CHANGED Viewed

@@ -138,15 +138,18 @@ We provide our training datasets:
 Please refer to our blog and research paper for more technical details of Satori.
  - [Blog](https://satori-reasoning.github.io/blog/satori/)
- - [Paper](https://satori-reasoning.github.io/blog/satori/)
 # **Citation**
 If you find our model and data helpful, please cite our paper:
 ```
-@article{TBD,
-  title={Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search},
-  author={Maohao Shen and Guangtao Zeng and Zhenting Qi and Zhang-Wei Hong and Zhenfang Chen and Wei Lu and Gregory Wornell and Subhro Das and David Cox and Chuang Gan},
-  journal={arXiv preprint arXiv: TBD},
-  year={2025}
 }
 ```

 Please refer to our blog and research paper for more technical details of Satori.
  - [Blog](https://satori-reasoning.github.io/blog/satori/)
+ - [Paper](https://arxiv.org/pdf/2502.02508)
 # **Citation**
 If you find our model and data helpful, please cite our paper:
 ```
+@misc{shen2025satorireinforcementlearningchainofactionthought,
+      title={Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search},
+      author={Maohao Shen and Guangtao Zeng and Zhenting Qi and Zhang-Wei Hong and Zhenfang Chen and Wei Lu and Gregory Wornell and Subhro Das and David Cox and Chuang Gan},
+      year={2025},
+      eprint={2502.02508},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2502.02508},
 }
 ```