Tanu

tanu360

AI & ML interests

None yet

Recent Activity

liked a Space 5 days ago
ginigen/Workflow-Canvas
liked a Space 19 days ago
m-ric/open_Deep-Research
liked a Space 19 days ago
PramaLLC/BEN2
View all activity

Organizations

None yet

tanu360's activity

upvoted an article 19 days ago
view article
Article

Open-source DeepResearch – Freeing our search agents

1.09k
upvoted an article 27 days ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

771
reacted to chansung's post with 👍 about 1 month ago
view post
Post
2042
Simple Summarization on DeepSeek-R1 from DeepSeek AI

The RL stage is very important.
↳ However, it is difficult to create a truly helpful AI for people solely through RL.
↳ So, we applied a learning pipeline consisting of four stages: providing a good starting point, reasoning RL, SFT, and safety RL, and achieved performance comparable to o1.
↳ Simply fine-tuning other open models with the data generated by R1-Zero (distillation) resulted in performance comparable to o1-mini.

Of course, this is just a brief overview and may not be of much help. All models are accessible on Hugging Face, and the paper can be read through the GitHub repository.


Model: https://huggingface.co/deepseek-ai
Paper: https://github.com/deepseek-ai/DeepSeek-R1
  • 1 reply
·
reacted to AlexBodner's post with 👀 about 1 month ago
view post
Post
1486
I just dropped a detailed guide on deploying ML models to Google Cloud Run with GPU support—completely serverless and auto-scaling. If you’re curious about seamlessly deploying your models to the cloud, give it a read! [https://medium.com/@alexbodner/deployment-of-serverless-machine-learning-models-with-gpus-using-google-cloud-cloud-run-573b836475b5]"
  • 1 reply
·