Reinforcement Learning
kauiu commited on
Commit
1670542
·
verified ·
1 Parent(s): 867c0ac

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -5,9 +5,8 @@ datasets:
5
  - PJMixers-Dev/open-thoughts_OpenThoughts-114k-CustomShareGPT
6
  - open-r1/OpenR1-Math-220k
7
  base_model:
8
- - deepseek-ai/DeepSeek-V3
9
  - deepseek-ai/DeepSeek-R1
10
- pipeline_tag: question-answering
11
  new_version: deepseek-ai/DeepSeek-R1
12
  ---
13
  # Model Card for Model ID
 
5
  - PJMixers-Dev/open-thoughts_OpenThoughts-114k-CustomShareGPT
6
  - open-r1/OpenR1-Math-220k
7
  base_model:
 
8
  - deepseek-ai/DeepSeek-R1
9
+ pipeline_tag: reinforcement-learning
10
  new_version: deepseek-ai/DeepSeek-R1
11
  ---
12
  # Model Card for Model ID