Update README.md
README.md
CHANGED
@@ -14,6 +14,8 @@ base_model:
 
 # Model Card
 
+[Github](https://github.com/mrzjy/CreativeTinyZero) repo here.
+
 Unlike the impressive DeepSeek-R1(-Zero), this project focuses on a pure reinforcement learning (RL) experiment applied to an open-domain task: creative advertisement generation.
 
 **Objective:**