mrzjy/Qwen2.5-1.5B-GRPO-Creative-Ad-Generation



mrzjy committed 8 days ago

Commit df323e9 · verified · 1 Parent(s): 9eafb6a

Update README.md

Files changed (1)

README.md +2 -0


@@ -14,6 +14,8 @@ base_model:
 # Model Card
+[Github](https://github.com/mrzjy/CreativeTinyZero) repo here.
 Unlike the impressive DeepSeek-R1(-Zero), this project focuses on a pure reinforcement learning (RL) experiment applied to an open-domain task: creative advertisement generation.
 **Objective:**