sergiopaniego committed on
Commit 5644c6d · verified · 1 Parent(s): f9264f2

Model save

Files changed (2)
  1. README.md +1 -2
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -1,6 +1,5 @@
  ---
  base_model: Qwen/Qwen2-0.5B-Instruct
- datasets: AI-MO/NuminaMath-TIR
  library_name: transformers
  model_name: Qwen2-0.5B-GRPO
  tags:
@@ -12,7 +11,7 @@ licence: license
 
  # Model Card for Qwen2-0.5B-GRPO
 
- This model is a fine-tuned version of [Qwen/Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct) on the [AI-MO/NuminaMath-TIR](https://huggingface.co/datasets/AI-MO/NuminaMath-TIR) dataset.
+ This model is a fine-tuned version of [Qwen/Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct).
  It has been trained using [TRL](https://github.com/huggingface/trl).
 
  ## Quick start
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:c2fba1d7e535642aaa5c800d1cea03979b0b39dfaacd8006c84469c0542dc5d8
+ oid sha256:e6b227f34355fe95c859c2d662d9e03401b277ee8ae70991689634c0a513731c
  size 2175168