kenhktsui
/

Qwen2.5-3B-Instruct-GRPO-basic-sampling_temp_05

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Qwen2.5-3B-Instruct-GRPO-basic-sampling_temp_05

1 contributor

History: 7 commits

kenhktsui's picture

Upload model trained with Unsloth

b0ccec8 verified 5 days ago

.gitattributes

1.57 kB

Upload tokenizer 6 days ago
README.md

617 Bytes

Trained with Unsloth 6 days ago
added_tokens.json

605 Bytes

Upload tokenizer 6 days ago
config.json

808 Bytes

Trained with Unsloth 6 days ago
generation_config.json

139 Bytes

Trained with Unsloth 6 days ago
merges.txt

1.67 MB

Upload tokenizer 6 days ago
pytorch_model-00001-of-00002.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.HalfStorage",
- "collections.OrderedDict"
What is a pickle import?
4.96 GB
LFS

Trained with Unsloth 6 days ago
pytorch_model-00002-of-00002.bin
Detected Pickle imports (3)
- "torch.HalfStorage",
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2"
What is a pickle import?
1.21 GB
LFS

Trained with Unsloth 6 days ago
pytorch_model.bin.index.json

35.6 kB

Trained with Unsloth 6 days ago
special_tokens_map.json

614 Bytes

Upload tokenizer 6 days ago
tokenizer.json

11.4 MB
LFS

Upload tokenizer 6 days ago
tokenizer_config.json

7.36 kB

Upload model trained with Unsloth 5 days ago
vocab.json

2.78 MB

Upload tokenizer 6 days ago