Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
suayptalha
/
ThinkerLlama-8B-v1
like
3
Text Generation
Transformers
Safetensors
microsoft/orca-math-word-problems-200k
English
llama
unsloth
trl
grpo
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
ThinkerLlama-8B-v1
1 contributor
History:
7 commits
suayptalha
Update README.md
96eeab4
verified
7 days ago
.gitattributes
1.57 kB
Upload tokenizer
8 days ago
README.md
1.13 kB
Update README.md
7 days ago
config.json
990 Bytes
Trained with Unsloth
8 days ago
generation_config.json
166 Bytes
Trained with Unsloth
8 days ago
model-00001-of-00004.safetensors
4.98 GB
LFS
Trained with Unsloth
8 days ago
model-00002-of-00004.safetensors
5 GB
LFS
Trained with Unsloth
8 days ago
model-00003-of-00004.safetensors
4.92 GB
LFS
Trained with Unsloth
8 days ago
model-00004-of-00004.safetensors
1.17 GB
LFS
Trained with Unsloth
8 days ago
model.safetensors.index.json
24 kB
Trained with Unsloth
8 days ago
special_tokens_map.json
454 Bytes
Upload tokenizer
8 days ago
tokenizer.json
17.2 MB
LFS
Upload tokenizer
8 days ago
tokenizer_config.json
55.5 kB
Upload tokenizer
8 days ago