AlejandroOlmedo
/

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

8-bit precision

Model card Files Files and versions Community

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx

1 contributor

History: 13 commits

AlejandroOlmedo's picture

AlejandroOlmedo

Update README.md

3be7211 verified about 10 hours ago

.gitattributes

1.57 kB

Upload tokenizer.json with huggingface_hub 3 days ago
README.md

2.85 kB

Update README.md about 10 hours ago
config.json

917 Bytes

Upload config.json with huggingface_hub 3 days ago
model-00001-of-00002.safetensors

5.32 GB
LFS

Upload model-00001-of-00002.safetensors with huggingface_hub 3 days ago
model-00002-of-00002.safetensors

2.78 GB
LFS

Upload model-00002-of-00002.safetensors with huggingface_hub 3 days ago
model.safetensors.index.json

62.7 kB

Upload model.safetensors.index.json with huggingface_hub 3 days ago
special_tokens_map.json

485 Bytes

Upload special_tokens_map.json with huggingface_hub 3 days ago
tokenizer.json

11.4 MB
LFS

Upload tokenizer.json with huggingface_hub 3 days ago
tokenizer_config.json

6.86 kB

Upload tokenizer_config.json with huggingface_hub 3 days ago