neuralmagic/DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16
Text Generation
Transformers
Safetensors
qwen2
deepseek
int4
vllm
llmcompressor
conversational
text-generation-inference
Inference Endpoints
compressed-tensors
arxiv: 2210.17323
License: mit
Commit History (branch: main)
138dd61  update tokenizer configs (nm-research, committed 4 days ago)
c0dc799  Update README.md (nm-research, verified, committed 6 days ago)
a542407  Create README.md (nm-research, verified, committed 14 days ago)
7642f19  Upload folder using huggingface_hub (nm-research, verified, committed 22 days ago)
1cec3af  initial commit (nm-research, verified, committed 22 days ago)