Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
steerapi
/
Llama-2-7b-chat-hf-onnx-awq
like
0
Text Generation
Transformers
ONNX
llama
Inference Endpoints
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
62f6e98
Llama-2-7b-chat-hf-onnx-awq
/
onnx
1 contributor
History:
2 commits
steerapi
Upload folder using huggingface_hub
f236c3f
over 1 year ago
q1
Upload folder using huggingface_hub
over 1 year ago
decoder_model.onnx
Safe
5.44 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_model.onnx_data
Safe
27 GB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_model_merged.onnx
Safe
10.9 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_model_merged.onnx_data
Safe
27 GB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_model_merged_quantized.onnx
Safe
19 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_model_merged_quantized.onnx_data
Safe
6.74 GB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_with_past_model.onnx
Safe
5.47 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_with_past_model.onnx_data
Safe
27 GB
LFS
Upload folder using huggingface_hub
over 1 year ago
quantize_config.json
Safe
991 Bytes
Upload folder using huggingface_hub
over 1 year ago