microsoft
/

Phi-3-mini-4k-instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Quantized

#13

by wrosko - opened Apr 23, 2024

wrosko

Apr 23, 2024

The paper suggests 4-bit quantization, will Microsoft release the quantized version?

Microsoft org Apr 23, 2024

There are quantized models here:
https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-onnx
https://huggingface.co/microsoft/Phi-3-mini-128k-instruct-onnx
https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf

nguyenbh changed discussion status to closed Apr 23, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment