Edit Models filters

Inference status

Misc

8-bit precision

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Mixture of Experts

text-embeddings-inference

Carbon Emissions

Models

11,087

Full-text search

Active filters: 8-bit

mlx-community/phi-4-8bit

Text Generation • Updated about 4 hours ago • 368 • 6

Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 18, 2024 • 21.6k • 12

huihui-ai/Llama-3.3-70B-Instruct-abliterated-finetuned-GPTQ-Int8

Text Generation • Updated 6 days ago • 158k • 3

MaziyarPanahi/phi-4-GGUF

Text Generation • Updated 4 days ago • 158k • 2

nejumi/phi-4-GPTQ-Int8-calib-ja-1k

Updated 2 days ago • 49 • 2

rycont/kakaobrain__kogpt-6b-8bit

Text Generation • Updated May 30, 2023 • 22 • 2

samadpls/querypls-prompt2sql

Text2Text Generation • Updated Nov 2, 2024 • 64 • 6

Undi95/Llama2-13B-no_robots-alpaca-lora

Text Generation • Updated Nov 17, 2023 • 573 • 10

ecastera/eva-mistral-dolphin-7b-spanish

Text Generation • Updated Mar 16, 2024 • 51 • 12

ecastera/eva-mistral-catmacaroni-7b-spanish

Text Generation • Updated Jan 29, 2024 • 12 • 2

MaziyarPanahi/ANIMA-Phi-Neptune-Mistral-7B-Mistral-7B-Instruct-v0.1-GGUF

Text Generation • Updated Jan 26, 2024 • 61 • 1

Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int8

Text Generation • Updated Apr 30, 2024 • 70 • 3

MaziyarPanahi/Yi-9B-200K-GGUF

Text Generation • Updated Mar 18, 2024 • 100 • 6

Vision-CAIR/MiniGPT4-Video

Updated Jul 24, 2024 • 23 • 31

MaziyarPanahi/Meta-Llama-3-8B-Instruct-GGUF

Text Generation • Updated Apr 23, 2024 • 2.6M • 79

MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF

Text Generation • Updated May 22, 2024 • 2.59M • 73

kim512/Llama-3-70b-Arimas-story-RP-V1.6-8.0bpw-h8-exl2

Text Generation • Updated Jun 17, 2024 • 24 • 1

lakkeo/stable-cypher-instruct-3b

Text Generation • Updated Oct 3, 2024 • 451 • 24

neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w8a16

Text Generation • Updated Jul 18, 2024 • 7.02k • 3

neuralmagic/Meta-Llama-3-70B-Instruct-quantized.w8a16

Text Generation • Updated Jul 18, 2024 • 318 • 4

neuralmagic/Mistral-7B-Instruct-v0.3-quantized.w8a8

Text Generation • Updated Oct 9, 2024 • 311 • 1

meta-llama/Llama-Guard-3-8B-INT8

Text Generation • Updated Aug 7, 2024 • 1.91k • 32

MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF

Text Generation • Updated Jul 29, 2024 • 2.55M • 38

neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Text Generation • Updated Oct 23, 2024 • 4.35k • 13

Slvcxc/L3-Super-Nova-RP-8B-8.0bpw-h8-exl2

Text Generation • Updated Jul 24, 2024 • 34 • 5

neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8

Text Generation • Updated Oct 10, 2024 • 7.81k • 18

neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8

Text Generation • Updated Oct 23, 2024 • 307 • 2

MaziyarPanahi/gemma-2-2b-it-GGUF

Text Generation • Updated Aug 1, 2024 • 2.55M • 11

LoneStriker/Hermes-3-Llama-3.1-8B-8.0bpw-h8-exl2

Updated Aug 15, 2024 • 8 • 1

KhanhVan/Vistral-7B-Chat-gguf1

Text Generation • Updated Aug 24, 2024 • 76 • 2