-
-
-
-
-
-
Inference status
Active filters:
8-bit
mlx-community/phi-4-8bit
Text Generation
•
Updated
•
368
•
6
Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
21.6k
•
12
huihui-ai/Llama-3.3-70B-Instruct-abliterated-finetuned-GPTQ-Int8
Text Generation
•
Updated
•
158k
•
3
MaziyarPanahi/phi-4-GGUF
Text Generation
•
Updated
•
158k
•
2
nejumi/phi-4-GPTQ-Int8-calib-ja-1k
Updated
•
49
•
2
rycont/kakaobrain__kogpt-6b-8bit
Text Generation
•
Updated
•
22
•
2
samadpls/querypls-prompt2sql
Text2Text Generation
•
Updated
•
64
•
6
Undi95/Llama2-13B-no_robots-alpaca-lora
Text Generation
•
Updated
•
573
•
10
ecastera/eva-mistral-dolphin-7b-spanish
Text Generation
•
Updated
•
51
•
12
ecastera/eva-mistral-catmacaroni-7b-spanish
Text Generation
•
Updated
•
12
•
2
MaziyarPanahi/ANIMA-Phi-Neptune-Mistral-7B-Mistral-7B-Instruct-v0.1-GGUF
Text Generation
•
Updated
•
61
•
1
Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int8
Text Generation
•
Updated
•
70
•
3
MaziyarPanahi/Yi-9B-200K-GGUF
Text Generation
•
Updated
•
100
•
6
Vision-CAIR/MiniGPT4-Video
Updated
•
23
•
31
MaziyarPanahi/Meta-Llama-3-8B-Instruct-GGUF
Text Generation
•
Updated
•
2.6M
•
79
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF
Text Generation
•
Updated
•
2.59M
•
73
kim512/Llama-3-70b-Arimas-story-RP-V1.6-8.0bpw-h8-exl2
Text Generation
•
Updated
•
24
•
1
lakkeo/stable-cypher-instruct-3b
Text Generation
•
Updated
•
451
•
24
neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
7.02k
•
3
neuralmagic/Meta-Llama-3-70B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
318
•
4
neuralmagic/Mistral-7B-Instruct-v0.3-quantized.w8a8
Text Generation
•
Updated
•
311
•
1
meta-llama/Llama-Guard-3-8B-INT8
Text Generation
•
Updated
•
1.91k
•
32
MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF
Text Generation
•
Updated
•
2.55M
•
38
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
4.35k
•
13
Slvcxc/L3-Super-Nova-RP-8B-8.0bpw-h8-exl2
Text Generation
•
Updated
•
34
•
5
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
7.81k
•
18
neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8
Text Generation
•
Updated
•
307
•
2
MaziyarPanahi/gemma-2-2b-it-GGUF
Text Generation
•
Updated
•
2.55M
•
11
LoneStriker/Hermes-3-Llama-3.1-8B-8.0bpw-h8-exl2
Updated
•
8
•
1
KhanhVan/Vistral-7B-Chat-gguf1
Text Generation
•
Updated
•
76
•
2