-
-
-
-
-
-
Inference status
Active filters:
chat
sethut/QwQ-32B-Preview-Q8_0-GGUF
madroid/Qwen2.5-3B-Instruct-4bit-mlx
Text Generation
•
Updated
•
13
eligapris/Qwen2.5-Coder-32B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
•
2
tensorblock/Sailor-0.5B-Chat-GGUF
Updated
•
90
tensorblock/Experiment31-7B-GGUF
Text Generation
•
Updated
•
10
chende2024/Qwen2.5-1.5B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
mlx-community/Hermes-3-Llama-3.2-3B-4bit
Text Generation
•
Updated
•
333
mlx-community/Hermes-3-Llama-3.2-3B-8bit
Text Generation
•
Updated
•
61
mlx-community/Hermes-3-Llama-3.2-3B-bf16
Text Generation
•
Updated
•
51
mradermacher/Holland-4B-V1-GGUF
ShikharLLM/Llm1
Text Generation
•
Updated
•
251
JackeyLai/Qwen2.5-3B-Instruct-Q4_0-GGUF
Text Generation
•
Updated
•
16
JackeyLai/Qwen2.5-7B-Instruct-Q4_0-GGUF
Text Generation
•
Updated
•
20
cphan-intersystems/Qwen2.5-Coder-32B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
•
11
cphan-intersystems/Qwen2.5-32B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
•
2
mradermacher/QwQ-32B-Preview-GGUF
Updated
•
61
NikolayKozloff/Llama-DNA-1.0-8B-Instruct-Q8_0-GGUF
Text Generation
•
Updated
•
8
•
1
tensorblock/Sailor-1.8B-Chat-GGUF
Updated
•
76
ericliu2007/Qwen2.5-14B-Instruct-Q2_K-GGUF
Text Generation
•
Updated
•
7
Sg-at-srijan-us-kg/Qwen2.5-Coder-32B-Instruct-128k-yarn-Q4_K_M-GGUF
Text Generation
•
Updated
•
18
tensorblock/Llama-2-7b-ultrachat200k-GGUF
Text Generation
•
Updated
•
37
ericliu2007/Qwen2.5-32B-Instruct-Q2_K-GGUF
Text Generation
•
Updated
•
22
cnfusion/QwQ-32B-Coder-Fusion-8020-Q4-mlx
Text Generation
•
Updated
•
12
cnfusion/QwQ-32B-Coder-Fusion-7030-Q4-mlx
Text Generation
•
Updated
•
13
cnfusion/QwQ-32B-Coder-Fusion-8020-Q8-mlx
Text Generation
•
Updated
•
15
cnfusion/QwQ-32B-Coder-Fusion-7030-Q8-mlx
Text Generation
•
Updated
•
20
yuh0512/QwQ-32B-Preview-Q4_K_M-GGUF
Updated
•
12
tensorblock/Sailor-7B-Chat-GGUF
Updated
•
256
TheBlueObserver/Qwen2.5-1.5B-Instruct
Text Generation
•
Updated
•
21
mlx-community/Hermes-3-Llama-3.1-8B-3bit
Updated
•
158