Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Misc with no match

text-embeddings-inference

Models

3,403

Full-text search

Active filters: chat

sethut/QwQ-32B-Preview-Q8_0-GGUF

Updated Dec 11, 2024 • 5

madroid/Qwen2.5-3B-Instruct-4bit-mlx

Text Generation • Updated Dec 11, 2024 • 13

eligapris/Qwen2.5-Coder-32B-Instruct-Q4_K_M-GGUF

Text Generation • Updated Dec 11, 2024 • 2

tensorblock/Sailor-0.5B-Chat-GGUF

Updated Dec 11, 2024 • 90

tensorblock/Experiment31-7B-GGUF

Text Generation • Updated Dec 11, 2024 • 10

chende2024/Qwen2.5-1.5B-Instruct-Q4_K_M-GGUF

Text Generation • Updated Dec 11, 2024

mlx-community/Hermes-3-Llama-3.2-3B-4bit

Text Generation • Updated Dec 11, 2024 • 333

mlx-community/Hermes-3-Llama-3.2-3B-8bit

Text Generation • Updated Dec 11, 2024 • 61

mlx-community/Hermes-3-Llama-3.2-3B-bf16

Text Generation • Updated Dec 11, 2024 • 51

mradermacher/Holland-4B-V1-GGUF

Updated Dec 11, 2024 • 7

ShikharLLM/Llm1

Text Generation • Updated 15 days ago • 251

JackeyLai/Qwen2.5-3B-Instruct-Q4_0-GGUF

Text Generation • Updated Dec 12, 2024 • 16

JackeyLai/Qwen2.5-7B-Instruct-Q4_0-GGUF

Text Generation • Updated Dec 12, 2024 • 20

cphan-intersystems/Qwen2.5-Coder-32B-Instruct-Q4_K_M-GGUF

Text Generation • Updated Dec 12, 2024 • 11

cphan-intersystems/Qwen2.5-32B-Instruct-Q4_K_M-GGUF

Text Generation • Updated Dec 12, 2024 • 2

mradermacher/QwQ-32B-Preview-GGUF

Updated Dec 12, 2024 • 61

NikolayKozloff/Llama-DNA-1.0-8B-Instruct-Q8_0-GGUF

Text Generation • Updated Dec 12, 2024 • 8 • 1

tensorblock/Sailor-1.8B-Chat-GGUF

Updated Dec 12, 2024 • 76

ericliu2007/Qwen2.5-14B-Instruct-Q2_K-GGUF

Text Generation • Updated Dec 12, 2024 • 7

Sg-at-srijan-us-kg/Qwen2.5-Coder-32B-Instruct-128k-yarn-Q4_K_M-GGUF

Text Generation • Updated about 1 month ago • 18

tensorblock/Llama-2-7b-ultrachat200k-GGUF

Text Generation • Updated Dec 12, 2024 • 37

ericliu2007/Qwen2.5-32B-Instruct-Q2_K-GGUF

Text Generation • Updated Dec 13, 2024 • 22

cnfusion/QwQ-32B-Coder-Fusion-8020-Q4-mlx

Text Generation • Updated Dec 13, 2024 • 12

cnfusion/QwQ-32B-Coder-Fusion-7030-Q4-mlx

Text Generation • Updated Dec 13, 2024 • 13

cnfusion/QwQ-32B-Coder-Fusion-8020-Q8-mlx

Text Generation • Updated about 1 month ago • 15

cnfusion/QwQ-32B-Coder-Fusion-7030-Q8-mlx

Text Generation • Updated about 1 month ago • 20

yuh0512/QwQ-32B-Preview-Q4_K_M-GGUF

Updated about 1 month ago • 12

tensorblock/Sailor-7B-Chat-GGUF

Updated about 1 month ago • 256

TheBlueObserver/Qwen2.5-1.5B-Instruct

Text Generation • Updated about 1 month ago • 21

mlx-community/Hermes-3-Llama-3.1-8B-3bit

Updated about 1 month ago • 158