mlx-community/simplescaling-s1-32B-4bit

The model mlx-community/simplescaling-s1-32B-4bit was converted to MLX format from simplescaling/s1-32B by Focused, using mlx-lm version 0.21.1.

Use with mlx

pip install mlx-lm

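from mlx_lm import load, generate

# Download (on first use) and load the 4-bit weights and the tokenizer.
model, tokenizer = load("mlx-community/simplescaling-s1-32B-4bit")

prompt = "hello"

# If the tokenizer ships a chat template, wrap the prompt as a user turn
# and append the generation prompt so the model answers as the assistant.
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

# verbose=True prints the generated text as it is produced.
response = generate(model, tokenizer, prompt=prompt, verbose=True)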
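The steps above can also be wrapped in a small reusable function. The following is a minimal sketch, not part of mlx-lm: the names build_messages, ask, and MODEL_REPO are hypothetical helpers introduced here for illustration, and the optional system prompt assumes the model's chat template accepts a "system" role. The max_tokens argument is passed through to generate to cap the response length.

```python
from typing import Dict, List, Optional

MODEL_REPO = "mlx-community/simplescaling-s1-32B-4bit"


def build_messages(
    user_prompt: str, system_prompt: Optional[str] = None
) -> List[Dict[str, str]]:
    """Build a chat-style message list (hypothetical helper, not an mlx-lm API)."""
    messages = []
    if system_prompt:
        # Assumption: the chat template supports a "system" role.
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_prompt})
    return messages


def ask(
    user_prompt: str, system_prompt: Optional[str] = None, max_tokens: int = 256
) -> str:
    """Load the model, apply the chat template if present, and return the reply."""
    # Imported lazily so build_messages can be used without mlx-lm installed.
    from mlx_lm import load, generate

    # Note: this reloads the model on every call; cache it for repeated use.
    model, tokenizer = load(MODEL_REPO)

    prompt = user_prompt
    if tokenizer.chat_template is not None:
        prompt = tokenizer.apply_chat_template(
            build_messages(user_prompt, system_prompt),
            add_generation_prompt=True,
        )

    # max_tokens caps the number of generated tokens.
    return generate(
        model, tokenizer, prompt=prompt, max_tokens=max_tokens, verbose=False
    )
```

Calling ask("hello", max_tokens=64) would download the weights on first use and return the generated text as a string instead of printing it.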

Focused is a technology company at the forefront of AI-driven development, empowering organizations to unlock the full potential of artificial intelligence. From integrating innovative models into existing systems to building scalable, modern AI infrastructures, we specialize in delivering tailored, incremental solutions that meet you where you are. Curious how we can help with your next AI project? Get in touch.
