Can this model use with VLLM?

#2
by ChloeHuang1 - opened

Can this model use with VLLM?

Yes. I am getting 60+ tokens/s (single user) on 3090.

Sign up or log in to comment