[MODELS] Discussion

#372
by victor HF staff - opened
Hugging Chat org
edited Sep 23, 2024

Here we can discuss the models available on HuggingChat.

image.png

victor pinned discussion

What are the limits on using these? How many API calls can I send per month?

How can I know which model I am using?

How can I know which model I am using?

At the bottom of your screen:
image.png

Out of all these models, Gemma, which was recently released, has the most up-to-date information about .NET. However, I don't know which one gives the most accurate answers about coding.

Gemma seems really biased. With web search on, it says that it doesn't have access to recent information when I ask it almost anything about recent events. But when I ask about recent events on Google, I get responses that cover them.

Apparently Gemma cannot code?

Gemma is just like Google's Gemini series of models: it has very strong moral limits imposed on it. Any request that may relate to file operations, or to access that might be deep, gets censored and refused.
So even if there are solutions for such things in its training data, they will just be filtered out and ignored.
But I still haven't tested its coding accuracy on tasks unrelated to these kinds of "dangerous" operations.

Screenshot 2025-01-05 201832.png

Hi @nsarrazin @victor, can you guys take a look at this? On the third question, Nemotron 70B started producing output like this.

Meta Llama 3.3 does this too. It's pretty annoying, and it feels like random chance whether you get a coherent result or nonsensical gibberish.

Hugging Chat org

Hi everyone! The models should work better now. We're still investigating the cause so please report back if it happens again but the replicas should be fixed for now!

Whatever was done to fix the models seems to have worked, at least for Llama-3.3-70B-Instruct. All responses have been coherent with no need to retry, whereas before, most responses were incoherent with only an occasional coherent one. I don't know what replicas are, but they seem to be fixed.

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
This model seems to be broken. As soon as we chat a little, the model begins to respond with incomprehensible typing...

Are the Qwen series models down? I seem to be getting a 503 error.

Screenshot 2025-01-12 at 16-04-19 🕳 Time and space.png

So this happened today: Nemotron started acting weird again.
