emin temiz
PRO
etemiz
7 followers
·
9 following
AI & ML interests
None yet
Recent Activity
replied to AlexBodner's post about 11 hours ago
Just published a post explaining Monte Carlo Tree Search: the magic behind AlphaZero and now used to tackle reasoning benchmarks with LLMs. Check it out because it's a must know nowadays! https://x.com/AlexBodner_/status/1877789879398244382
reacted to AlexBodner's post with 🔥 about 11 hours ago
Just published a post explaining Monte Carlo Tree Search: the magic behind AlphaZero and now used to tackle reasoning benchmarks with LLMs. Check it out because it's a must know nowadays! https://x.com/AlexBodner_/status/1877789879398244382
posted an update 1 day ago
-= DeepSeek V3 =-

After installing the new CUDA toolkit and compiling llama.cpp again, I tested DeepSeek V3 yesterday.

In terms of human alignment, compared to DeepSeek 2.5, DeepSeek V3 did worse on:
- health
- fasting
- nostr
- misinfo
- nutrition

and did better on:
- faith
- bitcoin
- alternative medicine
- ancient wisdom

In my opinion, overall it is worse than 2.5, and 2.5 wasn't that great.

There is a general tendency of models getting smarter but at the same time less wise, less human-aligned, and less beneficial to humans. I don't know what is causing this. But maybe using synthetic datasets for further training makes LLMs more and more detached from humanity. This is not going in the right direction.

My solution is to come up with a curator council to determine the datasets that are closest to human preference. "Humans that care about other humans the most" could be a definition of this dataset. What do you think?
Articles
Symbiotic Intelligence
Nov 19, 2024
•
2
Organizations
None yet
etemiz's activity
New activity in
mradermacher/grok-1-GGUF
2 months ago
parts
1
#1 opened 2 months ago by
etemiz
New activity in
hilaltekgoz/whisper-large-tr
3 months ago
nice
#1 opened 3 months ago by
etemiz
New activity in
etemiz/Llama-3.1-405B-Inst-GGUF
6 months ago
Larger quants please
1
#1 opened 6 months ago by
YokaiKoibito
New activity in
DontPlanToEnd/UGI-Leaderboard
9 months ago
Llama3
4
#7 opened 9 months ago by
etemiz
New activity in
hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling
9 months ago
just for curiosity
9
#1 opened 10 months ago by
prudant
New activity in
LiteLLMs/Mixtral-8x22B-v0.1-GGUF
9 months ago
ram usage
1
#1 opened 9 months ago by
Tom-Neverwinter
New activity in
AlexWortega/miqu-1-70b-AQLM-2Bit-1x16-hf
10 months ago
Space before EOS
#3 opened 10 months ago by
etemiz
New activity in
ISTA-DASLab/Mixtral-8x7B-Instruct-v0_1-AQLM-2Bit-1x16-hf
10 months ago
One needs a Qwen 72B AQLM
#4 opened 10 months ago by
etemiz
New activity in
hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling
10 months ago
just for curiosity
9
#1 opened 10 months ago by
prudant