emin temiz PRO

etemiz

AI & ML interests

None yet

Recent Activity

replied to AlexBodner's post about 11 hours ago

Just published a post explaining Monte Carlo Tree Search: the magic behind AlphaZero and now used to tackle reasoning benchmarks with LLMs. Check it out because it's a must know nowadays! https://x.com/AlexBodner_/status/1877789879398244382

reacted to AlexBodner's post with 🔥 about 11 hours ago


posted an update 1 day ago

-= DeepSeek V3 =-

After installing the new CUDA toolkit and recompiling llama.cpp, I tested DeepSeek V3 yesterday.

In terms of human alignment, compared to DeepSeek 2.5, DeepSeek V3 did worse on:
- health
- fasting
- nostr
- misinfo
- nutrition

and did better on:
- faith
- bitcoin
- alternative medicine
- ancient wisdom

In my opinion it is worse overall than 2.5, and 2.5 wasn't that great.

There is a general tendency for models to get smarter while at the same time becoming less wise, less human aligned, and less beneficial to humans. I don't know what is causing this. Maybe the use of synthetic datasets for further training makes LLMs more and more detached from humanity. This is not going in the right direction.

My solution is to form a curator council to determine the datasets that are closest to human preference. "Humans that care about other humans the most" could be a definition of this dataset. What do you think?

Articles

Symbiotic Intelligence

Organizations

None yet

etemiz's activity

New activity in mradermacher/grok-1-GGUF 2 months ago

parts

#1 opened 2 months ago by

New activity in hilaltekgoz/whisper-large-tr 3 months ago

nice

#1 opened 3 months ago by

New activity in etemiz/Llama-3.1-405B-Inst-GGUF 6 months ago

Larger quants please

#1 opened 6 months ago by

New activity in DontPlanToEnd/UGI-Leaderboard 9 months ago

Llama3

#7 opened 9 months ago by

New activity in hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling 9 months ago

just for curiosity

#1 opened 10 months ago by

New activity in LiteLLMs/Mixtral-8x22B-v0.1-GGUF 9 months ago

ram usage

#1 opened 9 months ago by Tom-Neverwinter

New activity in AlexWortega/miqu-1-70b-AQLM-2Bit-1x16-hf 10 months ago

Space before EOS

#3 opened 10 months ago by

New activity in ISTA-DASLab/Mixtral-8x7B-Instruct-v0_1-AQLM-2Bit-1x16-hf 10 months ago

One needs a Qwen 72B AQLM

#4 opened 10 months ago by

New activity in hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling 10 months ago

just for curiosity

#1 opened 10 months ago by
