Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Recent Activity

reacted to JingzeShi's post with 🤯 about 13 hours ago

Pre-training a model on only a single RTX 4090 is really slow, even for small language models! (https://huggingface.co/collections/JingzeShi/doge-slm-677fd879f8c4fd0f43e05458)

upvoted a paper 1 day ago

Control LLM: Controlled Evolution for Intelligence Retention in LLM

upvoted a paper 3 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Organizations

None yet

Collections 12

models 10

datasets 2

bfuzzy1/gunny_v2_solo_dolo

Viewer • Updated Oct 10, 2024 • 2.9k • 43 • 1

bfuzzy1/gunny_x

Viewer • Updated Oct 1, 2024 • 10k • 64 • 3

Collections 12

bfuzzy1/acheron-m

bfuzzy1/acheron-m1a-llama

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Deliberation in Latent Space via Differentiable Cache Augmentation

Outcome-Refining Process Supervision for Code Generation

models 10

bfuzzy1/acheron-m1a-llama

bfuzzy1/acheron-m

bfuzzy1/acheron-d

bfuzzy1/llambses-1

bfuzzy1/acheron-o9

bfuzzy1/acheron

bfuzzy1/acheron-c

bfuzzy1/Gunny

bfuzzy1/llambses-1_4bit

bfuzzy1/acheron-x
