2 18 71

wangrui

varuy322

varuy322

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

liked a dataset 5 days ago

HuggingFaceFW/fineweb-edu

liked a model 5 days ago

yulan-team/YuLan-Mini

View all activity

Organizations

None yet

varuy322's activity

upvoted an article 3 days ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

•

10 days ago

• 29

upvoted a paper 19 days ago

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published 23 days ago • 22

upvoted a collection 21 days ago

Agents

Collection

67 items • Updated 3 days ago • 3

upvoted a paper 3 months ago

Baichuan Alignment Technical Report

Paper • 2410.14940 • Published Oct 19, 2024 • 50

upvoted an article 3 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31, 2024

• 59

upvoted 2 papers 3 months ago

Erasing Conceptual Knowledge from Language Models

Paper • 2410.02760 • Published Oct 3, 2024 • 14

LML: Language Model Learning a Dataset for Data-Augmented Prediction

Paper • 2409.18957 • Published Sep 27, 2024 • 10

upvoted 2 collections 4 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 555

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 18 items • Updated 3 days ago • 99

upvoted an article 4 months ago

Article

An Introduction to Deep Reinforcement Learning

May 4, 2022

• 3

upvoted an article 6 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 298

upvoted a paper 7 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 90

upvoted an article 7 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20, 2024

• 72

upvoted 3 collections 7 months ago

upvoted an article 8 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19, 2024

• 128

upvoted an article 9 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 127