Unchun Yang

ucyang · https://ucyang.com/

AI & ML interests

None yet

Recent Activity

liked a model about 24 hours ago

Qwen/Qwen2.5-14B-Instruct-1M

upvoted a collection about 24 hours ago

liked a dataset 1 day ago

FreedomIntelligence/medical-o1-reasoning-SFT

ucyang's activity

upvoted a collection about 24 hours ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 1 day ago • 73

upvoted a collection 1 day ago

HuatuoGPT-o1

4 items • Updated 28 days ago • 15

upvoted a paper 1 day ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 97

upvoted a paper 2 days ago

ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Paper • 2501.10132 • Published 10 days ago • 14

upvoted a collection 4 days ago

Eagle 2

Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 4 days ago • 19

upvoted a paper 6 days ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published 6 days ago • 45

upvoted 3 collections 6 days ago

DeepSeek-R1

8 items • Updated 7 days ago • 187

Phi-4 (All Versions)

Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. • 4 items • Updated 7 days ago • 34

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 27 items • Updated about 16 hours ago • 68

upvoted an article 8 days ago

🌁#83: GAN is back

Article • 14 days ago • 7

upvoted 5 collections 9 days ago

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks • 6 items • Updated Dec 13, 2024 • 10

Aya Datasets

The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 5 items • Updated Dec 3, 2024 • 15

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated Dec 16, 2024 • 30

Command Models

Latest C4AI Command models • 4 items • Updated 10 days ago • 6

Phi-4

Phi-4 small language model. • 2 items • Updated 19 days ago • 45

upvoted a paper 11 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 13 days ago • 268

upvoted a collection 13 days ago

MiniCPM

The MiniCPM family of LLMs and VLLMs. • 32 items • Updated 8 days ago • 60

upvoted a paper 14 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 20 days ago • 81

upvoted a paper 15 days ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 59

upvoted an article 16 days ago

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Article • Dec 4, 2024 • 76