Sichen Liu

Seas0

https://seashkey.com

AI & ML interests

Diffusion, Flow, Any Generative model

Recent Activity

liked a Space 4 days ago

nanotron/ultrascale-playbook

liked a model 4 days ago

Qwen/QwQ-32B

authored a paper 5 days ago

Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think

Organizations

None yet

Seas0's activity

upvoted a paper 5 days ago

Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think

Paper • 2503.00948 • Published 8 days ago • 3

upvoted a paper 13 days ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published 14 days ago • 26

upvoted a paper 27 days ago

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Paper • 2411.19108 • Published Nov 28, 2024 • 19

upvoted a paper about 2 months ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 56

upvoted a collection 3 months ago

InternVL2.5

Collection

Better than InternVL 2.0 • 19 items • Updated 7 days ago • 86

upvoted 2 collections 6 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 12 days ago • 554

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 208

upvoted an article 6 months ago

Explaining the SDXL latent space

Article • May 20, 2024 • 37

upvoted a paper 7 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 159

upvoted 8 papers 12 months ago

DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model

Paper • 2402.17412 • Published Feb 27, 2024 • 23

ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models

Paper • 2403.02084 • Published Mar 4, 2024 • 15

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 186