OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 4 days ago • 157 • 17
ViM: Vision Middleware for Unified Downstream Transferring Paper • 2303.06911 • Published Mar 13, 2023
Rethinking Supervised Pre-training for Better Downstream Transferring Paper • 2110.06014 • Published Oct 12, 2021
RLIPv2: Fast Scaling of Relational Language-Image Pre-training Paper • 2308.09351 • Published Aug 18, 2023
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval Paper • 2211.12764 • Published Nov 23, 2022
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 93
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention Paper • 2409.01876 • Published Sep 3, 2024 • 2
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 4 days ago • 157
FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation Paper • 2412.16915 • Published Dec 22, 2024
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 4 days ago • 157
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 93