Taylor658
's Collections
Computer Vision
updated
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse
Viewpoints
Paper
•
2412.07760
•
Published
•
50
MoViE: Mobile Diffusion for Video Editing
Paper
•
2412.06578
•
Published
•
18
Video Motion Transfer with Diffusion Transformers
Paper
•
2412.07776
•
Published
•
17
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment
Paper
•
2412.04814
•
Published
•
45
VisionZip: Longer is Better but Not Necessary in Vision Language Models
Paper
•
2412.04467
•
Published
•
105
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video
Generation
Paper
•
2412.02259
•
Published
•
59
STIV: Scalable Text and Image Conditioned Video Generation
Paper
•
2412.07730
•
Published
•
71
Towards Language Models That Can See: Computer Vision Through the LENS
of Natural Language
Paper
•
2306.16410
•
Published
•
28
SynerGen-VL: Towards Synergistic Image Understanding and Generation with
Vision Experts and Token Folding
Paper
•
2412.09604
•
Published
•
35
GenEx: Generating an Explorable World
Paper
•
2412.09624
•
Published
•
88
Apollo: An Exploration of Video Understanding in Large Multimodal Models
Paper
•
2412.10360
•
Published
•
137