Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 6 days ago • 55
XCube ($\mathcal{X}^3$): Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies Paper • 2312.03806 • Published Dec 6, 2023 • 1
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Paper • 2403.15385 • Published Mar 22, 2024 • 6
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation Paper • 2404.19752 • Published Apr 30, 2024 • 22
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper • 2411.07126 • Published Nov 11, 2024 • 28
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published Nov 14, 2024 • 71
Wolf: Captioning Everything with a World Summarization Framework Paper • 2407.18908 • Published Jul 26, 2024 • 32
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation Paper • 2404.19752 • Published Apr 30, 2024 • 22
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Paper • 2402.19481 • Published Feb 29, 2024 • 20
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Paper • 2310.15308 • Published Oct 23, 2023 • 22
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion Paper • 2307.01097 • Published Jul 3, 2023 • 10