LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces Paper • 2312.13208 • Published Dec 20, 2023
Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning Paper • 2501.13042 • Published Jan 22
Nexus-O: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision Paper • 2503.01879 • Published 12 days ago • 1