VILA-U: Efficient and Unified Visual Language Understanding and Generation

Publication
International Conference on Learning Representations (ICLR)
