Paper summary: Janus - Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

DeepSeek is on a roll

Repo: https://github.com/deepseek-ai/Janus (MIT license!)

Paper: https://arxiv.org/html/2410.13848v1

United States of Banan is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.