Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation

(github.com)

9 points | by neehao 13 hours ago

No comments yet.