One of the coolest features of word embeddings is that you can do arithmetic with them. For example, woman + (king - man) = queen.
Is it possible to do the same visually? In our new #CVPR2022 paper, we show that in CLIP's embedding space, it is.
Code: github.com/YoadTew/zero-s…
[1/3]

AK (@_akhaliq)
Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic abs: arxiv.org/abs/2111.14447
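
For readers new to the word-embedding arithmetic mentioned in the tweet, here is a minimal sketch of woman + (king - man) = queen. The three-dimensional vectors are hand-made for illustration (assumed axes: royalty, gender, food), not real embeddings, and the `analogy` helper is a hypothetical name. It does not use CLIP and is not the paper's method.

```python
import math

# Hand-made toy vectors (NOT real embeddings). Assumed axes: royalty, gender, food.
TOY_VECTORS = {
    "king":  (0.9,  0.8, 0.0),
    "queen": (0.9, -0.8, 0.0),
    "man":   (0.1,  0.8, 0.0),
    "woman": (0.1, -0.8, 0.0),
    "apple": (0.0,  0.0, 1.0),
}


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def analogy(a, b, c, vectors=TOY_VECTORS):
    """Return the word closest to a + (b - c), skipping the three input words."""
    query = tuple(x + (y - z) for x, y, z in zip(vectors[a], vectors[b], vectors[c]))
    candidates = (w for w in vectors if w not in {a, b, c})
    return max(candidates, key=lambda w: cosine(query, vectors[w]))


print(analogy("woman", "king", "man"))
```

With these toy vectors the query lands on the "queen" vector. Real embeddings are learned from data and only approximate this behaviour, which is why the nearest neighbour is looked up by cosine similarity rather than by exact match.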