msr_knowledgenlp retweeted

Any-to-Any Generation via Composable Diffusion
present Composable Diffusion (CoDi), a novel generative model capable of generating any combination of output modalities, such as language, image, video, or audio, from any combination of input modalities
paper page: huggingface.co/papers/2305.11…
English














