cclin

15 posts

@cclin_

Joined July 2019
77 Following, 3 Followers
cclin retweeted
Tan Wang (@Wangt97)
Thx @_akhaliq! Check out our DisCo at disco-dance.github.io.🔥🔥🔥 🧙‍♂️High generalizability: no human-specific fine-tuning needed! 💃Extensive human-related applications with disentangled control! 👨‍💻Easy-to-follow framework and fully open-source code!
AK (@_akhaliq)

DisCo: Disentangled Control for Referring Human Dance Generation in Real World paper page: huggingface.co/papers/2307.00… Generative AI has made significant strides in computer vision, particularly in image/video synthesis conditioned on text descriptions. Despite the advancements, it remains challenging especially in the generation of human-centric content such as dance synthesis. Existing dance synthesis methods struggle with the gap between synthesized content and real-world dance scenarios. In this paper, we define a new problem setting: Referring Human Dance Generation, which focuses on real-world dance scenarios with three important properties: (i) Faithfulness: the synthesis should retain the appearance of both human subject foreground and background from the reference image, and precisely follow the target pose; (ii) Generalizability: the model should generalize to unseen human subjects, backgrounds, and poses; (iii) Compositionality: it should allow for composition of seen/unseen subjects, backgrounds, and poses from different sources. To address these challenges, we introduce a novel approach, DISCO, which includes a novel model architecture with disentangled control to improve the faithfulness and compositionality of dance synthesis, and an effective human attribute pre-training for better generalizability to unseen humans. Extensive qualitative and quantitative results demonstrate that DISCO can generate high-quality human dance images and videos with diverse appearances and flexible motions.

cclin retweeted
Microsoft Research (@MSFTResearch)
Xuedong Huang, CTO, Azure AI, will present his keynote at CVPR 2022 @CVPR today at 5PM CT where he will share progress on the application of Integrative AI on computer vision and its promising results. Virtual conference registrants can tune in here: msft.it/6017bWAUz
cclin retweeted
Linjie (Lindsey) Li (@LINJIEFUN)
Interested in Vision Language Pre-training (VLP) but do not know where to start? Hard to track the rapid progress in VLP? Come and join us at our CVPR2022 VLP tutorial on 19th Jun (9am-5pm CDT) in person in New Orleans or virtually. vlp-tutorial.github.io #CVPR2022
cclin retweeted
Chitwan Saharia (@Chitwan_Saharia)
We are thrilled to announce Imagen, a text-to-image model with unprecedented photorealism and deep language understanding. Explore imagen.research.google and Imagen! A large rusted ship stuck in a frozen lake. Snowy mountains and beautiful sunset in the background. #imagen
cclin retweeted
Elliott / Shangzhe Wu (@elliottszwu)
Here are the video recordings of the workshop: youtube.com/watch?v=VmKc_s…
Elliott / Shangzhe Wu (@elliottszwu)

Announcing @ICCV_2021 workshop on "Unsupervised 3D Learning in the Wild" with an incredible line-up of speakers on this topic! #ICCV2021 🚩 Website: unsup3d.github.io 📅 Time: 7:00-18:00 EDT / 12:00-23:00 BST, 11 Oct 2021 Calendar: calendar.google.com/calendar/embed… (mark it down!)

cclin retweeted
AI at Meta (@AIatMeta)
We're sharing Unidentified Video Objects (UVO), a new benchmark to facilitate research in open-world segmentation, an important computer vision task that aims to detect, segment, and track all objects exhaustively in a video. Learn more: ow.ly/rH8650FQ7vp
cclin retweeted
Jia-Bin Huang (@jbhuang0604)
Writing Related Work: I enjoy reading/writing the related work section of a paper. It helps organize prior research and put the contributions of the work in proper context. But HOW? Check the thread below👇
cclin retweeted
Google AI (@GoogleAI)
Today, we are announcing the open source release of DeepLab2, a modern TensorFlow library for deep labeling that aims to facilitate future research on dense pixel labeling by providing a unified, state-of-the-art, and easy-to-use TensorFlow codebase → goo.gle/3d3SnVE
cclin retweeted