Ta-Ying Cheng

49 posts

Ta-Ying Cheng

Ta-Ying Cheng

@ChengTim0708

Research Scientist @Netflix | D.Phil. in Computer Science @UniofOxford

Inscrit le Ekim 2020
221 Abonnements176 Abonnés
Ta-Ying Cheng
Ta-Ying Cheng@ChengTim0708·
Dial roughness down, crank metallic up, stack multiple attributes at once all in a single forward pass!
Ta-Ying Cheng tweet media
English
1
0
2
102
Ta-Ying Cheng retweeté
Ta-Ying Cheng retweeté
Chun-Hsiao (Daniel) Yeh
Chun-Hsiao (Daniel) Yeh@danielyehhh·
❗️❗️ Can MLLMs understand scenes from multiple camera viewpoints — like humans? 🧭 We introduce All-Angles Bench — 2,100+ QA pairs on multi-view scenes. 📊 We evaluate 27 top MLLMs, including Gemini-2.0-Flash, Claude-3.7-Sonnet, and GPT-4o. 🌐 Project: danielchyeh.github.io/All-Angles-Ben…
Chun-Hsiao (Daniel) Yeh tweet media
English
2
26
79
18K
Ta-Ying Cheng retweeté
The Nobel Prize
The Nobel Prize@NobelPrize·
BREAKING NEWS The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Physics to John J. Hopfield and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine learning with artificial neural networks.”
The Nobel Prize tweet media
English
990
13.1K
32.4K
12.7M
Ta-Ying Cheng retweeté
AI at Meta
AI at Meta@AIatMeta·
🎥 Today we’re premiering Meta Movie Gen: the most advanced media foundation models to-date. Developed by AI research teams at Meta, Movie Gen delivers state-of-the-art results across a range of capabilities. We’re excited for the potential of this line of research to usher in entirely new possibilities for casual creators and creative professionals alike. More details and examples of what Movie Gen can do ➡️ go.fb.me/kx1nqm 🛠️ Movie Gen models and capabilities Movie Gen Video: 30B parameter transformer model that can generate high-quality and high-definition images and videos from a single text prompt. Movie Gen Audio: A 13B parameter transformer model that can take a video input along with optional text prompts for controllability to generate high-fidelity audio synced to the video. It can generate ambient sound, instrumental background music and foley sound — delivering state-of-the-art results in audio quality, video-to-audio alignment and text-to-audio alignment. Precise video editing: Using a generated or existing video and accompanying text instructions as an input it can perform localized edits such as adding, removing or replacing elements — or global changes like background or style changes. Personalized videos: Using an image of a person and a text prompt, the model can generate a video with state-of-the-art results on character preservation and natural movement in video. We’re continuing to work closely with creative professionals from across the field to integrate their feedback as we work towards a potential release. We look forward to sharing more on this work and the creative possibilities it will enable in the future.
English
529
1.5K
6.6K
2.3M
Ta-Ying Cheng
Ta-Ying Cheng@ChengTim0708·
Amazing work combining a variety of 3D models with LLMs for better spatial reasoning! @ChenyangMa119
Chenyang Ma@ChenyangMa119

#NeurIPS #NeurIPSConf Thrilled to share that our paper SpatialPIN has been accepted at #NeurIPS2024! We introduce a modular plug-and-play framework that progressively enhances VLMs' 3D reasoning by prompting and interacting with 3D foundational models. (1/8)

English
0
0
2
245
Ta-Ying Cheng
Ta-Ying Cheng@ChengTim0708·
Thrilled to share that ZeST has been accepted to #ECCV2024 !! A huge thanks to my collaborators/mentors @prafull7 , @jampani_varun, and my supervisors Niki Trigoni and Andrew Markham for the amazing support!
Ta-Ying Cheng@ChengTim0708

Today, with my collaborators @prafull7 (MIT CSAIL), @jampani_varun (@StabilityAI ), and my supervisors Niki Trigoni and Andrew Markham, we share with you ZeST, a zero-shot, training free method for image-to-image material transfer! Project Page: ttchengab.github.io/zest/ 1/8

English
1
5
28
3.2K
Prafull Sharma
Prafull Sharma@prafull7·
Graduated with a PhD in Computer Science @MIT! Grateful to my advisors and teachers who helped me learn and grow in this journey! Thanks to all my friends and family members for their support.
Prafull Sharma tweet media
English
116
45
2K
97.3K