Zineng Tang
133 posts

@ZinengTang
PhD in @Berkeley_ai and @BerkeleyNLP. Previously @UNCNLP and @MSFTResearch.

🚨Announcing Zebra-CoT, a large-scale dataset of high quality interleaved image-text reasoning traces 📜. Humans often draw visual aids like diagrams when solving problems, but existing VLMs reason mostly in pure text. 1/n



We are thrilled to announce TULIP! 🌷 tulip-berkeley.github.io A state-of-the-art vision-language encoder coupled with a generative model for stronger representation learning.







We released phi 3.5: mini+MoE+vision. A better mini model with multilingual support: huggingface.co/microsoft/Phi-… A new MoE model: huggingface.co/microsoft/Phi-… A new vision model supporting multiple images: huggingface.co/microsoft/Phi-…

🔥Excited to introduce CoDi-2! It follows complex multimodal-interleaved in-context instructions to generate any modality (text, vision, audio) in a zero/few-shot interactive way! codi-2.github.io huggingface.co/papers/2311.18… @yzy_ai @nlpyang @ChenguangZhu2 @mohitban47 🧵👇

Congratulations to professors Ron Alterovitz and Mohit Bansal (@mohitban47) on being conferred distinguished professorships by @UNC! 🎉


Day 1, Session 1: Foundational and Generative Models. Talk 3: Mohit Bansal, Professor, University of North Carolina at Chapel Hill. We’re really excited to play around with his state-of-the-art any-to-any multimodal models CoDi and CoDi-2! #indoml #ml #iitbombay #aiml







