

Haoxuan You
12 posts

@XyouH
Research Scientist @ Apple AI/ML. Prev CS Ph.D. @Columbia University.










As Apple Intelligence is rolling out to our beta users today, we are proud to present a technical report on our Foundation Language Models that power these features on devices and cloud: machinelearning.apple.com/research/apple…. 🧵



Congrats to @LiLiunian for winning Google PhD Fellowship! 🎉🥳🎊 Harold led pioneering efforts in vision-language research, including developing notable models such as VisualBERT, CLIP, and recently introduced Desco. He will be on the market this year! @uclanlp @UCLAengineering

🚀🚀Introducing Ferret, a new MLLM that can refer and ground anything anywhere at any granularity. 📰arxiv.org/abs/2310.07704 1⃣ Ferret enables referring of an image region at any shape 2⃣ It often shows better precise understanding of small image regions than GPT-4V (sec 5.6)

