
Trevor Back
504 posts

Trevor Back
@TrevorBack
MAI Multimodal - Product. Previously @GoogleDeepMind, @ShiftLabAI, @Speechmatics. Speech is core to the future AGI stack.


Sama during trial: "I don't think Mr. Musk understood how to run a good research lab" Q: How did Musk's management affect OpenAI's research culture? Sama: "He had demotivated some of our most key researchers... required Greg and Ilya to make a list of the researchers and list out their accomplishments and sort of stack rank them... I take a chainsaw through a bunch and that did huge damage for a long time to the culture of the organization... this idea that you constantly have to show your results and if they're not good enough on a short period, you're going to get fired... that really didn't work for the kind of research we went on to successfully do."















Let’s dive deeper into the massive improvements between MAI-Image-2 vs. MAI-Image-1 by @MicrosoftAI. MAI-Image-2 shows significant gains across all sub-categories for Text-to-Image: Gains across all 7 sub-categories in order of magnitude: - Text Rendering (+115 pts) - Portraits (+105 pts) - Product, Branding & Commercial Design (+102 pts) - Photorealistic & Cinematic Imagery (+97 pts) - 3D Imaging & Modeling (+92 pts) - Art (+87 pts) - Cartoon, Anime & Fantasy (+81 pts)




MAI-Image-2 debuts at #5 in the Image Arena! Highlights: - #5 in Text-to-Image overall - #5 for 3D Imaging & Modeling, Cartoon, Anime & Fantasy, Photorealistic & Cinematic Imagery, Art and Portraits - #6 for Product, Branding & Commercial Design Congrats to the @MicrosoftAI team on this milestone!


