Hexiang (Frank) Hu

847 posts

Hexiang (Frank) Hu banner
Hexiang (Frank) Hu

Hexiang (Frank) Hu

@hexiang

@xAI: making @grok imagine | Previously @GoogleDeepMind: Gemini & Imagen | Seattle ← Los angeles ← Hangzhou ← Wenzhou

Earth Katılım Eylül 2014
801 Takip Edilen4.3K Takipçiler
AnonymousSulla
AnonymousSulla@AnonymousSulla·
@hexiang It’s absolutely incredible. I’d love a bit more documentation and such or a tutorial or smth to make the learning curve shallower
English
1
0
1
55
Mark Kretschmann
Mark Kretschmann@mark_k·
@hexiang 1) The default view you click on "Agent" is very confusing. You need to type something into the text field and then it starts working, without the chance to upload images first. 2) Make the prompt text editable and easily copyable. 3) Unclear what the "Text" function is for.
English
2
0
6
445
Hexiang (Frank) Hu retweetledi
Bill Yuchen Lin
Bill Yuchen Lin@billyuchenlin·
smarter and cheaper, enjoy!
Artificial Analysis@ArtificialAnlys

xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20 The release of Grok 4.3 places @xAI just above Muse Spark and Claude Sonnet 4.6 on the Intelligence Index, and a 4 points ahead of the latest version of Grok 4.20. Grok 4.3 improves its Artificial Analysis Intelligence Index score while reducing cost to run the benchmark suite. Key Takeaways: ➤ Grok 4.3 improves on cost-per-intelligence relative to Grok 4.20 0309 v2: it scores higher on the Intelligence Index while costing less to run the full benchmark suite. Grok 4.3 costs $395 to run the Artificial Analysis Intelligence Index, around 20% lower than Grok 4.20 0309 v2, despite using more output tokens. This makes it one of the lower-cost models at its intelligence level ➤ Large increase in real world agentic task performance: The largest single benchmark improvement is on GDPval-AA, where Grok 4.3 scores an ELO of 1500, up 321 points from Grok 4.20 0309 v2’s score of 1179 Grok 4.3, surpassing Gemini 3.1 Pro Preview, Muse Spark, Gpt-5.4 mini (xhigh), and Kimi K2.5. Grok 4.3 narrows the gap to the leading model on GDPval-AA, but still trails GPT-5.5 (xhigh) by 276 Elo points, with an expected win rate of ~17% against GPT-5.5 (xhigh) under the standard Elo formula ➤ Grok 4.3’s performs strongly on instruction following and agentic customer support tasks. It gains 5 points on 𝜏²-Bench Telecom to reach 98%, in line with GLM-5.1. Grok 4.3 maintains an 81% IFBench score from Grok 4.20 0309 v2 ➤ Gains 8 points on AA-Omniscience Accuracy, but at the cost of lower AA-Omniscience Non-Hallucination Rate of 8 points, so Grok 4.20 0309 v2 still leads AA-Omniscience Non-Hallucination Rate, followed by MiMo-V2.5-Pro, in line with Grok 4.3 Congratulations to @xAI and @elonmusk on the impressive release!

English
5
7
110
4.2K
Yihe Deng
Yihe Deng@Yihe__Deng·
Last day at xAI. For a new grad, the past six months have been an irreplaceable experience. I feel fortunate to have made the decision to join this journey with xAI, and grateful for how much I was able to learn here in such a short, dense period of time. I'm proud of what we built, and what the multimodal team continues to build. I have deep faith in this team. No matter where I go next, I'll always look forward to seeing what my friends here pull off and bring into the world. I'm especially grateful to my captains along the way -- people I look up to, trust deeply, and who placed trust in my potential. I truly appreciate all the friends I met here, and the time we spent building together. And thanks xAI for the opportunity, and for giving me the space to learn, contribute, and grow. In the end, the greatest treasure is indeed the journey itself: the problems worth solving, and the people worth building with. Now, it's time to step into the uncertainty of what comes next.
English
48
6
546
34.9K
Hexiang (Frank) Hu retweetledi
Dogan Ural
Dogan Ural@doganuraldesign·
Grok Imagine Agent Mode (beta) is here! - Brainstorm with agents on an infinite canvas - Create & edit multiple images at once - Turn images into videos & stitch them together - Trim, fade, crop, export, and more… Go to the web app and give it a try.
Dogan Ural tweet media
English
36
31
339
21.5K
Jon Barron
Jon Barron@jon_barron·
"World Models" discourse will now be paused, pending the invention of terraforming. If we had named LLMs "Thought Models" we'd never get past the philosophical debates around what is *actually* happening under the hood. Just name your model according to its inputs or outputs.
English
12
6
129
13.8K
Jack Cai
Jack Cai@JackCaiXun·
I resigned from xAI yesterday. But I still dreamt about it whenever I take a nap, no matter finished work, ongoing, or something I wanted to do but didn’t have chance to. An unforgettable experience like this can rewire your brain and gives you a chill whenever you think about it. I worked with a lot of brilliant minds at xAI, not just within Grok Imagine, and I want to thank you all for making it like a family to me. Thanks Elon, you built this platform where miracles happened. I will miss xAI, but I’m sure the world outside is just as exciting, and I can’t wait to take on this journey. Lastly, I want to thank my wife for her endless support. Last year, I hopped on an airplane to SF not knowing what was coming to me. Her support helped me to go through the toughest time, and I can’t wait to reunite with her again! — sent from anonymous flight
English
63
11
640
50.1K
Hexiang (Frank) Hu retweetledi
Grok Imagine
Grok Imagine@imagine·
Grok Imagine now has dramatically improved lip sync and sharper audio quality on all image-to-video generations. Dialogue tracks the mouth. Sound matches the scene. Your videos look and sound the way you imagined them.
English
1.8K
3.2K
30.7K
213M
Hexiang (Frank) Hu retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
This is Decoupled DiLoCo: our new resilient and flexible way to train advanced AI models across multiple data centres. 🧵
GIF
English
85
170
1.3K
160.8K
Boyuan Chen
Boyuan Chen@BoyuanChen0·
This is what I’ve been cooking in the past 4 months . GPT Image 2 is over a massive 240 elo jump over the second place model, marking the biggest jump bigger than the rest of the leaderboard combined
Arena.ai@arena

Exciting news - GPT-Image-2 by @OpenAI has claimed the #1 spot across all Image Arena leaderboards! A clean sweep with a record-breaking +242 point lead in Text-to-Image - the largest gap we’ve seen to date. - #1 Text-to-Image (1512), +242 over #2 (Nano-banana-2 with web-search aka gemini-3.1-flash-image) - #1 Single-Image Edit (1513), +125 over #2 (Nano-banana-pro aka gemini-3-pro-image) - #1 Multi-Image Edit (1464), +90 over #2 (Nano-banana-2) No model has dominated Image Arena with margins this wide. Huge congratulations to @OpenAI on this major breakthrough in image generation! More performance breakdowns by category in the thread below.

English
75
77
1.6K
147.2K