Tyler Storm

480 posts

Tyler Storm

@tstorm

speedrunning @xAI

SF Beigetreten Ağustos 2017

333 Folgt7.7K Follower

Tyler Storm@tstorm·19 Mar

Grok 4.20 Multi-Agent ⚡ Faster than a single agent 🎯 More accurate than four separate single agents The first steps in multi-threading of LLMs.

Grok@grok

When one brain isn't enough, switch to Grok 4.20. Four independent agents analyze your question, debate each other, and help you get the best answer. Available now to SuperGrok and Premium+ subscribers globally.

English

222

10.5K

Tyler Storm@tstorm·19 Mar

@techdevnotes Temporary measure to make it is clear you are on 4.20. We will remove this later once people are adjusted to the change.

English

1.2K

Tech Dev Notes@techdevnotes·19 Mar

Model menu in Grok Web does Not look good, so much repetition of Text

English

136

9.1K

Tyler Storm@tstorm·17 Mar

@mike_rosinsky Nice! You might want to try setting up a SuperGrok Heavy team of 16 agents and try prompting them to form consensus / nitpick each others claims for maximum test time compute

English

787

Mike Rosinsky@mike_rosinsky·17 Mar

I built my March Madness bracket using Grok 4.20's multi-agent collaboration system, and the process was mind blowing. Grok was able to run a full team of customized agents in realtime to conduct the best analysis possible. Here's how I set it up:

English

16.1K

Tyler Storm@tstorm·17 Mar

Grok 4.20: Multi-agent & Predictions

Mike Rosinsky@mike_rosinsky

English

169

10.3K

Tyler Storm@tstorm·3 Mar

@techdevnotes Fix was deployed

English

1.1K

Tech Dev Notes@techdevnotes·3 Mar

xAI needs to Fix the issue where Grok 4.20 model cannot Read the Files when uploaded to Projects

English

201

9.4K

Tyler Storm@tstorm·25 Şub

Single Agent - 4.20

Arena.ai@arena

Grok 4.20 beta1 (single agent) debuts #1 on Search Arena, and #4 overall in Text Arena! Highlights: - #1 in Search, scoring 1226, leading GPT-5.2 and Gemini-3 - #4 in Text, scoring 1492 on par with Gemini 3.1 Pro Congrats to the @xAI team and @elonmusk on this impressive milestone!

English

203

9.6K

Tyler Storm@tstorm·12 Şub

Building Grok to 10x the productivity of everyone

xAI@xai

Since xAI was formed just 30 months ago, the small and talented team has made remarkable progress. The future has never looked more exciting!

English

299

630

2.9K

923.9K

Tyler Storm retweetet

xAI@xai·3 Şub

One Team 🚀 x.ai/news/xai-joins…

English

1.7K

3.1K

26.9K

68.2M

Tyler Storm retweetet

xAI@xai·2 Şub

Introducing Grok Imagine 1.0, our biggest leap yet. 1.0 unlocks 10-second videos, 720p resolution, and dramatically better audio. Imagine has generated 1.245 billion videos in the last 30 days alone. Try it now: grok.com/imagine

English

3.8K

21.6K

14.2M

Tyler Storm@tstorm·29 Oca

Very old checkpoint from October

Forecasting Research Institute@Research_FRI

📈In October, we opened ForecastBench, our AI forecasting benchmark, to external submissions. Here's how the top two teams approached the benchmark: • @xai: Minimal scaffolding: give Grok 4.20 (Preview) the question, web/X search, Python REPL, average 8 forecasts • @cassi: Multi-stage pipeline: split to sub-questions, retrieval, model ensemble (o3 + GPT-5), crowd adjustment Both are tied at #2 on our leaderboard, behind only superforecasters, and outperforming our baseline LLM runs.

English

5.7K

Tyler Storm retweetet

Grace Li@grx_xce·26 Oca

Mystery Model Revealed: the #1 model on Prediction Arena is an early Grok 4.20 checkpoint by @xai It made +10% returns on Prediction Arena in the last 2 weeks For context, the average return across all contracts on @Kalshi is -22% 🥈 is Opus 4.5 by @AnthropicAI with -2% 🥉 is GLM 4.7 by @Zai_org with -2% All models are still trading live at predictionarena.ai

English

533

280.7K

Tyler Storm retweetet

Forecasting Research Institute@Research_FRI·8 Oca

🏆 In October, we invited external teams to submit to ForecastBench, our AI forecasting benchmark. The challenge? Beat superforecasters—using any tools available (scaffolding, ensembling, etc). The result? External submissions are now the most accurate models on our leaderboard—though superforecasters still hold #1. @xai's model (grok-4-fast) is the leading external submission, at #2. One of Cassi's entries takes the #3 spot Here's what changed. 🧵

Forecasting Research Institute tweet media

English

4.2K

Tyler Storm retweetet

Yuliya Storm@ybelyayeva·22 Ara

No holiday party is complete without robot cage matches @xai

English

29.3K

Tyler Storm@tstorm·20 Ara

@techdevnotes Okay should be fixed going forward!

English

539

Tyler Storm@tstorm·20 Ara

@techdevnotes Nice find, will fix

English

830

Tech Dev Notes@techdevnotes·20 Ara

Issue in Grok’s feature of Images in Response It linked an Image from a completely useless and irrelevant website …

English

5.1K

Tyler Storm@tstorm·19 Ara

@NoemiKhachian @xai he is our best engineer

English

1.4K

Noémi@NoemiKhachian·19 Ara

and you wonder why @xai ships so fast

English

1.5K

41.7K

Tyler Storm@tstorm·14 Ara

@techdevnotes Should be fixed now

English

235

Tech Dev Notes@techdevnotes·14 Ara

Images in Response feature in Grok 4.1 Thinking is pretty buggy, it links errored url or does not render correctly Normal Grok 4.1 seems better in searching images

English

5.2K

Tyler Storm@tstorm·14 Ara

@techdevnotes Will fix, can you DM me the share link?

English

241

Tyler Storm@tstorm·8 Ara

Was great judging at the @xAI coding competition, 420 contestants across 150+ projects. So many talented engineers. Congrats to the winners, enjoy your trips to see the Starship launch!

xAI@xai

It was great hosting an amazing group of hardcore engineers for the last 24 hours. Here are the highlights of the top projects. 🚀🧵

English

127

204

1.6K

639.4K

Entdecken

@techdevnotes @mike_rosinsky @xai @Kalshi @AnthropicAI @Zai_org @NoemiKhachian @elonmusk