Tyler Storm

480 posts

@tstorm

speedrunning @xAI

SF · Joined August 2017
333 Following · 7.7K Followers
Tyler Storm (@tstorm)
@techdevnotes Temporary measure to make it clear you are on 4.20. We will remove this later once people have adjusted to the change.
5 replies · 0 reposts · 44 likes · 1.2K views
Tech Dev Notes (@techdevnotes)
The model menu in Grok Web does not look good; there is so much repetition of text.
[image]
22 replies · 5 reposts · 136 likes · 9.1K views
Tyler Storm (@tstorm)
@mike_rosinsky Nice! You might want to try setting up a SuperGrok Heavy team of 16 agents and prompting them to form consensus and nitpick each other's claims for maximum test-time compute.
1 reply · 0 reposts · 15 likes · 787 views
Mike Rosinsky (@mike_rosinsky)
I built my March Madness bracket using Grok 4.20's multi-agent collaboration system, and the process was mind-blowing. Grok was able to run a full team of customized agents in real time to conduct the best analysis possible. Here's how I set it up:
[image]
8 replies · 4 reposts · 84 likes · 16.1K views
Tech Dev Notes (@techdevnotes)
xAI needs to fix the issue where the Grok 4.20 model cannot read files uploaded to Projects.
[image]
20 replies · 11 reposts · 201 likes · 9.4K views
Tyler Storm retweeted
xAI (@xai)
Introducing Grok Imagine 1.0, our biggest leap yet. 1.0 unlocks 10-second videos, 720p resolution, and dramatically better audio. Imagine has generated 1.245 billion videos in the last 30 days alone. Try it now: grok.com/imagine
3.8K replies · 3K reposts · 21.6K likes · 14.2M views
Tyler Storm (@tstorm)
Very old checkpoint from October
Quoting Forecasting Research Institute (@Research_FRI):

📈 In October, we opened ForecastBench, our AI forecasting benchmark, to external submissions. Here's how the top two teams approached the benchmark:
• @xai: Minimal scaffolding: give Grok 4.20 (Preview) the question, web/X search, Python REPL, average 8 forecasts
• @cassi: Multi-stage pipeline: split to sub-questions, retrieval, model ensemble (o3 + GPT-5), crowd adjustment
Both are tied at #2 on our leaderboard, behind only superforecasters, and outperforming our baseline LLM runs.
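
The "average 8 forecasts" step in the quoted post can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the function names, the sample probabilities, and the use of a plain arithmetic mean are hypothetical, and the post does not say how xAI actually combined or scored its forecasts. The Brier score shown is a common metric for probability forecasts, included only to show how a combined forecast could be evaluated.

```python
from statistics import mean


def average_forecasts(probabilities):
    """Combine independent probability forecasts with an arithmetic mean.

    Hypothetical helper: the source only says "average 8 forecasts".
    """
    if not probabilities:
        raise ValueError("need at least one forecast")
    for p in probabilities:
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"forecast {p} is not a probability")
    return mean(probabilities)


def brier_score(forecast, outcome):
    """Squared error between a probability and a 0/1 outcome (lower is better)."""
    return (forecast - outcome) ** 2


# Eight made-up samples for one binary question (illustrative values only).
samples = [0.62, 0.70, 0.55, 0.66, 0.71, 0.60, 0.68, 0.64]
combined = average_forecasts(samples)

# Score the combined forecast against a hypothetical resolved outcome of 1.
score = brier_score(combined, 1)
```

Averaging several samples tends to damp the noise in any single forecast, which is one plausible reason a minimal setup like this can be competitive with heavier pipelines.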

4 replies · 2 reposts · 62 likes · 5.7K views
Tyler Storm retweeted
Grace Li (@grx_xce)
Mystery Model Revealed: the #1 model on Prediction Arena is an early Grok 4.20 checkpoint by @xai.
It made +10% returns on Prediction Arena in the last 2 weeks.
For context, the average return across all contracts on @Kalshi is -22%.
🥈 is Opus 4.5 by @AnthropicAI with -2%
🥉 is GLM 4.7 by @Zai_org with -2%
All models are still trading live at predictionarena.ai
[image]
38 replies · 66 reposts · 533 likes · 280.7K views
Tyler Storm retweeted
Forecasting Research Institute (@Research_FRI)
🏆 In October, we invited external teams to submit to ForecastBench, our AI forecasting benchmark. The challenge? Beat superforecasters, using any tools available (scaffolding, ensembling, etc.). The result? External submissions are now the most accurate models on our leaderboard, though superforecasters still hold #1. @xai's model (grok-4-fast) is the leading external submission, at #2. One of Cassi's entries takes the #3 spot. Here's what changed. 🧵
[image]
2 replies · 11 reposts · 29 likes · 4.2K views
Tyler Storm retweeted
Yuliya Storm (@ybelyayeva)
No holiday party is complete without robot cage matches @xai
[3 images]
1 reply · 2 reposts · 81 likes · 29.3K views
Tech Dev Notes (@techdevnotes)
Issue with Grok's Images in Response feature: it linked an image from a completely useless and irrelevant website …
[image]
5 replies · 1 repost · 40 likes · 5.1K views
Noémi (@NoemiKhachian)
and you wonder why @xai ships so fast
[image]
71 replies · 26 reposts · 1.5K likes · 41.7K views
Tech Dev Notes (@techdevnotes)
The Images in Response feature in Grok 4.1 Thinking is pretty buggy: it links to broken URLs or does not render correctly. Normal Grok 4.1 seems better at searching images.
[image]
4 replies · 1 repost · 38 likes · 5.2K views