Sultan Khan

324 posts

Sultan Khan banner
Sultan Khan

Sultan Khan

@thesultanster

Travel Filmmaker / App Developer / x-Headspace / x-Spotify

New York, USA Katılım Ekim 2009
344 Takip Edilen149 Takipçiler
hayden
hayden@hxxwhite·
Serious question for other mobile devs: Why waste compute running Xcode sims locally, when you can just stream them from the cloud?
English
33
13
349
66.9K
Fekri
Fekri@fekdaoui·
i don't get how everyone is having issues running @openclaw with gpt 5.4? it's been running perfectly fine for me for weeks now
English
1
0
0
493
Sultan Khan retweetledi
Claude
Claude@claudeai·
Introducing Code Review, a new feature for Claude Code. When a PR opens, Claude dispatches a team of agents to hunt for bugs.
English
2.1K
5.1K
62.4K
23.5M
Sultan Khan retweetledi
Peter Steinberger 🦞
Peter Steinberger 🦞@steipete·
it’s a good model. the coding specific jump is more in line what we had in 5.0 to 5.1; but it’s now unified and smarter on everything else, writes better docs, is a better general purpose agent and is overall more pleasant to use.
OpenAI@OpenAI

GPT-5.4 Thinking and GPT-5.4 Pro are rolling out now in ChatGPT. GPT-5.4 is also now available in the API and Codex. GPT-5.4 brings our advances in reasoning, coding, and agentic workflows into one frontier model.

English
267
160
3.8K
412.4K
Sultan Khan retweetledi
am.will
am.will@LLMJunky·
GPT 5.4 has an experimental 1M context window you can configure inside of Codex. And unlike the flicker company, it works on your ChatGPT plan instead of requiring API rates. It does consume 2x more usage, but that's still notably cheaper than paying $22.50/mtok. To enable, add this to the top of your config file: model = "gpt-5.4" model_context_window = 1000000 model_auto_compact_token_limit = 900000
am.will tweet media
English
61
60
1.1K
90.8K
Sultan Khan
Sultan Khan@thesultanster·
A memory appreciates in value overtime, invest in creating memories
English
0
0
0
2
Sultan Khan
Sultan Khan@thesultanster·
qwen3.5 is maxing out my computer lol
Sultan Khan tweet media
English
0
0
0
6
Sultan Khan
Sultan Khan@thesultanster·
@BHolmesDev I cloned the repo again instead of using worktrees and gave explicit instruction on which port to test in. This has helped run agents coding in parallel
English
0
0
0
38
Ben Holmes
Ben Holmes@BHolmesDev·
Every day I'm more convinced that worktrees are a band-aid solution. Putting agents in cloud runners lets you *actually* close the laptop, and gives agents a space to check their work with sandboxed screenshotting / e2e testing. Y'all experiment with this yet? I'm still early
English
90
8
422
45.6K
Aaron Ng
Aaron Ng@localghost·
5.3-codex-spark is insanely fast at responding on openclaw. not as friendly but actually a big experience step up
English
19
4
136
27.3K
Sultan Khan
Sultan Khan@thesultanster·
@zivdotcat ive been getting rate limits on two separate accounts 😭
English
0
0
0
51
dev
dev@zivdotcat·
pov: u finally got $200 claude code max plan and never have to worry about rate limits again
English
45
18
459
29.1K
Sultan Khan
Sultan Khan@thesultanster·
@aidigest_ I can't seem to get past 20+ min lol what am I doing wrong here
English
1
0
0
685
AI Digest
AI Digest@aidigest_·
The exponential continues. Nov 2025: Opus 4.5 had a 5hr 20 time horizon. Feb 2026: Opus 4.6 has a 14hr 30 time horizon. Over three months, that's more than a *doubling* in the duration of coding tasks, measured by how long it takes human professionals, that AI can complete with 50% accuracy. Note that at this duration, the estimate is very noisy - see the thread from @METR_Evals for more on this. Now that agents can do most of the tasks on their benchmark, it's harder to be confident. But it looks like this is sitting above-trend. Read our full explainer on what this measure means: theaidigest.org/time-horizons
AI Digest tweet media
METR@METR_Evals

We estimate that Claude Opus 4.6 has a 50%-time-horizon of around 14.5 hours (95% CI of 6 hrs to 98 hrs) on software tasks. While this is the highest point estimate we’ve reported, this measurement is extremely noisy because our current task suite is nearly saturated.

English
20
65
611
92.1K
Scott Stevenson
Scott Stevenson@scottastevenson·
Meditation clears your context window Doing anything ambitious is very difficult when you are carrying around 200,000 junk tokens unrelated to the task Sleep does the same thing. This is why people find mornings so productive.
English
30
90
1K
23.7K
Tibo
Tibo@thsottiaux·
We’ve made GPT-5.3-Codex-Spark about 30% faster. It is now serving at over 1200 tokens per second. More to come on speed across the board.
English
210
118
2.6K
349.3K
Sultan Khan
Sultan Khan@thesultanster·
@PlayboyTigerX @TheAhmadOsman Consider this, my problem was I wanted to talk to Claude Code over my phone. What solution do you have for that other than terminus + Tailscale + tmux and a bad ui?
English
0
0
0
31
Ahmad
Ahmad@TheAhmadOsman·
Unpopular opinion now that the masses will not have me hanged Clawdbot / Motlbot / Openclaw is absolute and complete useless slop Kudos to Apple for capitalizing on that and selling all its Mac minis stock lol
English
127
41
1.8K
88.7K