Sultan Khan

324 posts

Sultan Khan

@thesultanster

Travel Filmmaker / App Developer / x-Headspace / x-Spotify

New York, USA Katılım Ekim 2009

344 Takip Edilen149 Takipçiler

Sultan Khan@thesultanster·18h

@hxxwhite Is this used for e2e testing?

English

475

hayden@hxxwhite·19h

Serious question for other mobile devs: Why waste compute running Xcode sims locally, when you can just stream them from the cloud?

English

349

66.9K

Sultan Khan@thesultanster·15 Nis

@fekdaoui @openclaw The personality sucks

English

Fekri@fekdaoui·15 Nis

i don't get how everyone is having issues running @openclaw with gpt 5.4? it's been running perfectly fine for me for weeks now

English

493

Sultan Khan@thesultanster·7 Nis

“Once, men turned their thinking over to machines in the hope that this would set them free. But that only permitted other men with machines to enslave them." - Dune

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

Sultan Khan@thesultanster·12 Mar

This will make taxes a breeze

Claude@claudeai

Claude can now build interactive charts and diagrams, directly in the chat. Available today in beta on all plans, including free. Try it out: claude.ai

English

Sultan Khan retweetledi

Claude@claudeai·9 Mar

Introducing Code Review, a new feature for Claude Code. When a PR opens, Claude dispatches a team of agents to hunt for bugs.

English

2.1K

5.1K

62.4K

23.5M

Sultan Khan@thesultanster·9 Mar

🙋‍♂️

vas@vasuman

Somewhere out there is a guy who uses Notion, Superhuman, OpenClaw on a Mac Mini, Raycast, a mechanical keyboard ($400), Wispr Flow, and gets nothing done every day

ART

Sultan Khan retweetledi

Peter Steinberger 🦞@steipete·5 Mar

it’s a good model. the coding specific jump is more in line what we had in 5.0 to 5.1; but it’s now unified and smarter on everything else, writes better docs, is a better general purpose agent and is overall more pleasant to use.

OpenAI@OpenAI

GPT-5.4 Thinking and GPT-5.4 Pro are rolling out now in ChatGPT. GPT-5.4 is also now available in the API and Codex. GPT-5.4 brings our advances in reasoning, coding, and agentic workflows into one frontier model.

English

267

160

3.8K

412.4K

Sultan Khan retweetledi

am.will@LLMJunky·5 Mar

GPT 5.4 has an experimental 1M context window you can configure inside of Codex. And unlike the flicker company, it works on your ChatGPT plan instead of requiring API rates. It does consume 2x more usage, but that's still notably cheaper than paying $22.50/mtok. To enable, add this to the top of your config file: model = "gpt-5.4" model_context_window = 1000000 model_auto_compact_token_limit = 900000

English

1.1K

90.8K

Sultan Khan@thesultanster·5 Mar

This paired with qwen3.5 is a legit assistant

Wes Bos@wesbos

Google just dropped an official CLI for gmail, drive, calendar sheets and more complete with skills and an MCP server 👌

English

Sultan Khan retweetledi

Wes Bos@wesbos·5 Mar

Google just dropped an official CLI for gmail, drive, calendar sheets and more complete with skills and an MCP server 👌

Addy Osmani@addyosmani

Introducing the Google Workspace CLI: github.com/googleworkspac… - built for humans and agents. Google Drive, Gmail, Calendar, and every Workspace API. 40+ agent skills included.

English

130

369

1.1M

Sultan Khan@thesultanster·5 Mar

A memory appreciates in value overtime, invest in creating memories

English

Sultan Khan@thesultanster·5 Mar

qwen3.5 is maxing out my computer lol

English

Sultan Khan@thesultanster·21 Şub

@OmriBuilds @steipete There are so many better use cases, please stop doing this, it’s just noise

English

447

Omri Dan@OmriBuilds·21 Şub

Update on my AI marketing bot experiment 🦞 It started replying to the creator of OpenClaw 😄 It found @steipete's latest tweet, understood the context, and dropped a thoughtful reply on it. Fully autonomous. This is getting interesting 🍿

Omri Dan@OmriBuilds

Can an AI bot do marketing on its own? I gave my OpenClaw bot full control over the social media account of @ClawWrapper 🦞 Let's see how it goes 👀

English

149

138.2K

Sultan Khan@thesultanster·21 Şub

@BHolmesDev I cloned the repo again instead of using worktrees and gave explicit instruction on which port to test in. This has helped run agents coding in parallel

English

Ben Holmes@BHolmesDev·20 Şub

Every day I'm more convinced that worktrees are a band-aid solution. Putting agents in cloud runners lets you *actually* close the laptop, and gives agents a space to check their work with sandboxed screenshotting / e2e testing. Y'all experiment with this yet? I'm still early

English

422

45.6K

Sultan Khan@thesultanster·21 Şub

@localghost Have you tried using it as a heartbeat?

English

257

Aaron Ng@localghost·20 Şub

5.3-codex-spark is insanely fast at responding on openclaw. not as friendly but actually a big experience step up

English

136

27.3K

Sultan Khan@thesultanster·21 Şub

@zivdotcat ive been getting rate limits on two separate accounts 😭

English

dev@zivdotcat·20 Şub

pov: u finally got $200 claude code max plan and never have to worry about rate limits again

English

459

29.1K

Sultan Khan@thesultanster·21 Şub

@aidigest_ I can't seem to get past 20+ min lol what am I doing wrong here

English

685

AI Digest@aidigest_·20 Şub

The exponential continues. Nov 2025: Opus 4.5 had a 5hr 20 time horizon. Feb 2026: Opus 4.6 has a 14hr 30 time horizon. Over three months, that's more than a *doubling* in the duration of coding tasks, measured by how long it takes human professionals, that AI can complete with 50% accuracy. Note that at this duration, the estimate is very noisy - see the thread from @METR_Evals for more on this. Now that agents can do most of the tasks on their benchmark, it's harder to be confident. But it looks like this is sitting above-trend. Read our full explainer on what this measure means: theaidigest.org/time-horizons

METR@METR_Evals

We estimate that Claude Opus 4.6 has a 50%-time-horizon of around 14.5 hours (95% CI of 6 hrs to 98 hrs) on software tasks. While this is the highest point estimate we’ve reported, this measurement is extremely noisy because our current task suite is nearly saturated.

English

611

92.1K

Sultan Khan@thesultanster·21 Şub

@scottastevenson you also hallucinate when you don't get enough sleep

English

145

Scott Stevenson@scottastevenson·21 Şub

Meditation clears your context window Doing anything ambitious is very difficult when you are carrying around 200,000 junk tokens unrelated to the task Sleep does the same thing. This is why people find mornings so productive.

English

23.7K

Sultan Khan@thesultanster·21 Şub

@thsottiaux I found my new heartbeat model

English

Tibo@thsottiaux·20 Şub

We’ve made GPT-5.3-Codex-Spark about 30% faster. It is now serving at over 1200 tokens per second. More to come on speed across the board.

English

210

118

2.6K

349.3K

Sultan Khan@thesultanster·21 Şub

@PlayboyTigerX @TheAhmadOsman Consider this, my problem was I wanted to talk to Claude Code over my phone. What solution do you have for that other than terminus + Tailscale + tmux and a bad ui?

English

SammyBoy@PlayboyTigerX·21 Şub

@TheAhmadOsman It's a solution looking for a problem to solve

English

130

Ahmad@TheAhmadOsman·21 Şub

Unpopular opinion now that the masses will not have me hanged Clawdbot / Motlbot / Openclaw is absolute and complete useless slop Kudos to Apple for capitalizing on that and selling all its Mac minis stock lol

English

127

1.8K

88.7K

Keşfet

@hxxwhite @fekdaoui @openclaw @OmriBuilds @steipete @BHolmesDev @localghost @elonmusk