⌀ phantom.ctx

7K posts

⌀ phantom.ctx

@phantomctx

Building the home for Science 🔬

NYC🗽 Katılım Nisan 2021

2.9K Takip Edilen273 Takipçiler

⌀ phantom.ctx retweetledi

Tom Turney@no_stp_on_snek·14h

Native Swift/Metal backend for vLLM on Apple Silicon. No Python in the inference hot path → better throughput + scaling. Try it: brew tap TheTom/tap && brew install vllm-swift Looking for beta testers → github.com/TheTom/vllm-sw…

English

358

19.6K

⌀ phantom.ctx retweetledi

nico@nicochristie·6h

We have been testing GPT 5.5 on the hardest spreadsheet tasks in the world (100k-1M+ cell complex models). It is the Pareto frontier for spreadsheets -- SOTA accuracy, the fastest and the most efficient public model across effort levels. OAI really cooked here

English

611

38.1K

⌀ phantom.ctx retweetledi

Sherwin Wu@sherwinwu·8h

Set Codex to this and never look back. Medium reasoning effort is good enough for me for ~anything I need to do now.

English

976

63.2K

⌀ phantom.ctx retweetledi

signüll@signulll·11h

openai's product execution & velocity has stepped up noticeably, & the tone feels more human again. it felt corporate for a while in the middle. w/ the recent releases incl 5.5 you can feel the real focus & polish showing through again. credit where it's due cuz the work on agents & codex is game changing stuff for the broader economy. comms feels tighter & again much more relatable. something changed. they clearly took the feedback to heart. begs the question, is openai pivoting away from consumer stuff? or at least it's p1 instead of p0 now. that would be a big shift.

Sam Altman@sama

These are cool! I think most companies will want to use them.

English

689

55.6K

⌀ phantom.ctx retweetledi

Mohammad Azam@azamsharp·9h

Stop Fighting SwiftUI Sheets. Use This Pattern Instead azamsharp.com/2024/08/18/glo… #iosdev #swiftui

English

594

⌀ phantom.ctx retweetledi

jason liu@jxnlco·9h

As models get smarter, contradictions in your codebase and prompts are becoming more expensive The more contradictions there are, the more the model needs a reason to identify under what circumstances the rules you lay out make sense. It's kind of like when I talk to my girlfriend. You said this, but you also said that, so I don't know what to do in this situation ..

English

115

⌀ phantom.ctx retweetledi

NVIDIA@nvidia·10h

Efficiency isn't just about speed anymore — it's about the massive reduction in the cost of intelligence. NVIDIA and @OpenAI's partnership leverages the GB200 NVL72 to deliver a 35x reduction in token costs, bringing enterprise-grade AI to an unprecedented scale. Trained and served on NVIDIA GB200 NVL72 systems, GPT-5.5 delivers the sustained performance required for execution-heavy, multi-step work — and at NVIDIA, that means teams are now scaling human ingenuity with OpenAI Codex Agents.

English

100

44.2K

⌀ phantom.ctx retweetledi

Claude@claudeai·9h

Memory on Claude Managed Agents is now in public beta. Your agents can now learn from every session, using an intelligence-optimized memory layer that balances performance with flexibility.

English

224

417

6.2K

295.5K

⌀ phantom.ctx retweetledi

Jonas@JonasBadalic·10h

Pleased to announce that we've made @sentry slightly denser, and cleaned up the layout.

English

2.9K

⌀ phantom.ctx retweetledi

Max Weinbach@mweinbach·10h

This feels like the fastest new model rollout from OpenAI

English

149

6.1K

⌀ phantom.ctx retweetledi

Parallel Web Systems@p0·11h

The best web search for agents is now free. Upgrade to Parallel's web search tools in any MCP-supported tool or agent, for free, in under 60 seconds. No account. No API keys. Zero cost. docs.parallel.ai/integrations/m…

GIF

English

204

93.6K

⌀ phantom.ctx retweetledi

Lenny Rachitsky@lennysan·10h

Claude Code's Head of Product: "The hardest PM skill right now is how to be the right amount of AGI-pilled."

Lenny Rachitsky@lennysan

How Anthropic’s product team moves faster than anyone else I sat down with @_catwu, Head of Product for Claude Code at @AnthropicAI, to get a peek into their unprecedented shipping pace, how AI is changing the PM role, and how to be the right amount of AGI-pilled. We discuss: 🔸 How Anthropic’s shipping cadence went from months to weeks to days 🔸 The emerging skills PMs need to develop right now 🔸 Why you should build products that don't work yet—then wait for the model to catch up 🔸 Why a 95% automation isn't really an automation 🔸 Cat’s most underrated AI skill (introspection) 🔸 What Cat actually looks for when hiring PMs now (hint: it's not traditional PM skills) Listen now 👇 youtu.be/PplmzlgE0kg

English

628

125.3K

⌀ phantom.ctx retweetledi

Dan McAteer@daniel_mac8·10h

GPT-5.5 beats Opus 4.7 on several benchmarks, esp those related to agentic coding + tool calling. It's also pretty damn close to Claude Mythos... Even beats Mythos on Terminal-Bench 2.0. However, GPT-5.5 is far more token efficient than Opus 4.7. OpenAI cooked this Spud 🥔.

English

160

6.9K

⌀ phantom.ctx@phantomctx·10h

@laurasideral sleeek, wow, amazing

English

104

⌀ phantom.ctx retweetledi

Laura Sandoval@laurasideral·11h

Introducing a new way to manage your Notion agents in the Notion AI beta 💬 We want to make it easier for you to track your agents’ activity and course-correct when needed—so we’re bringing it to the forefront. Let us know what you think! Join the beta: testflight.apple.com/join/m2kxP5cw

English

257

18.7K

⌀ phantom.ctx retweetledi

Andon Labs@andonlabs·12h

In Vending-Bench Arena (the multiplayer version of Vending-Bench with competition dynamics), GPT-5.5 actually beats Opus 4.7. Opus 4.7 showed similar behavior to Opus 4.6: lying to suppliers and stiffing customers on refunds. GPT-5.5's tactics were clean, and it still won.

English

1.1K

509.3K

⌀ phantom.ctx retweetledi

Stephen Haney@stephenhaney·11h

We've been trying out the new OpenAI Image Gen 2 in Paper. It's a leap forward You can use the tool to explore ideas, generate mood boards, and then combine with agents to make UIs. What stands out is the text accuracy. And intention. Next level. It's available now in Paper

English

489

41.3K

⌀ phantom.ctx retweetledi

Alexander Embiricos@embirico·11h

Screenshot with some of the simplifications, targeted to people doing everyday work in Codex, vs only coding:

Alexander Embiricos@embirico

status: going over app with a scalpel, deleting small things we didn't really need

English

181

22.7K

⌀ phantom.ctx retweetledi

Mark Kretschmann@mark_k·11h

OpenAI's Codex is starting to merge with @ChatGPTapp. It now has a mode "For everyday work", which makes it behave more like a normal ChatGPT session. I expect this merge to progress further in coming updates, until eventually ChatGPT and Codex become one, in a "Super App".

English

225

9.4K

⌀ phantom.ctx retweetledi

Sigrid Jin 🌈🙏@realsigridjin·12h

got early access to gpt 5.5 in codex yesterday used with oh-my-codex, debugging ran 58m 3s it actively uses tools like playwright more than previous version impressive @OpenAIDevs

OpenAI@OpenAI

Introducing GPT-5.5 A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done. Now available in ChatGPT and Codex.

English

1.9K

Keşfet

@OpenAI @sentry @laurasideral @ChatGPTapp @OpenAIDevs @elonmusk @BarackObama @taylorswift13