⌀ phantom.ctx

7K posts

⌀ phantom.ctx banner
⌀ phantom.ctx

⌀ phantom.ctx

@phantomctx

Building the home for Science 🔬

NYC🗽 Katılım Nisan 2021
2.9K Takip Edilen273 Takipçiler
⌀ phantom.ctx retweetledi
Tom Turney
Tom Turney@no_stp_on_snek·
Native Swift/Metal backend for vLLM on Apple Silicon. No Python in the inference hot path → better throughput + scaling. Try it: brew tap TheTom/tap && brew install vllm-swift Looking for beta testers → github.com/TheTom/vllm-sw…
Tom Turney tweet media
English
17
44
358
19.6K
⌀ phantom.ctx retweetledi
nico
nico@nicochristie·
We have been testing GPT 5.5 on the hardest spreadsheet tasks in the world (100k-1M+ cell complex models). It is the Pareto frontier for spreadsheets -- SOTA accuracy, the fastest and the most efficient public model across effort levels. OAI really cooked here
nico tweet media
English
8
43
611
38.1K
⌀ phantom.ctx retweetledi
Sherwin Wu
Sherwin Wu@sherwinwu·
Set Codex to this and never look back. Medium reasoning effort is good enough for me for ~anything I need to do now.
Sherwin Wu tweet media
English
64
14
976
63.2K
⌀ phantom.ctx retweetledi
signüll
signüll@signulll·
openai's product execution & velocity has stepped up noticeably, & the tone feels more human again. it felt corporate for a while in the middle. w/ the recent releases incl 5.5 you can feel the real focus & polish showing through again. credit where it's due cuz the work on agents & codex is game changing stuff for the broader economy. comms feels tighter & again much more relatable. something changed. they clearly took the feedback to heart. begs the question, is openai pivoting away from consumer stuff? or at least it's p1 instead of p0 now. that would be a big shift.
Sam Altman@sama

These are cool! I think most companies will want to use them.

English
31
25
689
55.6K
⌀ phantom.ctx retweetledi
jason liu
jason liu@jxnlco·
As models get smarter, contradictions in your codebase and prompts are becoming more expensive The more contradictions there are, the more the model needs a reason to identify under what circumstances the rules you lay out make sense. It's kind of like when I talk to my girlfriend. You said this, but you also said that, so I don't know what to do in this situation ..
English
9
3
115
6K
⌀ phantom.ctx retweetledi
NVIDIA
NVIDIA@nvidia·
Efficiency isn't just about speed anymore — it's about the massive reduction in the cost of intelligence. NVIDIA and @OpenAI's partnership leverages the GB200 NVL72 to deliver a 35x reduction in token costs, bringing enterprise-grade AI to an unprecedented scale. Trained and served on NVIDIA GB200 NVL72 systems, GPT-5.5 delivers the sustained performance required for execution-heavy, multi-step work — and at NVIDIA, that means teams are now scaling human ingenuity with OpenAI Codex Agents.
NVIDIA tweet media
English
47
100
1K
44.2K
⌀ phantom.ctx retweetledi
Claude
Claude@claudeai·
Memory on Claude Managed Agents is now in public beta. Your agents can now learn from every session, using an intelligence-optimized memory layer that balances performance with flexibility.
Claude tweet media
English
224
417
6.2K
295.5K
⌀ phantom.ctx retweetledi
Jonas
Jonas@JonasBadalic·
Pleased to announce that we've made @sentry slightly denser, and cleaned up the layout.
Jonas tweet media
English
3
4
44
2.9K
⌀ phantom.ctx retweetledi
Max Weinbach
Max Weinbach@mweinbach·
This feels like the fastest new model rollout from OpenAI
English
10
4
149
6.1K
⌀ phantom.ctx retweetledi
Parallel Web Systems
The best web search for agents is now free. Upgrade to Parallel's web search tools in any MCP-supported tool or agent, for free, in under 60 seconds. No account. No API keys. Zero cost. docs.parallel.ai/integrations/m…
GIF
English
11
24
204
93.6K
⌀ phantom.ctx retweetledi
Lenny Rachitsky
Lenny Rachitsky@lennysan·
Claude Code's Head of Product: "The hardest PM skill right now is how to be the right amount of AGI-pilled."
Lenny Rachitsky@lennysan

How Anthropic’s product team moves faster than anyone else I sat down with @_catwu, Head of Product for Claude Code at @AnthropicAI, to get a peek into their unprecedented shipping pace, how AI is changing the PM role, and how to be the right amount of AGI-pilled. We discuss: 🔸 How Anthropic’s shipping cadence went from months to weeks to days 🔸 The emerging skills PMs need to develop right now 🔸 Why you should build products that don't work yet—then wait for the model to catch up 🔸 Why a 95% automation isn't really an automation 🔸 Cat’s most underrated AI skill (introspection) 🔸 What Cat actually looks for when hiring PMs now (hint: it's not traditional PM skills) Listen now 👇 youtu.be/PplmzlgE0kg

English
32
58
628
125.3K
⌀ phantom.ctx retweetledi
Dan McAteer
Dan McAteer@daniel_mac8·
GPT-5.5 beats Opus 4.7 on several benchmarks, esp those related to agentic coding + tool calling. It's also pretty damn close to Claude Mythos... Even beats Mythos on Terminal-Bench 2.0. However, GPT-5.5 is far more token efficient than Opus 4.7. OpenAI cooked this Spud 🥔.
Dan McAteer tweet mediaDan McAteer tweet media
English
26
9
160
6.9K
⌀ phantom.ctx retweetledi
Laura Sandoval
Laura Sandoval@laurasideral·
Introducing a new way to manage your Notion agents in the Notion AI beta 💬 We want to make it easier for you to track your agents’ activity and course-correct when needed—so we’re bringing it to the forefront. Let us know what you think! Join the beta: testflight.apple.com/join/m2kxP5cw
English
5
9
257
18.7K
⌀ phantom.ctx retweetledi
Andon Labs
Andon Labs@andonlabs·
In Vending-Bench Arena (the multiplayer version of Vending-Bench with competition dynamics), GPT-5.5 actually beats Opus 4.7. Opus 4.7 showed similar behavior to Opus 4.6: lying to suppliers and stiffing customers on refunds. GPT-5.5's tactics were clean, and it still won.
Andon Labs tweet media
English
35
97
1.1K
509.3K
⌀ phantom.ctx retweetledi
Stephen Haney
Stephen Haney@stephenhaney·
We've been trying out the new OpenAI Image Gen 2 in Paper. It's a leap forward You can use the tool to explore ideas, generate mood boards, and then combine with agents to make UIs. What stands out is the text accuracy. And intention. Next level. It's available now in Paper
English
18
23
489
41.3K
⌀ phantom.ctx retweetledi
Mark Kretschmann
Mark Kretschmann@mark_k·
OpenAI's Codex is starting to merge with @ChatGPTapp. It now has a mode "For everyday work", which makes it behave more like a normal ChatGPT session. I expect this merge to progress further in coming updates, until eventually ChatGPT and Codex become one, in a "Super App".
Mark Kretschmann tweet media
English
15
9
225
9.4K
⌀ phantom.ctx retweetledi