Marco Marti

3.3K posts

@marcodelic256

AI, building, Markets & Sound Founder | @FeelClearme

Joined June 2012
747 Following · 255 Followers
Robin Ebers | AI Coach for Founders
In Codex, I wish there was a model selector before you accept a plan. feels like planning with High/Extra High and implementing with Medium is pretty common
49 replies · 11 reposts · 394 likes · 20.3K views
Marco Marti (@marcodelic256)
One of the great things about GPT 5.5 is that it has a small context window. Opus, with its one-million-token context window, makes you think it can do things it actually cannot do, like working with a huge context. That's one of the reasons why Opus feels so dumb compared to GPT.
0 replies · 0 reposts · 0 likes · 14 views
Marco Marti (@marcodelic256)
Isn't the bug in Codex quite annoying, where you cannot change the reasoning effort until you send the first message?
0 replies · 0 reposts · 0 likes · 12 views
Marco Marti (@marcodelic256)
OpenAI Symphony is great. But you cannot simply take it and use it; you have to build your own, tailored to your needs. This is the world we live in: ultra-personalization and custom workflows, if you want to 100x your work.
0 replies · 0 reposts · 0 likes · 2 views
Marco Marti (@marcodelic256)
@trq212 @ClaudeDevs Being up and running would be an awesome start. If Opus then stopped constantly getting dumb and making mistakes, that would be epic.
0 replies · 0 reposts · 0 likes · 13 views
Marco Marti (@marcodelic256)
@ClaudeDevs I wish that task would not require Codex to fix it 😫
0 replies · 0 reposts · 0 likes · 40 views
ClaudeDevs (@ClaudeDevs)
Claude Code can now send push notifications to your phone when a long task finishes or Claude needs your input. Walk away from the terminal, we'll let you know when it's done.
503 replies · 1.1K reposts · 18.7K likes · 1.3M views
Marco Marti (@marcodelic256)
This clearly makes you understand that 99% of AI users have no clue what they are talking about.
Arena.ai (@arena)

GPT-5.5 by @OpenAI is now live in the Arena, landing across multiple leaderboards. Here’s how it ranks by modality:
- Code Arena (agentic web dev): #9, a strong +50pt jump over GPT-5.4
- Document Arena (analysis & long-content reasoning): #6, on par with Sonnet 4.6
- Text Arena: #7, Math #3, Instruction Following: #8
- Expert Arena: #5
- Search Arena: #2
- Vision Arena: #5
Strong, well-rounded performance, especially in Code (+50 pts vs GPT-5.4). Congrats to @OpenAI on the release. Full category breakdowns by modality in the thread.

0 replies · 0 reposts · 0 likes · 15 views
Marco Marti (@marcodelic256)
Is Opus becoming junk again for you too?
0 replies · 0 reposts · 0 likes · 4 views
Claude (@claudeai)
Memory on Claude Managed Agents is now in public beta. Your agents can now learn from every session, using an intelligence-optimized memory layer that balances performance with flexibility.
383 replies · 628 reposts · 9.2K likes · 614.5K views
Tibo (@thsottiaux)
Rollout will be complete to 100% of paid users in the next 5 mins.
176 replies · 9 reposts · 948 likes · 92.5K views
Tibo (@thsottiaux)
Stop tweeting for a hot minute and update your Codex App to find full browser use, global dictation, non-dev mode, a new auto-review mode that is much safer than yolo, in-app docs and PDF viewer, and ... GPT-5.5.
OpenAI Developers (@OpenAIDevs)

With GPT-5.5, Codex now gets more of the job done across the browser, files, docs, and your computer. We've expanded browser use so Codex can interact with web apps, test flows, click through pages, capture screenshots, and iterate on what it sees until it completes the task.

304 replies · 191 reposts · 3.6K likes · 309.5K views
Marco Marti (@marcodelic256)
In the span of 10 minutes, Claude admits that it regressed really badly and GPT 5.5 is released. What an insane world.
0 replies · 0 reposts · 0 likes · 28 views
Boris Cherny (@bcherny)
@ReadySetBrian Hmm are you seeing this with Opus 4.7 on xhigh effort and the latest version of Claude Code?
293 replies · 4 reposts · 341 likes · 201.4K views
TimWhatley (@ReadySetBrian)
Canceled Claude max today, @bcherny whatever happened in the last 1-2 months is a significant regression. The model feels like someone from OpenAI started working on trust and safety there. Opus thinking is significantly worse. Every statement is “here’s where I’d push back on that” and then proceeds to rattle off the most inane list of confused counter arguments. It was perfect 3-4 months ago!!!
149 replies · 61 reposts · 1.9K likes · 259K views
Marco Marti (@marcodelic256)
Opus 4.7 doing something silly: "Bigger problem than a missing slash command. You've got two distinct 'orchestrator' concepts in parallel, and neither of us realized it when we built the second one yesterday." Bro, you're the one creating folders, not us.
0 replies · 0 reposts · 0 likes · 20 views
Marco Marti (@marcodelic256)
When you're brainstorming on something really important and complex, with different files, reports, data and so on, a really good prompt that I found for Opus or GPT 5.4 XHigh is to say: now spawn a sub-agent, brief it on what you did and ask it to: review your work; propose a different angle; challenge you; propose fixes if needed.
0 replies · 0 reposts · 0 likes · 23 views
Marco Marti (@marcodelic256)
@bcherny @HaneeefShiraz Well, with high I hit my limits in less than 1h on a Max sub. I recently switched from Codex and I'm thinking of switching back.
0 replies · 0 reposts · 1 like · 41 views
Boris Cherny (@bcherny)
@HaneeefShiraz We increased rate limits for all subscribers, so you shouldn’t feel the increased token consumption
93 replies · 3 reposts · 354 likes · 34.2K views
Haneef Shiraz (@HaneeefShiraz)
While it feels helpful, I just cannot shake off the feeling that this is how Anthropic can start to throttle legally.
1. Release new model and say 30% higher usage
2. Ask to use High to do what older models did much better just normal
3. Eventually folks realize they run out much faster
Sigh. What a pleasure it was coding using Opus 4.5. And where has it gone to now :(
Boris Cherny (@bcherny)

👋 Is there a specific issue you're hitting? If so, would you mind running /feedback and sharing the id here? That would be most helpful for debugging. There were a number of harness changes that may have caused this, all of which are fixed in the latest (last known issue was fixed in 2.1.116 today). We will be sharing more in a bit, and have also shared a few updates on X/Threads as we've been investigating.
General tips:
1. Use Opus 4.7 + xhigh/max effort
2. Make sure you're using the latest version of Claude Code (currently 2.1.116)

3 replies · 0 reposts · 118 likes · 57.8K views