Marco Marti

3.3K posts

@marcodelic256

AI, building, Markets & Sound Founder | @FeelClearme

Joined June 2012

747 Following · 255 Followers

Marco Marti@marcodelic256·7h

x.com/i/article/2049…

Marco Marti@marcodelic256·16h

@trq212 @ClaudeDevs Starting from being up and running would be awesome. If Opus then stopped constantly getting dumb and making mistakes, that would be epic

Thariq@trq212·1d

we're doing a lot more of this, hunting down some of the most annoying bugs in Claude Code. let me know if you have any white whales

ClaudeDevs@ClaudeDevs

In the last four Claude Code CLI releases, we’ve shipped 50+ stability and performance fixes. Faster resume, stable auth, lower memory, fewer hangs: 🧵

409

1.3K

201.1K

Marco Marti@marcodelic256·1d

@ClaudeDevs I wish that task would not require Codex to fix it 😫

ClaudeDevs@ClaudeDevs·1d

Claude Code can now send push notifications to your phone when a long task finishes or Claude needs your input. Walk away from the terminal, we'll let you know when it's done.

469

988

17.4K

1.2M

Marco Marti@marcodelic256·2d

This clearly shows that 99% of AI users have no clue what they are talking about

Arena.ai@arena

GPT-5.5 by @OpenAI is now live in the Arena, landing across multiple leaderboards. Here’s how it ranks by modality:
- Code Arena (agentic web dev): #9, a strong +50pt jump over GPT-5.4
- Document Arena (analysis & long-content reasoning): #6, on par with Sonnet 4.6
- Text Arena: #7, Math #3, Instruction Following: #8
- Expert Arena: #5
- Search Arena: #2
- Vision Arena: #5
Strong, well-rounded performance, especially in Code (+50 pts vs GPT-5.4). Congrats to @OpenAI on the release. Full category breakdowns by modality in the thread.

Marco Marti@marcodelic256·2d

Is Opus becoming junk again for you too?

Marco Marti@marcodelic256·5d

@claudeai goodbye tokens

Claude@claudeai·6d

Memory on Claude Managed Agents is now in public beta. Your agents can now learn from every session, using an intelligence-optimized memory layer that balances performance with flexibility.

383

627

9.1K

602.8K

Marco Marti@marcodelic256·5d

Anyone actually using DeepSeek?

DeepSeek@deepseek_ai

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.
🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.
Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today!
📄 Tech Report: huggingface.co/deepseek-ai/De…
🤗 Open Weights: huggingface.co/collections/de…
1/n

Marco Marti@marcodelic256·6d

@thsottiaux not yet!!!

Tibo@thsottiaux·6d

Rollout will be complete to 100% of paid users in the next 5 mins.

174

947

92.1K

Tibo@thsottiaux·6d

Stop tweeting for a hot minute and update your Codex App to find full browser use, global dictation, non-dev mode, a new auto-review mode that is much safer than yolo, in-app docs and PDF viewer, and ... GPT-5.5.

OpenAI Developers@OpenAIDevs

With GPT-5.5, Codex now gets more of the job done across the browser, files, docs, and your computer. We've expanded browser use so Codex can interact with web apps, test flows, click through pages, capture screenshots, and iterate on what it sees until it completes the task.

304

192

3.6K

308.2K

Marco Marti@marcodelic256·6d

In the span of 10 minutes.. Claude admits that it regressed really badly and GPT 5.5 is released. What an insane world

Marco Marti@marcodelic256·Apr 22

@bcherny @ReadySetBrian On xhigh it lasts like 10 min with the max sub

Boris Cherny@bcherny·Apr 22

@ReadySetBrian Hmm are you seeing this with Opus 4.7 on xhigh effort and the latest version of Claude Code?

293

341

201.2K

TimWhatley@ReadySetBrian·Apr 22

Canceled Claude max today, @bcherny whatever happened in the last 1-2 months is a significant regression. The model feels like someone from OpenAI started working on trust and safety there. Opus thinking is significantly worse. Every statement is “here’s where I’d push back on that” and then proceeds to rattle off the most inane list of confused counter arguments. It was perfect 3-4 months ago!!!

149

1.9K

258.7K

Marco Marti@marcodelic256·Apr 22

Opus 4.7 doing something silly: "Bigger problem than a missing slash command. You've got two distinct 'orchestrator' concepts in parallel, and neither of us realized it when we built the second one yesterday." Bro, you're the one creating folders, not us

Marco Marti@marcodelic256·Apr 22

This makes the difference in how good your business strategy will be

Marco Marti@marcodelic256

When you're brainstorming on something really important and complex, with different files, reports, data and so on, a really good prompt that I found for Opus or GPT 5.4 XHigh is to say: now spawn a sub-agent, brief it on what you did and ask it to: review your work; propose a different angle; challenge you; propose fixes if needed.

Marco Marti@marcodelic256·Apr 21

Marco Marti@marcodelic256·Apr 21

@bcherny @HaneeefShiraz Well, with high I hit my limits in less than 1h with the Max sub. I recently switched from Codex and I'm thinking of switching back

Boris Cherny@bcherny·Apr 21

@HaneeefShiraz We increased rate limits for all subscribers, so you shouldn’t feel the increased token consumption

354

34.2K

Haneef Shiraz@HaneeefShiraz·Apr 21

While it feels helpful, I just cannot shake off the feeling that this is how Anthropic can start to throttle legally.
1. Release new model and say 30% higher usage
2. Ask to use High to do what older models did much better just normal
3. Eventually folks realize they run out much faster
Sigh. What a pleasure it was coding using Opus 4.5. And where has it gone to now :(

Boris Cherny@bcherny

👋 Is there a specific issue you're hitting? If so, would you mind running /feedback and sharing the id here? That would be most helpful for debugging. There were a number of harness changes that may have caused this, all of which are fixed in the latest (last known issue was fixed in 2.1.116 today). We will be sharing more in a bit, and have also shared a few updates on X/Threads as we've been investigating.
General tips:
1. Use Opus 4.7 + xhigh/max effort
2. Make sure you're using the latest version of Claude Code (currently 2.1.116)

118

57.8K

Marco Marti@marcodelic256·Apr 21

Basically the 100€ Claude sub lasts as long as the 30€ OpenAI one

Marco Marti@marcodelic256·Apr 20

I’m loving Opus 4.7. Happy I’m the only one

Marco Marti@marcodelic256·Apr 18

Is OpenAI manipulating the answers of its own model to boost its own adoption? This happened several times when I ran tests and asked GPT to evaluate Sonnet vs Gemini Flash vs GPT 5.4 mini. Do you trust what a single model outputs?

Marco Marti@marcodelic256·Apr 16

Damn, Anthropic killed it. The new Opus 4.7 is insane. Going to move from the $100 to the $200 plan.

Marco Marti@marcodelic256·Apr 16

I hope that @AnthropicAI didn't use Opus 4.7 to build and ship their new desktop app. It's so buggy. Unreal how bad Claude is for coding. Don't get me wrong, it's still amazing for writing and for some business ideas, but other than that, coding is really bad.

167

Marco Marti@marcodelic256·Apr 16

@shawmakesmagic @sporadica Not too hard to understand

235

Shaw (spirit/acc)@shawmakesmagic·Apr 16

He's literally talking about taxing people who own property in NYC that they don't live in. You don't have to pay the tax if you live there, and the housing is so expensive that it could lower prices in Manhattan to where normal people could live there, not just Russian oligarch ghost condos

169

9.3K

Explore

@trq212 @ClaudeDevs @claudeai @thsottiaux @bcherny @ReadySetBrian @elonmusk @BarackObama