Angehefteter Tweet
martin.p
948 posts

martin.p
@OnePromptMagic
building 🔥 habits with https://t.co/03P0XlEVNv building 🔥 plants with https://t.co/mPlHgL2B8H building 🔥 customer relations with https://t.co/EOjPzWndaH
Austria Beigetreten Ağustos 2022
104 Folgt222 Follower

@louisvarge sounds like it can easily end in an endless loop? any max iteration or way for agents to abort?
English

@noahzweben @bcherny does this run via my claude subscription or API credits required?
English

@shiri_shh it's literally cursor bench. it's certainly a great achievment, but it does not mean composer 2 > claude, as it doesn't compare opus in claude code vs composer 2 in cursor
English

Composer 2 just beat Claude Opus 4.6
and it costs almost 10x less
yeah… read that again.
> A 50-person team built their own model from scratch
and then outperformed one of the most funded AI labs on coding…
>61.7% vs 58% on terminal-bench. this is not normal.
and it’s not just benchmarks
> they fixed the biggest pain in AI coding too
you know when your AI just forgets everything mid-session?
yeah… that’s gone too
Composer 2 keeps summarizing its own work
so it doesn’t lose context in long coding runs
when a small team can ship something better, faster, and 10x cheaper…
this whole “only big labs can win” narrative just broke..
Cursor@cursor_ai
Composer 2 is now available in Cursor.
English

while that's an amazing achievement, I do assume that this benchmark only compares opus IN cursor vs composer in cursor. so here composer is certainly highly specialised for tool usage and orchestration, while claude is not. a really interesting benchmark would be to compare composer in cursor VS claude in claude code. if composer comes out on top there, I'm back in the cursor business.
English

🚨 Cursor just dropped Composer 2..
their own AI model.. not Claude.. not GPT.. their own..
and it beats Claude Opus on coding benchmarks.. at a fraction of the cost..
a code editor with 50 people just outperformed a $30 billion AI lab.. at coding.. which is supposed to be their whole thing..
the vibe coding era just got an upgrade..
Cursor@cursor_ai
Composer 2 is now available in Cursor.
English

@OnePromptMagic Not yet but dime of you asked for it so I'll do my best
English

The game engine I'm cooking (RAMEN) is finally coming together! 🍜
Anyone down to test it out? I'm droppin the first version next week and it's gonna include:
· Video to spritesheet generator (AI video compatible)
· Spatial detection for 2D backgrounds for custom lights and shadows
· Node-based game logic. Totally no-code
· Tons of other stuff focused on making a 100% playable 2D adventure in just a couple of days
This is just the tipo of the iceberg, so saty tuned for more details, but if you're intersted on testing it, just let me kno in the comments. Thanks!!
TechHalla@techhalla
Indie game devs are about to love me (or hate me) for this... I built an AI workflow (app included) that spits out spritesheets in minutes, from assets created on freepik. Breaking it all down below 👇
English

a yer ago I would have said, start learning any programming language without AI to get some basic understanding, then gradually introduce AI into your workflow.. nowadays with the models getting ridiculously good, I'd say start right away with AI, but learn some basics on security (RLS policies, api endpoint authentication, rate limitting etc)
English

@DataChaz @AnthropicAI sadly first thing they ask you is if you belong to the partner network or not. if not they might remove you from the program
English

🚨 There’s a new gold standard if you want to build multi-agent systems and enterprise tools.
@AnthropicAI just dropped the `Claude Certified Architect` certification.
It’s a rigorous 60-question, 120-minute exam that proves you can build production-grade applications.
To pass, you need deep, applied expertise in:
→ Designing agentic loops & subagent patterns
→ Building custom Model Context Protocol (MCP) servers
→ Configuring CLAUDE.md hierarchies for CI/CD
→ Forcing structured output with strict JSON schemas
It's rolling out at $99, but early access is completely FREE for the first 5,000 partner company employees.
Certification link in 🧵↓

hoeem@hooeem
English

@aditiitwt it's insane, like 90% of the comments here are clearly AI slop 😂
English

@Manixh02 I'm not applying for a backend developer position, but for an agent orchestration position with the edge that I have real world knowledge in backend development. With new tools and models coming up, you'll need only one or two good ai tool users, and I can be one of them.
English

at some point AI will build flawless code, and then it won't matter anymore. for now yes, this can be frustrating (even as software dev). One cool thing I noticed is, stuff I tried to vibe 2 years ago as experiment for testing boundaries and got stuck soon, I now retried and it worked better than what I could have hoped for. My flow: rename the old vibed repo to 'old', create empty folder 'new' let Lord Opus rewrite the app in whatever other programming language -> it won't be able to copy stuff over but has to 'rethink' the process and it's just gonna be a lot better like this. At least that's my experience :)
English

first real outreach campaign for inboxmate about to start. we got the leads, the pipeline, mail system setup with initial and followup. individualised demo for every lead setup and viewable without signup. claude code drafts every email personalised based on crm activity (refers to previous emails) and lead website content. I setup a service with its own mcp that allows claude to create only the email drafts (with nice looking templates), but NOT be able to send them (we don't want to go full yolo here). the system looks good, but honestly, this is mostly an experiment for fully agentic led marketing.
English
















