Moonfarm 🇸🇪

5.9K posts

@moonfarm_dev

Software engineer ex Polestar, AstraZeneca. Currently building safe email mailboxes for agents https://t.co/tHU2upSdQz 🧑‍💻

Need email for your agent? 👉
Joined February 2017
856 Following, 2.1K Followers
Pinned Tweet
Moonfarm 🇸🇪
Moonfarm 🇸🇪@moonfarm_dev·
1/4 This year AI and specifically AI Agents have taken a huge leap forward with @openclaw, @anthropics coworker and now @cursor launching their own "always on agent". 🔥 I have fully embraced "Agentic engineering" as @steipete so gracefully put it, and have had a blast working on an agent-first saas, I actually think I need to start using all my time to build agent-first saas. 🫣 So! Here are the new goals for this year 🧵👇
Moonfarm 🇸🇪 tweet media
Moonfarm 🇸🇪@moonfarm_dev

My 2026 Challenge 🚀 ⭐️Build 10+ revenue-focused iOS apps 💸 Reach $10k MRR ✍️ Document every step to 20k followers. (We started at 1881). Who's joining the ride? 🐎 #buildinpublic

English
4
0
13
760
Tuki
Tuki@TukiFromKL·
🚨 Cursor just dropped Composer 2.. their own AI model.. not Claude.. not GPT.. their own.. and it beats Claude Opus on coding benchmarks.. at a fraction of the cost.. a code editor with 50 people just outperformed a $30 billion AI lab.. at coding.. which is supposed to be their whole thing.. the vibe coding era just got an upgrade..
Cursor@cursor_ai

Composer 2 is now available in Cursor.

English
286
263
4.5K
995.2K
Moonfarm 🇸🇪
Moonfarm 🇸🇪@moonfarm_dev·
@geggleto @itsolelehmann It kinda helps make a generic skill more specialized, but of course, if the verification the agent tests it against is poor, the skill will not become better
English
0
0
0
3
Ole Lehmann
Ole Lehmann@itsolelehmann·
i built a skill that 10x's all your other claude skills on autopilot (using karpathy's autoresearch method)

the problem: most of your claude skills are quietly broken and you have no idea

i didn't either. i ran my landing page copy skill through the autoresearch scoring loop and it was failing its own quality checks 44% of the time(!!)

karpathy (co-founded openai, coined "vibe coding") built autoresearch to improve machine learning code autonomously. turns out it works on skills too

here's how: you give it 3-6 yes/no questions that define what "good" means for that skill. ex: "does the headline include a specific number?" or "is the copy free of words like revolutionary and game-changing?"

then it loops:
1. runs your skill
2. scores the output against your questions
3. tweaks one thing in the prompt
4. runs it again
5. score went up? keep the tweak. score went down? undo it
6. repeat

i walked away and came back to a landing page skill that passes 92% of its checks. 4 rounds of changes, zero manual editing

works on anything where you can define what good looks like. outreach sequences, newsletter drafts, any prompt you reuse, etc

note: method credit to karpathy, i just adapted it for claude skills

full skill below.
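The loop described in this post can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the author's actual skill or karpathy's autoresearch code: `toy_skill`, `score`, `improve`, the two checks, and the two tweaks are all hypothetical names, and `toy_skill` is a deterministic stand-in for what would really be a model call (a real setup would likely sample several outputs per round).

```python
from typing import Callable, List, Tuple

Check = Callable[[str], bool]  # one yes/no question about an output
Tweak = Callable[[str], str]   # one small change to the prompt


def toy_skill(prompt: str) -> str:
    """Hypothetical stand-in for running the skill; a real setup would call a model."""
    if "use a specific number" in prompt:
        headline = "Save 3 hours a week"
    else:
        headline = "Save time every week"
    if "sound exciting" in prompt:
        headline += " with our revolutionary tool"
    return headline


def score(output: str, checks: List[Check]) -> float:
    """Fraction of yes/no checks the output passes."""
    return sum(1 for check in checks if check(output)) / len(checks)


def improve(prompt: str, run_skill: Callable[[str], str],
            checks: List[Check], tweaks: List[Tweak]) -> Tuple[str, float]:
    """Try one tweak per round; keep it only if the score goes up."""
    best_prompt = prompt
    best_score = score(run_skill(best_prompt), checks)
    for tweak in tweaks:
        candidate = tweak(best_prompt)
        candidate_score = score(run_skill(candidate), checks)
        if candidate_score > best_score:
            best_prompt, best_score = candidate, candidate_score
        # Otherwise the tweak is dropped (ties count as "undo" in this sketch).
    return best_prompt, best_score


# The two example questions from the post, written as checks.
CHECKS: List[Check] = [
    lambda out: any(ch.isdigit() for ch in out),
    lambda out: not any(w in out.lower() for w in ("revolutionary", "game-changing")),
]

# Hypothetical single-change tweaks to the prompt.
TWEAKS: List[Tweak] = [
    lambda p: p + " Make it sound exciting.",
    lambda p: p + " Always use a specific number.",
]

best_prompt, best_score = improve(
    "Write a landing page headline.", toy_skill, CHECKS, TWEAKS
)
print(best_prompt, best_score)
```

In this toy run the first tweak lowers the score and is undone, while the second raises it and is kept. As with the real loop, the result is only as good as the checks it is scored against.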
Ole Lehmann tweet media
English
6
2
36
3.4K
geggleto | YGG
geggleto | YGG@geggleto·
@moonfarm_dev absolutely wild. I was thinking about spinning up a K2.5 instance this weekend, guess I don't need to now lol
English
1
0
1
4
Thomas Sanlis 🥐
Thomas Sanlis 🥐@T_Zahil·
❌ Stop brainstorming product ideas ✅ The only ideas worth your time are distribution ideas.
English
5
3
16
694
Moonfarm 🇸🇪
Moonfarm 🇸🇪@moonfarm_dev·
@leerob Can I just say that it's great that you decided to be transparent about this, kudos for not just leaving it in the dark 👏
English
0
0
0
123
Lee Robinson
Lee Robinson@leerob·
Yep, Composer 2 started from an open-source base! We will do full pretraining in the future. Only ~1/4 of the compute spent on the final model came from the base, the rest is from our training. This is why evals are very different. And yes, we are following the license through our inference partner terms.
Fynn@fynnso

was messing with the OpenAI base URL in Cursor and caught this
accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast
so composer 2 is just Kimi K2.5 with RL
at least rename the model ID

English
162
69
1K
180.2K
Moonfarm 🇸🇪
Moonfarm 🇸🇪@moonfarm_dev·
I don't know if this is any good for a new domain, but I guess it's progress 🥳
Moonfarm 🇸🇪 tweet media
English
2
0
3
56
Aman Sanger
Aman Sanger@amanrsanger·
Composer 2 marks the one-year anniversary of our large model training efforts. Since then, we've built an exceptionally talent-dense team of ~40 people with some of the best researchers and engineers from the labs, academia, industry, and more heterogeneous backgrounds. And we are exclusively focused on coding. We don't care about models that can respond to emails, do your tax returns, or be your friend. Every FLOP, token, parameter, and researcher is entirely dedicated to software engineering.
Cursor@cursor_ai

Composer 2 is now available in Cursor.

English
74
51
1K
111.3K
Moonfarm 🇸🇪
Moonfarm 🇸🇪@moonfarm_dev·
@MartinTale Yeah, my openclaw is just a mess of fixing issues, I'll probably lean into whatever this turns out to be haha
English
0
0
1
13
Sui_Builds
Sui_Builds@suida_ajdini·
Finally 1K verified followers 🥳
Sui_Builds tweet media
English
112
9
130
2.7K
JaredC
JaredC@JaredC50767·
@moonfarm_dev Yeah, the interesting split is convenience vs control. Messaging Claude Code from your phone is nice, but the durable layer is still whether you own the runtime, local files, browser auth, and the recovery path when the workflow gets weird.
English
2
0
2
17
jordi
jordi@jordienr·
yall spending thousands on GEO tools and this shit so easy
jordi tweet media
English
4
0
21
880
Moonfarm 🇸🇪
Moonfarm 🇸🇪@moonfarm_dev·
@shredandship I couldn't agree more, I love how AI enables us to do so much more with the same resources
English
1
0
4
56
Johan
Johan@shredandship·
Some developers fell in love with writing code. I fell in love with building things. That’s why AI doesn’t feel like a threat to me. The how has changed a dozen times over my career. The why never has.
English
45
2
84
2K