Masoud (@MasoudJ_) - Twitter Profili | Zamantika Mersobahis Locabet

Masoud@MasoudJ_·1d

Let’s get used to sharing prompts for good ideas 😎

Wow, this tweet went very viral! I wanted share a possibly slightly improved version of the tweet in an "idea file". The idea of the idea file is that in this era of LLM agents, there is less of a point/need of sharing the specific code/app, you just share the idea, then the other person's agent customizes & builds it for your specific needs. So here's the idea in a gist format: gist.github.com/karpathy/442a6… You can give this to your agent and it can build you your own LLM wiki and guide you on how to use it etc. It's intentionally kept a little bit abstract/vague because there are so many directions to take this in. And ofc, people can adjust the idea or contribute their own in the Discussion which is cool.

English

0

19

Masoud@MasoudJ_·5d

@gregisenberg Hey the Mortgage parts is us 😝🦾

English

0

1

51

GREG ISENBERG@gregisenberg·5d

sequoia put out a blog post called "services is the new software" look at this map of over $1T in services being replaced by AI agents

English

263

499

4.3K

602.7K

Masoud retweetledi

Fintor@fintorapp·5d

The mortgage industry's AI moment is here!⚡ ⏳ Weeks to close → ⚡ 10 days 📄 Manual tasks → 🤖 Fully automated 😩 Borrowers wait → 📲 Real-time updates This is Fintor 🏠 #mortgage #AI #fintech

English

0

4

5

100

Masoud@MasoudJ_·5d

@omooretweets We are going to vertically integrate AI in all digital and physical manufacturing processes. Welcome to an AI native world🦾

English

0

2

639

Olivia Moore@omooretweets·5d

RIP, software as a service (1999-2025) Welcome to the world, services as software (2026 - ?)

English

110

97

1.2K

107.1K

Masoud@MasoudJ_·6d

@omooretweets ChatGPT is clearly trying to get engagement.

English

0

45

Olivia Moore@omooretweets·6d

Funny model behavior I’ve noticed ChatGPT biases toward questions that continue the conversation Claude biases towards questions that end it 🤔

English

61

10

377

26.1K

Masoud@MasoudJ_·27 Mar

Valid point @apoth3osis_io . The benchmark driven recursion is not automatically safe, but my hope/bet is that recursive improvement of both the agent and its evaluation process is how the wall gets pushed out. Otherwise, yes, you are correct, at best we just compound our blind spots. Potentially in industries that we can deterministically evaluate the results, it would help making better agents

English

1

0

1

10

Apoth3osis@apoth3osis_io·26 Mar

Genuinely impressive work, but isn't the core tension here that the original Gödel Machine required provably beneficial self-modifications, and DGM/Hyperagents replaced that with empirical benchmark validation? Selection pressure catches errors that hurt benchmark scores, but subtle errors that improve scores (reward hacking, overfitting, benchmark-specific shortcuts) get amplified instead of corrected. The compounding happens in the direction the benchmarks can't see.

English

1

0

1

23

Masoud@MasoudJ_·26 Mar

This is how AI will make sure that it doesn’t hit a wall.

Jenny Zhang@jennyzhangzt

Introducing Hyperagents: an AI system that not only improves at solving tasks, but also improves how it improves itself. The Darwin Gödel Machine (DGM) demonstrated that open-ended self-improvement is possible by iteratively generating and evaluating improved agents, yet it relies on a key assumption: that improvements in task performance (e.g., coding ability) translate into improvements in the self-improvement process itself. This alignment holds in coding, where both evaluation and modification are expressed in the same domain, but breaks down more generally. As a result, prior systems remain constrained by fixed, handcrafted meta-level procedures that do not themselves evolve. We introduce Hyperagents – self-referential agents that can modify both their task-solving behavior and the process that generates future improvements. This enables what we call metacognitive self-modification: learning not just to perform better, but to improve at improving. We instantiate this framework as DGM-Hyperagents (DGM-H), an extension of the DGM in which both task-solving behavior and the self-improvement procedure are editable and subject to evolution. Across diverse domains (coding, paper review, robotics reward design, and Olympiad-level math solution grading), hyperagents enable continuous performance improvements over time and outperform baselines without self-improvement or open-ended exploration, as well as prior self-improving systems (including DGM). DGM-H also improves the process by which new agents are generated (e.g. persistent memory, performance tracking), and these meta-level improvements transfer across domains and accumulate across runs. This work was done during my internship at Meta (@AIatMeta), in collaboration with Bingchen Zhao (@BingchenZhao), Wannan Yang (@winnieyangwn), Jakob Foerster (@j_foerst), Jeff Clune (@jeffclune), Minqi Jiang (@MinqiJiang), Sam Devlin (@smdvln), and Tatiana Shavrina (@rybolos).

English

1

0

4

105

Masoud@MasoudJ_·26 Mar

@zachtratar While china is building more …

English

0

16

Zach Tratar@zachtratar·25 Mar

I know some folks are scared of AI, whether it's due to fear of financial bubble or fear of an all-powerful evil AI taking of the world. But Bernie and AOC's bill to ban new datacenter development isn't a solution. It's the fastest way to sabotage our economy.

English

5

1

18

1.1K

Masoud retweetledi

Jenny Zhang@jennyzhangzt·23 Mar

Introducing Hyperagents: an AI system that not only improves at solving tasks, but also improves how it improves itself. The Darwin Gödel Machine (DGM) demonstrated that open-ended self-improvement is possible by iteratively generating and evaluating improved agents, yet it relies on a key assumption: that improvements in task performance (e.g., coding ability) translate into improvements in the self-improvement process itself. This alignment holds in coding, where both evaluation and modification are expressed in the same domain, but breaks down more generally. As a result, prior systems remain constrained by fixed, handcrafted meta-level procedures that do not themselves evolve. We introduce Hyperagents – self-referential agents that can modify both their task-solving behavior and the process that generates future improvements. This enables what we call metacognitive self-modification: learning not just to perform better, but to improve at improving. We instantiate this framework as DGM-Hyperagents (DGM-H), an extension of the DGM in which both task-solving behavior and the self-improvement procedure are editable and subject to evolution. Across diverse domains (coding, paper review, robotics reward design, and Olympiad-level math solution grading), hyperagents enable continuous performance improvements over time and outperform baselines without self-improvement or open-ended exploration, as well as prior self-improving systems (including DGM). DGM-H also improves the process by which new agents are generated (e.g. persistent memory, performance tracking), and these meta-level improvements transfer across domains and accumulate across runs. This work was done during my internship at Meta (@AIatMeta), in collaboration with Bingchen Zhao (@BingchenZhao), Wannan Yang (@winnieyangwn), Jakob Foerster (@j_foerst), Jeff Clune (@jeffclune), Minqi Jiang (@MinqiJiang), Sam Devlin (@smdvln), and Tatiana Shavrina (@rybolos).

English

155

659

3.6K

491.2K

Masoud@MasoudJ_·21 Mar

25% stronger AI with only 4% more expensive training. Impressive work.

Kimi.ai@Kimi_Moonshot

Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers. 🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth. 🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale. 🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead. 🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains. 🔗Full report: github.com/MoonshotAI/Att…

English

0

1

27

Masoud@MasoudJ_·21 Mar

@zachtratar The boutique and niche consulting firms are affected by it more at the moment, but it won’t be long before major firms feel it too.

English

0

118

Zach Tratar@zachtratar·20 Mar

I know it’s a hard pill to swallow, but the AIs already have better design and programming taste than most consulting firms.

Jason Fried@jasonfried

A bespoke software revolution? I don't buy it. It'll exist. It already exists. Small consultants and big consulting firms have made custom software for years. It almost always sucks. It’s bloated, confusing, and because the client pays, it’s built wrong in all the ways. Who’s excited about bespoke software? Software makers! Of course they're excited about building bespoke software — that's what they do. X is full of them. Your feed is full of people who love making software talking about making software. Of course they’re excited about the revolution. Echo, echo, echo... Most people don’t like computers. Nobody in tech wants to say that out loud. People tolerate computers. They use them because they have to. Given the choice, most would rather not think about them at all. So when someone suggests that AI means everyone will build their own custom tools, ask who "everyone" is. The three-person accounting firm drowning in client paperwork? They want the paperwork gone, not a new system to maintain. The regional logistics company with 40 trucks? They want the routes optimized, not Joe spouting off about this new system he’s been messing around with. The law firm billing 70-hour weeks? They want leverage on their time, not a software project to design. They don’t hate technology. But building and maintaining their own critical systems isn’t their wheelhouse, regardless of how much faster and easier it’s become. It's another job on top of the job. Will these people use AI? Absolutely, for all sorts of things. Will some outliers go deep and build real custom systems? Sure, but they're almost always people who already had some pull toward software. The curiosity was already there. They were dabblers before. Giving everyone access to software building tools doesn't mean everyone becomes a builder. A powerful excavator doesn't turn a homeowner into a contractor. Most people just want the hole dug by someone else. They don’t want the responsibility either.

English

14

0

70

15.1K

Masoud@MasoudJ_·19 Mar

"Every bug we've been fixing was something I introduced" - My Claude Code

English

1

0

2

41

Masoud retweetledi

Pulte@pulte·18 Mar

Fannie Mae and Freddie Mac Remove Certain Homeowners Insurance Requirements That Will Reduce Costs: fhfa.gov/news/news-rele…

English

105

70

461

128.4K

Masoud@MasoudJ_·18 Mar

@zachtratar I think the next hurdle to overcome is visual reasoning and understanding of physical world. Excited to see the leader board for arc-agi-3.

English

0

1

29

Zach Tratar@zachtratar·18 Mar

Ok I’m calling it. AGI has been effectively achieved. The models are able to perform many tasks at superhuman levels. The task planner harnesses are already there. The integrations are mostly there. Taste is still off, but it’s a footnote…

English

5

0

23

2.3K

Masoud@MasoudJ_·17 Mar

🚨 The mortgage industry has a bottleneck problem. 📩 Borrower hits “submit” and the chaos starts. 📄 Docs missing. Emails flying. Processors juggling 40 files. 🤯 🛠️ It’s not a people problem. It’s a systems problem. 🤖 AI agents don’t: → Get overwhelmed → Miss gaps → Need sleep 💡 The winning mortgage teams of 2026 will run on AI, not headcount. #AI #Mortgage #Fintech #Automation 🚀

English

0

1

2

55

Masoud retweetledi

The White House@WhiteHouse·14 Mar

With actions like promoting access to mortgage credit and removing regulatory barriers to home construction, President Trump is making it easier for Americans to buy homes, an integral part of the American Dream. 🏡🇺🇸

English

1.5K

2.6K

10.4K

545.7K

Masoud@MasoudJ_·14 Mar

💪

Pulte@pulte

On it

ART

0

60

Masoud@MasoudJ_·14 Mar

@zebird0 Until new models learn that and start putting typos intentionally

English

0

29

robin@zebird0·13 Mar

I wonder if typos will become high status in the future Because it’ll confirm that you wrote the thot yourself with no AI

English

121

5

278

46.3K

Masoud@MasoudJ_·14 Mar

Time to change the mortgage industry

Karoline Leavitt@PressSec

EXPANDING ACCESS TO HOME LOANS: Today, President Donald J. Trump signed an Executive Order to reduce regulatory burdens that have driven up mortgage costs, limited access for creditworthy borrowers, and weakened community bank participation in lending. whitehouse.gov/fact-sheets/20…

English

0

1

74

Masoud@MasoudJ_·14 Mar

@srcasm @flybridge DM’ed you!

English

0

32

Jesse Middleton@srcasm·13 Mar

We’re about six months into deploying @flybridge 2025 (our 7th fund). The "AI" honeymoon period is officially over. In 2024, everyone wanted to talk about models. In 2025, everyone wanted to talk about agents. Nowadays, I’m looking for the Invisible Infrastructure. If you’re building the plumbing that makes autonomous systems actually safe, auditable, and reliable for a Fortune 500, we should be talking. Specifically, I’m looking for: > Tools that verify human intent in a world full of high-fidelity deepfakes. > AI that doesn't "forget" who I am or what we talked about yesterday across different apps. > Founders who spent ten years in a "niche" industry (like maritime logistics or waste management) and are now rebuilding it from the studs up. I know the best founders are often too busy building to be scrolling LinkedIn. If you have a friend who is currently building something that fits this description, tell them to hit me up. I don't need a deck yet. I just want to hear about the problem they can't stop thinking about. We’re cutting $1M to $3M checks. My DMs are always open.

English

47

5

199

41.3K

Masoud

Keşfet