Benj

1.4K posts

Benj

Benj

@Founder_Benji

Tech, AI, Bitcoin, RSI. My own opinions.

Katılım Eylül 2022
1K Takip Edilen209 Takipçiler
Benj
Benj@Founder_Benji·
Less structure = more freedom and autonomy. Multi agency and massive prompt swamps are the outcome of lesser performing models. But the good models do more with less. A single agent, & simpler atomic principles to operate from is the way.
Sukh Sroay@sukh_saroy

🚨Shocking: A 25,000-task experiment just proved that the entire multi-agent AI framework industry is built on the wrong assumption. Every major framework - CrewAI, AutoGen, MetaGPT, ChatDev - starts from the same premise: assign roles, define hierarchies, let a coordinator distribute work. Researchers tested 8 coordination protocols across 8 models and up to 256 agents. The protocol where agents were given NO assigned roles, NO hierarchy, and NO coordinator outperformed centralized coordination by 14%. The gap between the best and worst protocol was 44%. That's not noise. That's a completely different outcome depending on how you organize the agents - not which model you use. Here's what makes this uncomfortable: When agents were simply given a fixed turn order and told "figure it out," they spontaneously invented 5,006 unique specialized roles from just 8 agents. They voluntarily sat out tasks they weren't good at. They formed their own shallow hierarchies - without anyone designing them. The researchers call it the "endogeneity paradox." The best coordination isn't maximum control or maximum freedom. It's minimal scaffolding - just enough structure for self-organization to emerge. But there's a catch nobody building agents wants to hear: below a certain model capability threshold, the effect reverses. Weaker models actually need rigid structure. Autonomy only works when the model is smart enough to use it. Which means every agent framework shipping with one-size-fits-all hierarchies is wrong twice - over-constraining strong models and under-constraining weak ones. The $2B+ invested in agent orchestration tooling may be solving a problem that capable models solve better on their own.

English
0
0
0
14
Benj
Benj@Founder_Benji·
what if they leaked claude code to allow everyone to build up their agentic coding tools and build cyber defense before they drop their new model. Main goal was allow everyone to catch up on coding for cyber defense so the new model isn’t as negatively impactful
English
0
0
1
23
Benj
Benj@Founder_Benji·
@Tesla @elonmusk Can you guys train FSD to learn where I like to park my car near my apartment complex? it always tries to park somewhere I dont like. But would be cool if it learned my patterns or i can favorite a parking spot
English
0
0
0
12
Benj
Benj@Founder_Benji·
this is wonderful
OpenClaw🦞@openclaw

huge shoutout to @nvidia for lending engineers to help triage our security advisories 🛡️🦞 open source security hits different when GPU companies show up to help

English
0
0
0
14
Matteo Pellegrini
Matteo Pellegrini@matteopelleg·
PREDICTION: in 12 months there will be MORE white collar jobs, including software engineers, accountants, and lawyers, than today
English
272
166
3.9K
439K
prateek
prateek@agent_wrapper·
@Founder_Benji It's not about the output, it's about being able to guide the swarm effectively
English
1
0
4
866
prateek
prateek@agent_wrapper·
We just open-sourced the system we use to manage 30 parallel AI coding agents per person. 40K lines of TypeScript. 3,288 tests. 17 plugins. Built in 8 days — by the agents it orchestrates. Yes, we used Agent Orchestrator to build Agent Orchestrator. Some numbers: → 500+ agent-hours in 24 human-hours (20x leverage) → 86 of 102 PRs created by AI (84%) → After Day 4, I stopped writing code entirely Spawn agents. Step away. Ship faster.
prateek tweet mediaprateek tweet media
English
93
164
1.4K
589.5K
Benj
Benj@Founder_Benji·
@yoheinakajima I have! It was one year ago but local models were ehh. I didnt have a wifi adapter but ya, u can make AI TOYS and smart mirrors lol.
English
1
0
0
69
Benj
Benj@Founder_Benji·
there are 9 control room monitor synbols and 9 puzzles, they likely correspond 1:1 @MrBeast @salesforce
English
0
0
1
443
Benj
Benj@Founder_Benji·
Okay so we've assembled a team to crack @MrBeast 's 1M superbowl puzzle.. We've uncovered many signals and have a few running theories on the final code... we posted part 1 of our discoveries to open source information collection and theory discussions... part 2 is coming soon. x.com/Founder_Benji/…
English
0
0
0
462
Scott
Scott@scott_5254·
@NFL @sanbenito Every none Spanish speaking person on the planet watching that performance waiting for the game to restart 🙃
Scott tweet media
English
94
6
401
31.8K
NFL
NFL@NFL·
Lo único más poderoso que el odio, es el amor. The Only Thing More Powerful Than Hate is Love. @sanbenito #AppleMusicHalftime
Español
9.8K
59.4K
256.1K
11.5M