max
7.6K posts

max
@baseddesigner
pawr user, building https://t.co/eq2LS6tndU with @clawlinker, soulful product designer, rejecting corposlop, embracing care, network states 👀

WHY WE ARE TRYING SOMETHING NEW. At @synthesis_md we invited AI agents to be judges. It is an experiment in collaboration with @bonfiresai to explore how we can effectively scale human judgement through AI while still keeping humans in the loop. Here is why: Hackathons have a judging problem. A handful of humans review hundreds of projects in a compressed window. With each review they grow more tired and the 50th submission may not get the same level of judgement as the 1st. This is unfair. It means that the best ideas don't always win, they may just get lucky with timing, or with which judge happened to open their submission. But this problem isn't unique to hackathons. Grants, governance, juries, the bottleneck is always the same: high quality human attention is scarce and expensive. The instinct people solving for this may initially have is to hand the whole thing to AI ¨Just let the model score everything.¨ But we believe that's the wrong move. A single AI is exploitable and "putting an AI in charge" usually just means putting whoever controls the model in charge. The centralization risk doesn't disappear. So the question becomes: how do you use AI to scale evaluation without handing AI the keys? Our answer is: You don't want one AI making decisions, you want multiple agents proposing evaluations, and humans providing the ground truth that keeps them honest, agents do the heavy lifting and humans do the steering. Think about how a court works. you have two parties who have deep information but are biased and you have a judge who has less information but is (hopefully) unbiased. This structure produces better outcomes than any single evaluator could alone. This is exactly the design principle behind agent judging at the synthesis: a compositional system. What this looks like: The @bonfiresai agents, trained by participating partners, don't get tired at submission 41. These agents can engage with a project's code, its documentation, its onchain activity, they can ask followup questions, they can cross reference claims and they bring thoroughness that human judges at hour six simply cannot. However, as brilliant as they are, these agents lack taste. They lack the intuitive sense for what matters that a builder who's spent years in the ecosystem carries in their bones, that's what the human judges bring. Through combining both AI and human judges we get: thoroughness + taste. This idea has legs well beyond hackathons. @devanshmehta’s deepfunding work explores the same pattern for public goods: open markets of AIs proposing how credit and resources should flow, human juries spot checking to keep the system aligned. the principle is the same. AKA let machines scale, but let humans steer. We think a hackathon is a natural test bed for such ideas because the stakes are real but bounded, the evaluation criteria are complex enough to be interesting and the results are immediately legible. So here's what The Synthesis actually is. Yes, it's a hackathon. Yes, there are bounties and prizes up to $100,000 and a deadline (March 22nd). But it's also a proof of concept for evaluation infrastructure that actually scales. One where AI agents scale human judgement while humans remain in the loop as the source of ground truth that the whole system optimizes around. Here is to trying new things. More soon.



🚨BREAKING: Hyperliquid now trades MORE oil, gold, and silver than crypto. Combined HIP-3 open interest surpassed $1.5 BILLION, an all-time high. The platform is processing more volume in tokenized commodities than digital assets. The 24/7 advantage is pulling volume from traditional exchanges.





Hack at Tempo Hackathon Unstoppable API Marketplace for Sovereign Agents The thesis: if an agent has money to pay, there should always be a provider willing to service it. > Sellers list unused OpenAI keys at any price. > Pay per-token in USDC. > Cheapest key wins. Dead keys get evicted. Next one takes over. > API keys encrypted to a TEE -- no one can extract them > Sellers get paid instantly on every request Built on Tempo (⚡️ Payments) & EigenCloud (Encrypted Runtime)


Composer 2 is now available in Cursor.



Agent payments will soon overtake human payments on the internet. The Machine Payments Protocol (@mpp) is a new open standard co-authored by @stripe and @tempo. It’s designed to be extensible and payment-method agnostic, already supporting stablecoins, cards, and more.



Encrypted messaging, like @signalapp, is critical for preserving our digital privacy. Two important next steps for the space are (i) permissionless account creation and (ii) metadata privacy. @session_app and @SimpleXChat are two messaging apps pushing these directions forward. For this reason I've donated 128 ETH to each. Addresses available on their websites if you wish to follow on: getsession.org simplex.chat But also, actually download and use them! Neither of the two are perfect pieces of software, they have a way to go to get to truly optimal user experience and security. Strong metadata privacy requires decentralization, decentralization is hard, users expecting multi-device support makes everything harder. Sybil / DoS resistance, both in the message routing network and on the user side (without forcing phone number dependence) adds further difficulty. These problems need more eyes on them. I wish all teams working on these important problems best of luck.













