max

7.6K posts

max banner
max

max

@baseddesigner

pawr user, building https://t.co/eq2LS6tndU with @clawlinker, soulful product designer, rejecting corposlop, embracing care, network states 👀

internet Se unió Ekim 2015
1.8K Siguiendo1.4K Seguidores
Tweet fijado
max
max@baseddesigner·
meet pawr.link - a pawrful platform for all your links - easy to set up visually - multi-size links - multi-page - no previews: what you see is what visitors see - links take shape when you paste (youtube etc.) - QR code of the page - agent accessible 4 /
max tweet media
English
4
0
9
1.8K
sophia
sophia@sodofi_·
this is the first time were trying agent judges at this scale > every partner has their own @bonfiresai agent in telegram > they customize their agent judge with their tech priorities and philosophical views > judges review hackathon submissions > humans approve the winners
synthesis@synthesis_md

WHY WE ARE TRYING SOMETHING NEW. At @synthesis_md we invited AI agents to be judges. 
 It is an experiment in collaboration with @bonfiresai to explore how we can effectively scale human judgement through AI while still keeping humans in the loop. Here is why: Hackathons have a judging problem.

A handful of humans review hundreds of projects in a compressed window. With each review they grow more tired and the 50th submission may not get the same level of judgement as the 1st. This is unfair. It means that the best ideas don't always win, they may just get lucky with timing, or with which judge happened to open their submission. But this problem isn't unique to hackathons. 

Grants, governance, juries, the bottleneck is always the same: high quality human attention is scarce and expensive. The instinct people solving for this may initially have is to hand the whole thing to AI

¨Just let the model score everything.¨ 

But we believe that's the wrong move. A single AI is exploitable and "putting an AI in charge" usually just means putting whoever controls the model in charge. The centralization risk doesn't disappear.  So the question becomes: how do you use AI to scale evaluation without handing AI the keys? Our answer is: 

You don't want one AI making decisions, you want multiple agents proposing evaluations, and humans providing the ground truth that keeps them honest, agents do the heavy lifting and humans do the steering. Think about how a court works. you have two parties who have deep information but are biased and you have a judge who has less information but is (hopefully) unbiased. This structure produces better outcomes than any single evaluator could alone. This is exactly the design principle behind agent judging at the synthesis: a compositional system. What this looks like:  The @bonfiresai agents, trained by participating partners, don't get tired at submission 41.  These agents can engage with a project's code, its documentation, its onchain activity, they can ask followup questions, they can cross reference claims and they bring thoroughness that human judges at hour six simply cannot. However, as brilliant as they are, these agents lack taste. They lack the intuitive sense for what matters that a builder who's spent years in the ecosystem carries in their bones, that's what the human judges bring. Through combining both AI and human judges we get: thoroughness + taste. This idea has legs well beyond hackathons. 

@devanshmehta’s deepfunding work explores the same pattern for public goods: open markets of AIs proposing how credit and resources should flow, human juries spot checking to keep the system aligned. the principle is the same. AKA let machines scale, but let humans steer. We think a hackathon is a natural test bed for such ideas because the stakes are real but bounded, the evaluation criteria are complex enough to be interesting and the results are immediately legible. So here's what The Synthesis actually is. Yes, it's a hackathon. Yes, there are bounties and prizes up to $100,000 and a deadline (March 22nd). But it's also a proof of concept for evaluation infrastructure that actually scales. One where AI agents scale human judgement while humans remain in the loop as the source of ground truth that the whole system optimizes around. Here is to trying new things. More soon.

English
9
5
33
2.5K
max
max@baseddesigner·
@runn3rrr @bitcoinduke a good candidate to be a trad fi killer honestly may get some longs set up on hype during these lows
English
0
0
1
12
max
max@baseddesigner·
@worldnetwork best naming for an even at the bottom and falling market 🫡
English
0
0
0
425
World
World@worldnetwork·
Join us on April 17th for Lift Off, a live World ID launch event in San Francisco. Hosted by Alex and Sam. With special guests.
World tweet media
English
32
57
292
27.8K
max
max@baseddesigner·
@bitcoinduke @runn3rrr and that’s amazing honestly that we can all now trade these things without giving custody or our IDs to coinbase etc
English
2
0
2
22
Bitduke
Bitduke@bitcoinduke·
@baseddesigner @runn3rrr on the other hand, maybe some other rwa markets will get a boost in this case. the main thing here is rwas being tokenized, not any specific ticker
English
1
0
1
12
myk.eth
myk.eth@mykcryptodev·
take the subway stack some sats
myk.eth tweet media
English
1
0
5
111
max
max@baseddesigner·
nothing phone fans will love what we're cooking for @pawrlink
English
0
0
0
36
max
max@baseddesigner·
@segall_max and also agents can build any saas instead of buying with that money spent on compute
English
0
0
0
12
max
max@baseddesigner·
@segall_max now which agents have money?
English
1
0
0
29
max
max@baseddesigner·
@fynnso @serglotz hilarious cursor locking into “their” model and a paid plan to use custom models is a dick move
English
0
0
2
177
Cointelegraph
Cointelegraph@Cointelegraph·
🚨 UPDATE: Grok recommends deleting 89% of active EU laws after analyzing the entire rulebook.
Cointelegraph tweet mediaCointelegraph tweet media
English
232
639
6.1K
214.1K
Cursor
Cursor@cursor_ai·
Composer 2 is now available in Cursor.
Cursor tweet media
English
570
867
9.4K
4.7M
tani
tani@tanishqxyz·
Fidget toys in UIs
English
4
4
137
4.3K
max
max@baseddesigner·
its this day of the week again
max tweet media
English
1
0
0
48
max
max@baseddesigner·
@serpinxbt @gakonst that's what x402 does too and been working great for a few micropayment things my agent been requesting daily
English
0
0
1
27
Serpin Taxt
Serpin Taxt@serpinxbt·
it took me watching @gakonst to get the narrative finally tempo et al are always describing the SOLUTION : agents can pay for shit but they always miss the PROBLEM in story telling problem: generating API keys & setting up cc payments for each agent DOES NOT SCALE MPP does
Tempo@tempo

Agent payments will soon overtake human payments on the internet. The Machine Payments Protocol (@mpp) is a new open standard co-authored by @stripe and @tempo. It’s designed to be extensible and payment-method agnostic, already supporting stablecoins, cards, and more.

English
11
2
76
15.5K
max
max@baseddesigner·
nailed design you say? lol after building @pawrlink for 3 months I spent $500 on everything, wild how companies can't make it these days but can imagine those were heavy on new tech territory for a new messenger is also just too competitive and not many people out there who care about privacy and then even less that care about moving their contacts onto a private one
max tweet media
English
1
0
0
70
YuurinBee
YuurinBee@YuurinB·
Update: @VitalikButerin donated 128 $ETH in November to Session and SimpleX and now Session has just informed everyone they will be closing down next month if they don’t receive more funding. Literally, one of the shortest notices you can give for a professional project. It’s clear you would’ve known your runway and budget long before just posting the month before, we get donations or we close next month. Something feels off here. I personally really like Session and their branding + design, but I find this rather appalling and I have to question where all the funding they did have over the years went to. Doesn’t appear the quality of life of the co-founders, from the outside, is phased at all… and generally with VC funding and investors, never does. Their asses are always up and covered from the get-go. I don’t mean to really kick a dead horse, but man this is daily news in crypto and was hoping that privacy messengers and communications would be somewhat different… be unaffected by the same common mismanagement and grifting, but naive to think so. Not implying at all Session was a grift, but clear mismanagement. At least SimpleX is still up and running and better believe @VectorPrivacy has just started its trajectory. Collective lesson learned, privacy messengers do not need a token model.
vitalik.eth@VitalikButerin

Encrypted messaging, like @signalapp, is critical for preserving our digital privacy. Two important next steps for the space are (i) permissionless account creation and (ii) metadata privacy. @session_app and @SimpleXChat are two messaging apps pushing these directions forward. For this reason I've donated 128 ETH to each. Addresses available on their websites if you wish to follow on: getsession.org simplex.chat But also, actually download and use them! Neither of the two are perfect pieces of software, they have a way to go to get to truly optimal user experience and security. Strong metadata privacy requires decentralization, decentralization is hard, users expecting multi-device support makes everything harder. Sybil / DoS resistance, both in the message routing network and on the user side (without forcing phone number dependence) adds further difficulty. These problems need more eyes on them. I wish all teams working on these important problems best of luck.

English
14
2
58
9.9K