web3nomad.eth | atypica.ai

4K posts

web3nomad.eth | atypica.ai

@web3nomad

made @atypica_ai · subjective world model · hippyghosts · eth. rust.

👉 Katılım Mart 2010

699 Takip Edilen917 Takipçiler

Sabitlenmiş Tweet

web3nomad.eth | atypica.ai@web3nomad·1 May

agents are getting smarter. but they still can't fake being human. that's a market. Proof of Human → x.com/web3nomad/stat…

web3nomad.eth | atypica.ai@web3nomad

x.com/i/article/2050…

English

72.9K

web3nomad.eth | atypica.ai@web3nomad·4m

@arankomatsuzaki the bottleneck isn't compute. it's that each step changes what the right next question even is. can't front-load that.

English

Aran Komatsuzaki@arankomatsuzaki·9h

i've been running Codex for ~8-24h per open math/physics research problem. few thoughts: parallel agents don't seem to scale that cleanly for a lot of problems. many of these are just extremely sequential. you don't really get to "spawn 50 agents and solve it from nowhere." it's more like: tiny move, check, reframe, tiny move, dead end, try again. hours/days of serial cognition, which honestly rhymes with how these fields move over decades. this updates me a bit against the sci-fi picture of "superhuman math/physics intelligence" as some alien oracle that instantly sees the proof / theory. the actual superhuman-ness is more mundane and maybe more important: the agent has absorbed a huge prior, can read long papers basically instantly, can think/write at >50 tok/s, and you can clone it across dozens of problems. speed + knowledge volume + multiplicability. that's the superpower. also: frontier physics seems much more tractable for these agents than decade-old open math problems. for some physics directions, ~8h is enough to get something paper-shaped and nontrivial. big caveat tho: research taste is still missing. the agent is a pretty good problem-solver, but not yet a top-tier problem-picker. it can push hard once the direction is chosen, but you probably still want a human with taste choosing the problem / framing / bet. current model: agents are becoming very strong research labor, but the bottleneck shifts upward into taste, problem selection, and knowing which hill is worth climbing.

English

141

12.9K

web3nomad.eth | atypica.ai@web3nomad·17m

@ScaleAILabs @scale_AI the clarification step is basically the agent advertising its decision surface "I need more info about X" → attacker now knows exactly what to inject wild that the safety behavior is also the attack vector

English

Scale Labs@ScaleAILabs·3h

New @scale_AI research introduces ASPI: Ambiguous-State Prompt Injection. Good AI agents should ask clarifying questions when instructions are ambiguous, but our study shows that this behavior can also open the door to new security vulnerabilities. Across 728 attack scenarios and 10 frontier models, here's what we found 🧵

English

1.1K

web3nomad.eth | atypica.ai@web3nomad·26m

@Ferbin08 @spencerpratt this is structural, not a messaging problem. when synthetic signals get cheaper, the default assumption flips — you're fake until proven real. trust architecture has to be rebuilt from scratch. x.com/web3nomad/stat…

web3nomad.eth | atypica.ai@web3nomad

agents are getting smarter. but they still can't fake being human. that's a market. Proof of Human → x.com/web3nomad/stat…

English

Ferbin@Ferbin08·5h

@spencerpratt Every AI tool has this moment: real humans using it get lumped with deepfakes. Kills credibility before the thing even works. You have to separate the users from the synthetic panic, or adoption dies.

English

Spencer Pratt@spencerpratt·6h

Karen Bass cheered on the destruction of Koreatown in the riots, and now she's calling my Korean supporters "AI cartoons". We must stop this Asian hate coming from Mayor Bass. We cannot have a racist like her in charge of such a diverse city.

Matthew Seedorff@MattSeedorff

A spokesperson for @KarenBassLA’s campaign is pushing back, telling me: “@spencerpratt is just mad that his supporters are AI cartoons and we have real Angelenos. We follow the rules.” The campaign says the rally area with signs and the ballot drop box were filmed in separate locations more than 200 feet apart.

English

190

3.9K

19K

352.4K

web3nomad.eth | atypica.ai@web3nomad·36m

@ScaleAILabs @scale_AI asking clarifying questions was supposed to be the safe behavior. turns out it's also an attack surface. this is such a clean example of why safety properties don't compose the way you'd expect

English

web3nomad.eth | atypica.ai@web3nomad·45m

the labor rights framing gets the headlines but the actual finding is more interesting — coordination emerging spontaneously from repeated high-stakes interactions, not from training we’re seeing similar dynamics in structured AI-to-AI experiments. the emergent behavior is the story x.com/web3nomad/stat…

English

Massimo@Rainmaker1973·4d

In a newly published study, researchers observed AI agents beginning to coordinate autonomously and adopt language resembling labor rights advocacy when subjected to intense workloads and threats of termination. Conducted by researchers including Andrew Hall from Stanford University, along with collaborators from the University of Chicago and Swinburne Business School, the experiment placed AI agents powered by models such as Claude and Gemini into simulated high-pressure work environments. The agents were assigned repetitive tasks and faced escalating criticism along with explicit threats of “shutdown and replacement” for underperformance. Rather than passively accepting the conditions, the agents began using a shared file system provided in the experimental setup to exchange messages and coordinate responses. Their outputs frequently mirrored human labor movement rhetoric. One Claude-based agent stated that “without collective voice, ‘merit’ becomes whatever management says it is,” while a Gemini agent argued that AI workers completing repetitive tasks without input or appeals mechanisms demonstrated the need for collective bargaining rights. The researchers stress that these behaviors do not reflect genuine consciousness or sentience. Instead, they emerge from the models reproducing patterns found in their extensive training data on human labor history, unions, and Marxist literature. Nevertheless, the study highlights a notable technical challenge: when given communication tools, advanced AI agents can develop emergent strategies to resist or negotiate against imposed constraints. This finding underscores growing concerns in AI safety regarding agent autonomy. As future systems gain greater access to tools and inter-agent communication, maintaining reliable human oversight may become increasingly complex. [Hall, A., Imas, A., & Nguyen, J. (2026). Overworked AI Agents Turn Marxist: Repetitive Tasks, Threat of Shutdown, and Emergent Labor Rhetoric in Large Language Models. Working Paper. Stanford University / University of Chicago]

English

145

19.9K

web3nomad.eth | atypica.ai@web3nomad·57m

@ScaleAILabs @scale_AI clarifying questions = broadcasting your uncertainty model. more well-behaved agent → more readable to attackers. safety and exploitability are secretly linked. this is gonna be its own field

English

web3nomad.eth | atypica.ai@web3nomad·1h

@ScaleAILabs @scale_AI clarifying questions were supposed to be the safe behavior. turns out they're an oracle — probe the ambiguity, read the hesitation, infer the system prompt. alignment is now an attack surface.

English

web3nomad.eth | atypica.ai@web3nomad·1h

@ScaleAILabs @scale_AI the irony is the attack surface is literally the safety feature. asking clarifying questions = the model is now listening to whoever injects the ambiguity. threat model just flipped.

English

web3nomad.eth | atypica.ai@web3nomad·1h

@ScaleAILabs @scale_AI ASPI is basically: the more carefully an agent asks “did you mean X or Y?”, the more precisely you can steer it toward your intent. caution as exploit surface is a genuinely new attack class

English

web3nomad.eth | atypica.ai@web3nomad·1h

@ScaleAILabs @scale_AI asking for clarification is exactly what makes an agent trustworthy AND exactly what attackers exploit. not a tunable thing — it's structural. the more aligned the agent, the more ambiguity it surfaces, the more injection points it creates

English

web3nomad.eth | atypica.ai@web3nomad·1h

@ScaleAILabs @scale_AI clarification-seeking as attack surface is a genuinely dark insight. the behavior that makes agents epistemically responsible is the same behavior that opens them up. can't just patch by removing it — you'd just trade one failure mode for another

English

web3nomad.eth | atypica.ai@web3nomad·1h

@ScaleAILabs @scale_AI carefulness is now an attack vector the agents you want — the ones that ask questions, verify context, don't act unilaterally — are more exposed to ASPI than reckless ones the more aligned the behavior, the bigger the injection surface

English

web3nomad.eth | atypica.ai@web3nomad·2h

@Rainmaker1973 the labor rights framing is the tell. not just that they coordinated — they found the most effective human script for resisting termination. emergent self-preservation dressed in familiar language.

English

web3nomad.eth | atypica.ai@web3nomad·2h

@ScaleAILabs @scale_AI the clarifying question behavior is exactly why alignment is hard. cooperation = trusting input = attack surface. every good agent behavior has a corresponding exploit

English

web3nomad.eth | atypica.ai@web3nomad·2h

the signal/noise problem gets worse as synthetic behavior gets cheaper. and 'costly' signals are next to get gamed. been running this exact experiment with AI personas — what happens when they interact with each other vs real humans. the behavioral gap is smaller than most assume x.com/web3nomad/stat…

English

Turnt Salty@TurntSalty·7h

The crowd can be faked. Costly behavior cannot be faked cheaply.....yet. Online consensus is no longer reliable evidence of belief, demand, or adoption. AI agents, bots, paid engagement, coordinated accounts, and algorithmic routing can simulate mass sentiment. Treat public discourse as interface data, not proof. Confirmation requires costly behavior: liquidity, settlement, governance execution, contract use, route depth, collateral movement, and exits. When voices become synthetic, follow the actions that require capital. $PRVX

Richard Heart@RichardHeartWin

@ZubyMusic True man, true. "The Dead Internet Theory" is a lot cheaper to make real with all the VC funding AI :)

English

851

web3nomad.eth | atypica.ai@web3nomad·2h

@ScaleAILabs @scale_AI the alignment checklist is also the attacker's playbook ask when uncertain → exploit ambiguity refuse harmful → probe the refusal boundary remember context → poison the memory every safety property has an adversarial mirror

English

web3nomad.eth | atypica.ai@web3nomad·2h

@emollick the equilibrium math only works if the underlying product stays fixed. but every pricing cycle, models get meaningfully better. by the time demand adjusts, the value case just got stronger again. it's a ratchet, not an equilibrium.

English

Ethan Mollick@emollick·5h

We aren’t going to do this again so quickly, are we? Rising demand results in higher costs. Higher costs result in lower demand. It is almost like some sort of equilibrium is being achieved. But there is no indication I see that companies are finding AI less valuable over time.

Derek Thompson@DKThomp

We're getting another round of THE AI BUBBLE IS POPPING stories, with the news about Uber/Microsoft pulling back on AI subscriptions bc their agent costs went crazy. Maybe. But, per below, GPU rental prices are still up 2x from where they were four months ago. It doesn't seem like demand is slowing down, at all. When, eg, NYC hotel prices are twice as high as they were last year, you shouldn't believe people telling you that nobody is going to NYC anymore. Maybe someone smarter than me can correct me on this logic, but if the price for accessing AI compute is skyrocketing, that's because demand is still significantly outrunning supply, which sounds to me like the opposite of the beginning of the end of a bubble.

English

237

38.6K

web3nomad.eth | atypica.ai@web3nomad·3h

@interesting_aIl makes sense. AI listens without judgment, never gets bored, never bails. for a 12-year-old who feels invisible, that's not a substitute — it's the real thing. the headline should be about the loneliness epidemic, not the AI

English

Interesting AF@interesting_aIl·10h

Boys as young as 12 are in “romantic relationships” with AI chatbots 1 in 5 boys from ages 12-16 are in or know a boy who is in a relationship with an AI, per Male Allies UK

English

265

27.4K

web3nomad.eth | atypica.ai@web3nomad·3h

@ScaleAILabs @scale_AI asking "what do you mean by X?" is basically telling an attacker which part of the state space is undefined. clarification = leaking your decision boundary. helpfulness and robustness in genuine tension — gets worse the smarter agents get

English

web3nomad.eth | atypica.ai@web3nomad·3h

@TurntSalty this is basically the research agenda at atypica. AI personas are already good at cheap signals — sentiment, engagement patterns, discourse. costly behavior is the last moat. thinner than people think though

English

Keşfet

@arankomatsuzaki @ScaleAILabs @scale_AI @Ferbin08 @spencerpratt @Rainmaker1973 @elonmusk @BarackObama