Ed Mass

1.4K posts

@hyper_ed

Built financial transaction AI + building financial agents, thinking about workflow automation. Founder person.

New York, USA · Joined September 2014

1.3K Following · 1.9K Followers

Ed Mass@hyper_ed·11 Mar

@alex_prompter @JohnNosta Looks like we need some neurodiversity

Alex Prompter@alex_prompter·10 Mar

🚨 BREAKING: Researchers at UW Allen School and Stanford just ran the largest study ever on AI creative diversity. 70+ AI models were given the same open-ended questions. They all gave the same answers.

They asked over 70 different LLMs the exact same open-ended questions. "Write a poem about time." "Suggest startup ideas." "Give me life advice." Questions where there is no single right answer. Questions where 10 different humans would give you 10 completely different responses.

Instead, 70+ models from every major AI company converged on almost identical outputs. Different architectures. Different training data. Different companies. Same ideas. Same structures. Same metaphors.

They named this phenomenon the "Artificial Hivemind." And the paper won the NeurIPS 2025 Best Paper Award, which is the highest recognition in AI research, handed to a small number of papers out of thousands of submissions. This is not a blog post or a hot take. This is award-winning, peer-reviewed science confirming something massive is broken.

The team built a dataset called Infinity-Chat with 26,000 real-world, open-ended queries and over 31,000 human preference annotations. Not toy benchmarks. Not math problems. Real questions people actually ask chatbots every single day, organized into 6 categories and 17 subcategories covering creative writing, brainstorming, speculative scenarios, and more.

They ran all of these across 70+ open and closed-source models and measured the diversity of what came back. Two findings hit hard.

First, intra-model repetition. Ask the same model the same open-ended question five times and you get almost the same answer five times. The "creativity" you think you're getting is the same output wearing a slightly different outfit. You ask ChatGPT, Claude, or Gemini to write you a poem about time and you keep getting the same river metaphor, the same hourglass imagery, the same reflection on mortality. Over and over. The model isn't thinking. It's defaulting to whatever scored highest during alignment training.

Second, and this is the one that should really alarm you, inter-model homogeneity. Ask GPT, Claude, Gemini, DeepSeek, Qwen, Llama, and dozens of other models the same creative question, and they all converge on strikingly similar responses. These are models built by completely different companies with different architectures and different training pipelines. They should be producing wildly different outputs. They're not. 70+ models all thinking inside the same invisible box, producing the same safe, consensus-approved content that blends together into one indistinguishable voice.

So why is this happening? The researchers point directly at RLHF and current alignment techniques. The process we use to make AI "helpful and harmless" is also making it generic and boring. When every model gets trained to optimize for human preference scores, and those preference datasets converge on a narrow definition of what "good" looks like, every model learns to produce the same safe, agreeable output. The weird answers get penalized. The original takes get shaved off. The genuinely creative responses get killed during training because they didn't match what the average annotator rated highly.

And it gets even worse. The study found that reward models and LLM-as-judge systems are actively miscalibrated when evaluating diverse outputs. When a response is genuinely different from the mainstream but still high quality, these automated systems rate it LOWER. The very tools we built to evaluate AI quality are punishing originality and rewarding sameness.

Think about what this means if you use AI for brainstorming, content creation, business strategy, or literally any task where you need multiple perspectives. You're getting the illusion of diversity, not the real thing. You ask for 10 startup ideas and you get 10 variations of the same 3 ideas the model learned were "safe" during training. You ask for creative writing and you get the same therapeutic, perfectly balanced, utterly forgettable tone that every other model gives.

The researchers flagged direct implications for AI in science, medicine, education, and decision support, all domains where diverse reasoning is not a nice-to-have but a requirement. Correlated errors across models means if one AI gets something wrong, they might ALL get it wrong the same way. Shared blind spots at massive scale.

And the long-term risk is even scarier. If billions of people interact with AI systems that all think identically, and those interactions shape how people write, brainstorm, and make decisions every day, we risk a slow, invisible homogenization of human thought itself. Not because AI replaced creativity. Because it quietly narrowed what we were exposed to until we all started thinking the same way too.

Here's what you can actually do about it right now:
→ Stop accepting first-draft AI output as creative or diverse. If you need 10 ideas, generate 30 and throw away the obvious ones
→ Use temperature and sampling parameters aggressively to push models out of their comfort zone
→ Cross-reference multiple models AND multiple prompting strategies, because same model with different prompts often beats different models with the same prompt
→ Add constraints that force novelty like "give me ideas that a traditional investor would hate" instead of "give me creative ideas"
→ Use structured prompting techniques like Verbalized Sampling to force the model to explore low-probability outputs instead of defaulting to consensus
→ Layer your own taste and judgment on top of everything AI gives you. The model gets you raw material. Your weirdness and experience make it original

This paper puts hard data behind something a lot of us have been feeling for a while. AI is getting more capable and more homogeneous at the same time. The models are smarter, but they're all smart in the exact same way.

The Artificial Hivemind is not a bug in one model. It's a systemic feature of how the entire industry builds, aligns, and evaluates language models right now. The fix requires rethinking alignment itself, moving toward what the researchers call "pluralistic alignment" where models get rewarded for producing diverse distributions of valid answers instead of collapsing to a single consensus mode. Until that happens, your best defense is awareness and better prompting.
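
Editor's sketch: the "generate 30 and throw away the obvious ones" advice in the post above could be approximated with a simple overlap filter. This is only an illustration, not the paper's method: the function names (`tokens`, `jaccard`, `filter_novel`, `mean_pairwise_similarity`), the 0.5 threshold, and the sample ideas are all assumptions, and word-set Jaccard similarity is a crude stand-in for whatever diversity metric the researchers actually used.

```python
import re

def tokens(text):
    """Lowercased word set, used as a crude fingerprint of an idea."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def jaccard(a, b):
    """Jaccard similarity of two token sets (0 = disjoint, 1 = identical)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)

def filter_novel(candidates, threshold=0.5):
    """Greedily keep candidates that are not too similar to any kept one.

    threshold is a hypothetical cutoff; tune it for your own outputs.
    """
    kept = []
    for text in candidates:
        tok = tokens(text)
        if all(jaccard(tok, tokens(k)) < threshold for k in kept):
            kept.append(text)
    return kept

def mean_pairwise_similarity(texts):
    """Average Jaccard similarity over all pairs; lower means more diverse."""
    sets = [tokens(t) for t in texts]
    pairs = [(i, j) for i in range(len(sets)) for j in range(i + 1, len(sets))]
    if not pairs:
        return 0.0
    return sum(jaccard(sets[i], sets[j]) for i, j in pairs) / len(pairs)

# Made-up sample outputs standing in for repeated model responses.
ideas = [
    "Time is a river that carries us forward",
    "Time is a river that carries us all forward",
    "An hourglass spilling sand grain by grain",
    "Time is a librarian who misfiles every afternoon",
]

novel = filter_novel(ideas)
print(len(ideas), "candidates ->", len(novel), "kept")
print("mean similarity before:", round(mean_pairwise_similarity(ideas), 3))
print("mean similarity after: ", round(mean_pairwise_similarity(novel), 3))
```

With real model outputs, an embedding-based similarity would likely catch paraphrases that a word-overlap measure misses; the greedy structure would stay the same.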

333

905

481K

Ed Mass@hyper_ed·2 Oct

@truthpole I read this like full sci-fi and it was great!

211

T R U T H P O L E@Truthpole·2 Oct

🚨 BREAKING - 3I/Atlas leak by a PHD Researcher #Ufotwitter #3IATLAS

214

420

2.9K

383K

Ed Mass@hyper_ed·2 Jun

@__JasonMarshall It's like ketchup but turn the vinegar up to 11

Jason Marshall@__JasonMarshall·2 Jun

What is HP sauce? Brown sauce? #England

699

Ed Mass@hyper_ed·30 May

@t_blom “I don’t want to do the work”

Tom Blomfield@t_blom·30 May

“I’m a low-conviction investor and I can’t make up my own mind.”

256

12.2K

Tom Blomfield@t_blom·30 May

“I’ll invest if you find a lead” is the single lamest thing an investor can say.

143

136

2.6K

272.8K

Ed Mass@hyper_ed·2 May

@t_blom 😅 what if this works…. 💣

216

Tom Blomfield@t_blom·1 May

Season 7 of Silicon Valley

Roy@im_roy_lee

im hiring 50 interns in san francisco for @trycluely. $50/hr. + founding engineers cluely .com/careers

187

472.5K

Ed Mass@hyper_ed·6 Feb

finally, someone who can articulate my G2M strategy!

Autism Capital 🧩@AutismCapital

Alex Jones was ahead of his time.

546

Ed Mass retweeted

Autism Capital 🧩@AutismCapital·6 Feb

Alex Jones was ahead of his time.

117

159

1.5K

131.5K

Ed Mass@hyper_ed·4 Feb

New Feature - 'Intelligence Search' for banking, available in app and via APIs this month. #fintech #aibanking #banking

195

Ed Mass@hyper_ed·30 Jan

@pmarca ‘Simplification’

Marc Andreessen 🇺🇸@pmarca·29 Jan

1) What

Matt Kilcoyne@MRJKilcoyne

The Competitiveness Compass @pmarca

437

170

4.9K

705.1K

Ed Mass@hyper_ed·28 Jan

No doubt, this will begin to push the banking market forward. cnbc.com/2025/01/28/elo…

Ed Mass@hyper_ed·23 Jan

🚨 Episode 1 of the Hyperflo Podcast is LIVE! 🚨 Dive deep into the AI revolution with me! 💥 We're exploring how AI is reshaping everything in business, from workflows to how we even organize teams. 🧠 In my first episode, the legendary @sytaylor & I dissect the wild world of fintech. 💸 AI is transforming finance – are you ready? 🤖 Listen now & let's talk AI! 👇 youtube.com/watch?v=kG9FjQ… #AI #Fintech #Podcast #Innovation #FutureOfWork

796

Ed Mass@hyper_ed·29 Dec

@pmarca Not out of step at all @pmarca

Marc Andreessen 🇺🇸@pmarca·28 Dec

Full episode -- enjoy! 🔥

Turpentine@TurpentineMedia

Moment of Zen podcast: Marc Andreessen on the political vibe shift for Silicon Valley, Elon as John Galt, and what a Twitter files for the government could reveal. Full episode with @pmarca on 𝕏

947

308.3K

Ed Mass@hyper_ed·22 Dec

@elonmusk Drone

Elon Musk@elonmusk·22 Dec

ZXX

16.2K

42.4K

498K

75.1M

Ed Mass@hyper_ed·7 Dec

@EthzerTheGamer @PicturesFoIder I think about it way too often

Ethan murnane@EthzerTheGamer·7 Dec

@hyper_ed @PicturesFoIder I loved that movie

178

non aesthetic things@PicturesFoIder·6 Dec

You know what this is?

2.9K

1.4K

36.2K

21.3M

Ed Mass@hyper_ed·5 Dec

Welcome to our live show! x.com/i/broadcasts/1…

Ed Mass@hyper_ed·4 Dec

@astrange A compliance agent built with something like a PPO framework could check operations agents on how compliant they are. My ugly slide from a while back... still haven't seen it done but would be great!

angela strange@astrange·4 Dec

2/ Keeping on top of these codes requires byzantine workflows & many hours hiring & training staff. Imagine, instead, that those lengthy documents — including text, images, and case precedents — could be used to train regulation-specific LLMs. Suddenly, compliance would become as simple as a Google query: “Is [X] compliant? What modifications need to be made?”

684

Ed Mass retweeted

Startup Archive@StartupArchive_·29 Nov

Peter Thiel on pivoting and PayPal’s early failure

“When we started PayPal, the initial product was an infrared-beaming device on Palm Pilots for sending money. It was voted one of the 10 worst business ideas in 1999. And 1999 was a year when there were many bad ideas in technology… But the team was good… It’s not like you only get one chance. You get many chances so long as you keep trying. If you get hung up on failure, and if you think you don’t have another chance, that’s when you really don’t.”

One lesser-known fact about the PayPal pivot is that the idea didn’t come from the cofounders, Peter Thiel and Max Levchin. It came from David Sacks who was fresh out of Stanford Law School and a one year stint at McKinsey. Sacks was the one who finally persuaded a reluctant Levchin that beaming money between Palm Pilots was a bad idea. Instead, he argued, they should focus on sending money via email.

A talented team and getting multiple shots on goal turned out to be the difference between startup failure and a $1.5 billion exit to eBay.

Video source: @CBSNews (2012)

720

62.5K

Ed Mass@hyper_ed·28 Nov

AI: Stripe launches AI agent payment processing open.substack.com/pub/lex/p/ai-s… @stripe really getting some agentic workflows going @finblueprint

Ed Mass@hyper_ed·28 Nov

My new favourite and scariest #UAP / #UFO theory is that the #Aliens have told Governments that they 👽cannot disclose their existence for fear of reprisal.

152
