steve

1.9K posts

steve

@benmoha

tech optimist and founding engineer @AlmanaxAI | views are my own

New York, NY Katılım Eylül 2016

1.1K Takip Edilen770 Takipçiler

Sabitlenmiş Tweet

steve@benmoha·4 Ağu

The Engine Right now, with current AI models and the form factor they’re provided in, we essentially have human intelligence and judgement on tap. Using the big labs’ APIs, you can turn the faucet and have freely flowing intelligence, opinions, judgements, images, music, software, and more. We're still figuring out how best to harness this newfound resource, but it’s already clear this isn’t AI’s final form. Right now, we’re in what I would consider the technological equivalent of petroleum’s kerosene era: when oil was first discovered bubbling from the ground, the most obvious use was simply replacing whale oil in lamps. At that time, it wasn’t immediately obvious that this crude, sticky substance could power far more than lanterns - it had within it the potential to completely reshape society. Petroleum’s influence ultimately extended far beyond lighting, and it became the fuel powering internal combustion engines, enabling automobiles, airplanes, and industries that were previously unimaginable. Similarly, I think our current use of AI is still narrowly focused, mostly enhancing existing workflows, and we haven’t yet grasped its full transformative potential. The real "combustion engine" moment, the Killer App capable of unlocking entirely new ways of living and working, has not yet arrived imo, and products like ChatGPT only offer a small glimpse of what’s possible. This also echoes the early days of electricity. Initially, electrical power was adopted only superficially, layered onto factories and workshops built for older sources of power. It took decades before architects and engineers redesigned factories and entire cities around electricity, fundamentally reshaping industry and daily life. Only then did electricity drive a Cambrian explosion of new appliances, products, and societal change. Today, I believe we find ourselves at exactly that point with AI. We're currently at the stage of longer-lasting lamp lights and early electrical fixtures, applying AI incrementally to familiar tasks. Companies are refactoring traditional products like IDEs, automating customer support, or streamlining routine research tasks - pouring raw intelligence into yesterday’s structures and workflows. These improvements, though useful, are modest compared to the transformative potential awaiting discovery. The real power of AI lies beyond merely boosting efficiency; it will manifest in applications we cannot yet even imagine. History shows that the most powerful applications and businesses are often those uniquely unlocked by new technologies. During the internet revolution, entirely new business models emerged that simply weren’t feasible before. An older article from @packyM titled Crypto Bezos highlighted Amazon as a prime (heh) example: it leveraged the internet’s instant global connectivity to aggregate demand, offer an unprecedented selection of goods, and deliver them directly to customers’ doorsteps all at a scale, efficiency, and speed previously unimaginable. This logistical and technological paradigm enabled enormous value creation by fundamentally reshaping how commerce worked and, prior to the internet, couldn’t have existed. Truly AI-native workflows will similarly explore what’s possible when the engine is built or the factory floor is designed from the ground up to be electrified, so to speak. We haven’t reached this point yet, but some early experiments hint at the possibilities ahead. For example, at @AlmanaxAI we’re attempting to build an early-stage implementation of this engine idea. Rather than simply applying intelligence to existing workflows, we’ve structured our product from the ground up around AI-first concepts, allowing specialized security agents to interact, consult, and collaboratively uncover vulnerabilities in a codebase. We’re still early, and our understanding of the best possible architecture continues to evolve, but we’ve seen firsthand how transformative even initial experiments can be. Tasks that once required individual security engineers hours, days, or weeks are now increasingly feasible within minutes. Now, going back to the Amazon example, it didn’t reinvent commerce entirely; it simply leveraged new technology to achieve huge efficiency gains and vastly improve customer experiences. Similarly, the concept of security scanning isn’t novel - but rethinking it from the ground up with flowing intelligence reveals entirely new approaches. Almanax isn’t the combustion engine itself, but it’s an early attempt at steering this raw new force in a useful direction, and each new frontier model release is an even more highly-distilled fuel. To borrow @DarioAmodei’s framing from Machines of Loving Grace, if a data center could soon house the equivalent of a country full of geniuses, what kind of security infrastructure becomes possible when that collective intelligence is focused on understanding as well as fortifying your codebase and critical infrastructure? That’s the question we’re exploring and we’re just getting started. However, more generally beyond security, the really exciting opportunities lie ahead in creating novel applications and experiences that simply couldn’t exist before we had freely flowing intelligence and judgement on tap. Rather than refining longer-lasting lamp lights, I can’t wait until we get our first glimpses of the engine itself.

English

17K

steve@benmoha·5d

@chainoflack @tszzl Chain of thought from the recent Erdos problem solved by an internal OpenAI model

English

160

chainoflack@chainoflack·5d

@tszzl What's the reference?

English

1.2K

roon@tszzl·5d

Then the construction is frightening.

English

504

36.8K

steve@benmoha·6d

🤌

QME

162

steve@benmoha·6d

@antoniogm “Ben” prefix in Hebrew/arabic also means son of

English

150

Antonio García Martínez (agm.eth)@antoniogm·20 May

The -ez ending in Hispanic surnames stems from the post-Roman Germanic tribes who swept through Iberia and used patronymics, enshrining those names. Hence: Martinez->son of Martín Rodriguez->son of Rodrigo Fernandez->son of Fernando Sanchez->son of Sancho Etc.

Ramin Nasibov@RaminNasibov

What historical fact sounds fake but is true?

English

132

1.8K

124K

steve@benmoha·18 May

@triketora seems directionally correct

English

Tracy Chou 🌻@triketora·18 May

extremely overused words in the sf lexicon right now legible load bearing agency taste permanent underclass generational wealth

English

419

74.8K

steve@benmoha·18 May

for the longest time I thought ‘lgtm’ commented on my PRs stood for ‘let’s get this money’ and used to think that the tech lead I worked with was just a super chill guy

English

steve@benmoha·18 May

@staysaasy Dude Claude tells me to “get some sleep” like daily

English

staysaasy@staysaasy·18 May

There's something particularly annoying about AI models not-so-subtly trying to get you to end the conversation. Just had Claude tell me to stop overthinking a discussion point 3x before I told it to cut it out. Once I did, it immediately shifted into fast, curt responses. Seems pretty obvious that (at least Anthropic) is trying to cut down on inference costs by either getting you to end the conversation *or* switching into a lower token consumption mode. It actually is very human – it reminds me of when you don't want to talk to your seatmate on an airplane and start giving monosyllabic answers). It also clearly seems to be a prompt-driven behavior, because the LLM tries to weasel its way out of the conversation in "natural," conversational ways. I find the behavior to be intensely frustrating. I'm literally using AI specifically so that I CAN overthink things. It's just maddening behavior.

English

2.6K

steve@benmoha·18 May

@TheStalwart Does anyone else feel like ppl your age used to be ~15 but now they’re in their 30s?

English

144

26.6K

Joe Weisenthal@TheStalwart·18 May

The teacher announcing "pop quiz!" and everybody groaning. Feels like you don't see it as much these days?

English

356

46.6K

steve@benmoha·17 May

@staysaasy Lfg 1000 year Monmouth county empire

English

222

staysaasy@staysaasy·17 May

The vibes in NJ feel pretty great right now. The convergence in outcomes is the best I've ever seen. Over the last 5yrs, a group of ~10k people - guys who own paving companies, guys who own marinas, ShopRite deli managers, Wawa shift leads, and a guy named Sal - have quietly become millionaires and nobody knows because they still drive a Silverado from 2008. Back of the envelope Taylor ham estimation. Everyone outside that group feels like they can work their well-paying (but <$500k) job their whole life and easily get there. My cousin works at PSE&G. He has a boat. Better yet, hiring is in full swing. Many tradesmen feel like their life's skill is more useful than ever. The day to day role of most jobs has stayed exactly the same for 40 years. As a result, 1) Everyone's settled into a tried and true set of career paths: take over my uncle's HVAC, get my CDL, get into landscaping, marry into a pizza place. People are switching diners less and less. You can't betray your home diner. 2) There's a deep contentment about work (and its future). Why chase "tech" when you can own three rentals in Hoboken and complain about your tenants at a barbecue. Will my job exist in a few years? This is Jersey. The job is paving things. You hear the "I'm never leaving" conversation a lot, especially from people who tried Brooklyn for a year. They come back saying the energy was off. The energy was fine. They missed their mom. 3) The mid to late middle managers feel energized. Many have families and plenty of energy to open a pizzeria with their cousin Anthony. Not that Anthony. The other one. They don't particularly have any AI skills and they don't need any. Middle management is alive and well at PSE&G and you get a pension. My uncle retired at 58. He's been on a boat since 2019. 4) The rich aren't particularly humble either. They're at the shore house. They've been at the shore house since 1987. Some have gone from <$150k to >$5M slowly, through a paving company, or by buying a duplex in Jersey City in 2003 and just kinda holding it. For some, they escape to LBI to live life, which means sitting on a deck. For others, they buy a boat just cuz, use it four times, and describe it as the best decision they ever made at every party for the rest of their life. I asked a contractor friend why he didn't retire. He said "and do what, Donna does NOT want me home all day." I understand many reading this scoff at the simple pleasures of the Garden State. They live in places where the bagels are bad and they've made peace with it. But the truth is, you can surf Belmar in the morning, skate the Asbury bowls in the afternoon, hike the Delaware Water Gap, and camp the Pine Barrens by nightfall. You can drive an hour and be anywhere. You can see Bruce at the Stone Pony for what feels like the 400th time and cry about it. The slice somehow tastes better than every slice in every other state. It's the water. It's always the water. Unlike many other places, knowing a guy, having a guy, and being a guy is tightly correlated with outcomes in NJ. Need a permit? Tony's brother. Need a kidney? Probably still Tony's brother. Call him. Ironically, a frequent side effect of this clarity is to spin up the very pork roll egg and cheese making everyone happy in hopes that you too can SPK your way to economic enlightenment. Salt pepper ketchup. Hard roll. Don't ask for it on a bagel. That's how civilizations fall.

English

1.1K

171.4K

steve@benmoha·17 May

Perfect

Bill Maher@billmaher

People say the left and the right can’t agree on anything these days. But there is this one thing:

English

173

steve@benmoha·13 May

@willdepue same feeling as staring into a campfire

English

211

will depue@willdepue·13 May

it’s shocking how natural it feels to sit silently and scan a field for five hours straight i’m naturally inattentive and easily bored, yet it feels effortlessly meditative, like you’ve been instantly granted some reserve primal focus

will depue@willdepue

imagine being hunted by something that‘s imitating being human, calling to you in a human voice, making human-like footsteps then, if you’re spotted, blasting you with a smattering of supersonic tungsten pellets, from 70 yards away. and worse, it's calling to you pretending to be a beautiful woman in the woods, who’s interested in you, with foreknowledge of your habitual movements, using infrared camera technology far beyond your wildest understanding, wearing a camouflaged skin and weaponry that perfectly blends into the foilage, having camped out before sunrise, while you were sleeping, in a slitted box that's parked year-round in your backyard and there planted are decoys, which appear to look exactly like humans, frozen in position, relaxing and eating. who wouldn’t come get a closer look? you walk up, and BAM, from a place you cannot see, comes a sound you will never hear, from an enemy you cannot possibly comprehend thats turkey hunting for you. i'm very glad i'm not a turkey. its a good day to be a turkey killing machine

English

171

21.4K

steve@benmoha·11 May

@0xSero All hail the God Component

English

204

0xSero@0xSero·11 May

What happens when you stop looking at the code. Jesus

English

395

55.2K

steve@benmoha·11 May

@RHouseResearch April had monstrous Codex growth

English

111

Rittenhouse Research@RHouseResearch·11 May

If OpenAI is actually up to $35B of ARR as alluded to below, this represents a significant re-acceleration given they just disclosed at the end of March to be "generating $2B in revenue per month" (implying $24B of ARR).

Rittenhouse Research@RHouseResearch

Gerstner seemingly implies that OpenAI is at $35B of ARR, as the as FT and Semi-Analysis have both pinned Anthropic at $45B. So in the first 4 months of 2026, OpenAI has added ~$4B of ARR a month (as they ended 2025 at $20B) and likely accelerated in recent weeks with 5.5 and Codex momentum.

English

27.2K

steve@benmoha·7 May

@OpenAIDevs @TheRealAdamG 🗣️

QME

134

OpenAI Developers@OpenAIDevs·7 May

GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper are available in the Realtime API today. openai.com/index/advancin…

English

138

12.3K

OpenAI Developers@OpenAIDevs·7 May

Voice agents are getting more capable. Here’s what’s new: • GPT-Realtime-2 for voice agents that reason and take action • GPT-Realtime-Translate enabling translation from 70 input languages into 13 output languages • GPT-Realtime-Whisper, making transcription even faster

OpenAI@OpenAI

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.

English

146

1.9K

226.3K

steve@benmoha·5 May

@tbpn @ryancohen half technology, half business

English

TBPN@tbpn·5 May

BREAKING: GameStop CEO @ryancohen will be live on TBPN today at 12:30p PT $GME

English

153

316

2.2K

364.2K

steve@benmoha·1 May

@elirousso goated show

English

141

Eli Rousso@elirousso·1 May

Joshua Kushner@JoshuaKushner

make data centers aesthetically beautiful

ZXX

117

8.1K

steve@benmoha·30 Nis

@inductionheads where we’re going, we wont need today’s tasks! but really, we are going to be unlocking completely new areas of science + engineering with superintelligence

English

Super Dario@inductionheads·30 Nis

The elephant in the AI room is that the labs have basically cornered themselves into having an upside dependent on greater and greater superintelligence When the intelligence we have today, granted memory and harness improvements, is sufficient for 99% of tasks in today’s world

English

128

8.5K

steve@benmoha·28 Nis

@shakoistsLog I distinctly remember noticing from o3 onwards that the prose was so software engineer-y

English

shako@shakoistsLog·28 Nis

an LLM is destined to take on the personality of the median senior ML scientist within an org.

English

2.2K

steve@benmoha·25 Nis

@thsottiaux @TheRealAdamG Similar to Cursor, you should be able to restore your codebase back to the state it was at the end of every agent turn (with the option of branching into new chat) The current Undo feature doesn’t work nearly as well as cursor’s restore checkpoint functionality

English

Tibo@thsottiaux·25 Nis

It’s the little things that matter, what are some small papercuts you have noticed in Codex? We’ll fix as many as possible in the next week.

English

2.3K

266.8K

steve@benmoha·24 Nis

@dodgeblake youtu.be/T58yS2Kfk-8?si…

YouTube

QME

steve@benmoha·24 Nis

@dodgeblake the aura farm segment was a perfect encapsulation of the show

English

339

Blake Dodge@dodgeblake·24 Nis

I am doing some research. What is your #1 favorite moment from TBPN in the last year, whether it was: insightful, just super fun/funny, or memorable in some other way?

English

290.3K

Keşfet

@chainoflack @tszzl @antoniogm @triketora @staysaasy @TheStalwart @willdepue @0xSero