steve

1.9K posts

steve banner
steve

steve

@benmoha

tech optimist and founding engineer @AlmanaxAI | views are my own

New York, NY Katılım Eylül 2016
1.1K Takip Edilen770 Takipçiler
Sabitlenmiş Tweet
steve
steve@benmoha·
The Engine Right now, with current AI models and the form factor they’re provided in, we essentially have human intelligence and judgement on tap. Using the big labs’ APIs, you can turn the faucet and have freely flowing intelligence, opinions, judgements, images, music, software, and more. We're still figuring out how best to harness this newfound resource, but it’s already clear this isn’t AI’s final form. Right now, we’re in what I would consider the technological equivalent of petroleum’s kerosene era: when oil was first discovered bubbling from the ground, the most obvious use was simply replacing whale oil in lamps. At that time, it wasn’t immediately obvious that this crude, sticky substance could power far more than lanterns - it had within it the potential to completely reshape society. Petroleum’s influence ultimately extended far beyond lighting, and it became the fuel powering internal combustion engines, enabling automobiles, airplanes, and industries that were previously unimaginable. Similarly, I think our current use of AI is still narrowly focused, mostly enhancing existing workflows, and we haven’t yet grasped its full transformative potential. The real "combustion engine" moment, the Killer App capable of unlocking entirely new ways of living and working, has not yet arrived imo, and products like ChatGPT only offer a small glimpse of what’s possible. This also echoes the early days of electricity. Initially, electrical power was adopted only superficially, layered onto factories and workshops built for older sources of power. It took decades before architects and engineers redesigned factories and entire cities around electricity, fundamentally reshaping industry and daily life. Only then did electricity drive a Cambrian explosion of new appliances, products, and societal change. Today, I believe we find ourselves at exactly that point with AI. We're currently at the stage of longer-lasting lamp lights and early electrical fixtures, applying AI incrementally to familiar tasks. Companies are refactoring traditional products like IDEs, automating customer support, or streamlining routine research tasks - pouring raw intelligence into yesterday’s structures and workflows. These improvements, though useful, are modest compared to the transformative potential awaiting discovery. The real power of AI lies beyond merely boosting efficiency; it will manifest in applications we cannot yet even imagine. History shows that the most powerful applications and businesses are often those uniquely unlocked by new technologies. During the internet revolution, entirely new business models emerged that simply weren’t feasible before. An older article from @packyM titled Crypto Bezos highlighted Amazon as a prime (heh) example: it leveraged the internet’s instant global connectivity to aggregate demand, offer an unprecedented selection of goods, and deliver them directly to customers’ doorsteps all at a scale, efficiency, and speed previously unimaginable. This logistical and technological paradigm enabled enormous value creation by fundamentally reshaping how commerce worked and, prior to the internet, couldn’t have existed. Truly AI-native workflows will similarly explore what’s possible when the engine is built or the factory floor is designed from the ground up to be electrified, so to speak. We haven’t reached this point yet, but some early experiments hint at the possibilities ahead. For example, at @AlmanaxAI we’re attempting to build an early-stage implementation of this engine idea. Rather than simply applying intelligence to existing workflows, we’ve structured our product from the ground up around AI-first concepts, allowing specialized security agents to interact, consult, and collaboratively uncover vulnerabilities in a codebase. We’re still early, and our understanding of the best possible architecture continues to evolve, but we’ve seen firsthand how transformative even initial experiments can be. Tasks that once required individual security engineers hours, days, or weeks are now increasingly feasible within minutes. Now, going back to the Amazon example, it didn’t reinvent commerce entirely; it simply leveraged new technology to achieve huge efficiency gains and vastly improve customer experiences. Similarly, the concept of security scanning isn’t novel - but rethinking it from the ground up with flowing intelligence reveals entirely new approaches. Almanax isn’t the combustion engine itself, but it’s an early attempt at steering this raw new force in a useful direction, and each new frontier model release is an even more highly-distilled fuel. To borrow @DarioAmodei’s framing from Machines of Loving Grace, if a data center could soon house the equivalent of a country full of geniuses, what kind of security infrastructure becomes possible when that collective intelligence is focused on understanding as well as fortifying your codebase and critical infrastructure? That’s the question we’re exploring and we’re just getting started. However, more generally beyond security, the really exciting opportunities lie ahead in creating novel applications and experiences that simply couldn’t exist before we had freely flowing intelligence and judgement on tap. Rather than refining longer-lasting lamp lights, I can’t wait until we get our first glimpses of the engine itself.
English
1
3
38
17K
steve
steve@benmoha·
@chainoflack @tszzl Chain of thought from the recent Erdos problem solved by an internal OpenAI model
English
1
0
1
160
roon
roon@tszzl·
Then the construction is frightening.
English
25
25
504
36.8K
steve
steve@benmoha·
🤌
steve tweet media
QME
0
0
9
162
steve
steve@benmoha·
@antoniogm “Ben” prefix in Hebrew/arabic also means son of
English
0
0
0
150
Tracy Chou 🌻
Tracy Chou 🌻@triketora·
extremely overused words in the sf lexicon right now legible load bearing agency taste permanent underclass generational wealth
English
60
5
419
74.8K
steve
steve@benmoha·
for the longest time I thought ‘lgtm’ commented on my PRs stood for ‘let’s get this money’ and used to think that the tech lead I worked with was just a super chill guy
English
0
0
1
56
steve
steve@benmoha·
@staysaasy Dude Claude tells me to “get some sleep” like daily
English
0
0
0
39
staysaasy
staysaasy@staysaasy·
There's something particularly annoying about AI models not-so-subtly trying to get you to end the conversation. Just had Claude tell me to stop overthinking a discussion point 3x before I told it to cut it out. Once I did, it immediately shifted into fast, curt responses. Seems pretty obvious that (at least Anthropic) is trying to cut down on inference costs by either getting you to end the conversation *or* switching into a lower token consumption mode. It actually is very human – it reminds me of when you don't want to talk to your seatmate on an airplane and start giving monosyllabic answers). It also clearly seems to be a prompt-driven behavior, because the LLM tries to weasel its way out of the conversation in "natural," conversational ways. I find the behavior to be intensely frustrating. I'm literally using AI specifically so that I CAN overthink things. It's just maddening behavior.
English
5
0
42
2.6K
steve
steve@benmoha·
@TheStalwart Does anyone else feel like ppl your age used to be ~15 but now they’re in their 30s?
English
4
2
144
26.6K
Joe Weisenthal
Joe Weisenthal@TheStalwart·
The teacher announcing "pop quiz!" and everybody groaning. Feels like you don't see it as much these days?
English
33
13
356
46.6K
steve
steve@benmoha·
@staysaasy Lfg 1000 year Monmouth county empire
English
0
0
1
222
staysaasy
staysaasy@staysaasy·
The vibes in NJ feel pretty great right now. The convergence in outcomes is the best I've ever seen. Over the last 5yrs, a group of ~10k people - guys who own paving companies, guys who own marinas, ShopRite deli managers, Wawa shift leads, and a guy named Sal - have quietly become millionaires and nobody knows because they still drive a Silverado from 2008. Back of the envelope Taylor ham estimation. Everyone outside that group feels like they can work their well-paying (but <$500k) job their whole life and easily get there. My cousin works at PSE&G. He has a boat. Better yet, hiring is in full swing. Many tradesmen feel like their life's skill is more useful than ever. The day to day role of most jobs has stayed exactly the same for 40 years. As a result, 1) Everyone's settled into a tried and true set of career paths: take over my uncle's HVAC, get my CDL, get into landscaping, marry into a pizza place. People are switching diners less and less. You can't betray your home diner. 2) There's a deep contentment about work (and its future). Why chase "tech" when you can own three rentals in Hoboken and complain about your tenants at a barbecue. Will my job exist in a few years? This is Jersey. The job is paving things. You hear the "I'm never leaving" conversation a lot, especially from people who tried Brooklyn for a year. They come back saying the energy was off. The energy was fine. They missed their mom. 3) The mid to late middle managers feel energized. Many have families and plenty of energy to open a pizzeria with their cousin Anthony. Not that Anthony. The other one. They don't particularly have any AI skills and they don't need any. Middle management is alive and well at PSE&G and you get a pension. My uncle retired at 58. He's been on a boat since 2019. 4) The rich aren't particularly humble either. They're at the shore house. They've been at the shore house since 1987. Some have gone from <$150k to >$5M slowly, through a paving company, or by buying a duplex in Jersey City in 2003 and just kinda holding it. For some, they escape to LBI to live life, which means sitting on a deck. For others, they buy a boat just cuz, use it four times, and describe it as the best decision they ever made at every party for the rest of their life. I asked a contractor friend why he didn't retire. He said "and do what, Donna does NOT want me home all day." I understand many reading this scoff at the simple pleasures of the Garden State. They live in places where the bagels are bad and they've made peace with it. But the truth is, you can surf Belmar in the morning, skate the Asbury bowls in the afternoon, hike the Delaware Water Gap, and camp the Pine Barrens by nightfall. You can drive an hour and be anywhere. You can see Bruce at the Stone Pony for what feels like the 400th time and cry about it. The slice somehow tastes better than every slice in every other state. It's the water. It's always the water. Unlike many other places, knowing a guy, having a guy, and being a guy is tightly correlated with outcomes in NJ. Need a permit? Tony's brother. Need a kidney? Probably still Tony's brother. Call him. Ironically, a frequent side effect of this clarity is to spin up the very pork roll egg and cheese making everyone happy in hopes that you too can SPK your way to economic enlightenment. Salt pepper ketchup. Hard roll. Don't ask for it on a bagel. That's how civilizations fall.
English
54
69
1.1K
171.4K
steve
steve@benmoha·
@willdepue same feeling as staring into a campfire
English
0
0
2
211
steve
steve@benmoha·
@0xSero All hail the God Component
English
0
0
0
204
0xSero
0xSero@0xSero·
What happens when you stop looking at the code. Jesus
0xSero tweet media
English
38
7
395
55.2K
Rittenhouse Research
Rittenhouse Research@RHouseResearch·
If OpenAI is actually up to $35B of ARR as alluded to below, this represents a significant re-acceleration given they just disclosed at the end of March to be "generating $2B in revenue per month" (implying $24B of ARR).
Rittenhouse Research tweet media
Rittenhouse Research@RHouseResearch

Gerstner seemingly implies that OpenAI is at $35B of ARR, as the as FT and Semi-Analysis have both pinned Anthropic at $45B. So in the first 4 months of 2026, OpenAI has added ~$4B of ARR a month (as they ended 2025 at $20B) and likely accelerated in recent weeks with 5.5 and Codex momentum.

English
4
6
88
27.2K
OpenAI Developers
OpenAI Developers@OpenAIDevs·
Voice agents are getting more capable. Here’s what’s new: • GPT-Realtime-2 for voice agents that reason and take action • GPT-Realtime-Translate enabling translation from 70 input languages into 13 output languages • GPT-Realtime-Whisper, making transcription even faster
OpenAI@OpenAI

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.

English
92
146
1.9K
226.3K
TBPN
TBPN@tbpn·
BREAKING: GameStop CEO @ryancohen will be live on TBPN today at 12:30p PT $GME
TBPN tweet media
English
153
316
2.2K
364.2K
steve
steve@benmoha·
@inductionheads where we’re going, we wont need today’s tasks! but really, we are going to be unlocking completely new areas of science + engineering with superintelligence
English
0
0
0
50
Super Dario
Super Dario@inductionheads·
The elephant in the AI room is that the labs have basically cornered themselves into having an upside dependent on greater and greater superintelligence When the intelligence we have today, granted memory and harness improvements, is sufficient for 99% of tasks in today’s world
English
46
10
128
8.5K
steve
steve@benmoha·
@shakoistsLog I distinctly remember noticing from o3 onwards that the prose was so software engineer-y
English
0
0
3
60
shako
shako@shakoistsLog·
an LLM is destined to take on the personality of the median senior ML scientist within an org.
English
5
2
66
2.2K
steve
steve@benmoha·
@thsottiaux @TheRealAdamG Similar to Cursor, you should be able to restore your codebase back to the state it was at the end of every agent turn (with the option of branching into new chat) The current Undo feature doesn’t work nearly as well as cursor’s restore checkpoint functionality
English
0
0
0
73
Tibo
Tibo@thsottiaux·
It’s the little things that matter, what are some small papercuts you have noticed in Codex? We’ll fix as many as possible in the next week.
English
2K
54
2.3K
266.8K
steve
steve@benmoha·
@dodgeblake the aura farm segment was a perfect encapsulation of the show
English
1
0
5
339
Blake Dodge
Blake Dodge@dodgeblake·
I am doing some research. What is your #1 favorite moment from TBPN in the last year, whether it was: insightful, just super fun/funny, or memorable in some other way?
English
36
4
80
290.3K