
Akshay Ramaswamy
@TheRealAk914
Staff PM @elise_ai | Formerly CEO @ Omni, acquired by @Coinbase | @Stanford and @ycombinator alum | #blacklivesmatter 🦇🔊


OpenAI's new image model GPT-Image-2 has leaked. It seems to have extremely good world knowledge and great text rendering. Possibly better than Nano Banana Pro.

It's on @arena under the code names:
- maskingtape-alpha
- gaffertape-alpha
- packingtape-alpha

🚨 GREG BROCKMAN JUST EXPLAINED THE NEXT LEAP WITH SPUD (GPT 5.5)

Greg Brockman: "I think of Spud as a new base, as a new pre-train... I'd say it's like we have maybe two years worth of research that is coming to fruition in this model."

Greg says: "There's this thing called 'big model smell'... when these models are just actually much smarter, much more capable, that they bend to you much more, and you feel it."

Here is exactly what we are getting with the upcoming GPT 5.5 rollout:
• "Big Model Smell": A massive qualitative shift. The models stop being rigid and start intuitively bending to what you actually want them to do.
• Unlocking New Abilities: It can just do things it wasn't able to before. The frustrating moments where the AI "doesn't quite get it" and needs you to over-explain are going away.
• Longer Time Horizons: The ceiling is being completely raised. The new models will be able to autonomously solve complex, open-ended problems over much longer periods of time.
• A New Pre-Train Base: This is not an incremental fine-tune. Spud is a completely new foundation built to accelerate the entire economy.




I don’t know exactly what’s going on here, but it does feel AI-related. Unlike PM and eng, which started growing in 2024 (two years post-ChatGPT), design didn’t. If I had to venture a theory, I’d say that because AI is allowing engineers to move so quickly, there’s less opportunity, and less desire, to involve the traditional design process. That said, you’d think design would become a differentiator as more products compete for attention. Something to think about for your company! We’ll keep watching this trend and AI’s impact on org design more generally.

One interesting observation we made when we went a level deeper: the ratio of demand for PMs vs. designers has flipped. In mid-2023, we crossed over from more open designer roles to more open PM roles, and ever since, PM demand has been pulling away (currently 1.27x). This will be another trend to monitor in terms of how AI is reshaping org design.




you’re like 6 prompts away from infinitely customizable personal agi. anthropic gave you a world class agentic harness for free. use it!!!

TSA wait times are absolutely wild right now. So I built a free tracker that shows live waits by checkpoint, including Precheck, Clear, and more (where available). Most tools, including the TSA’s own app, only show airport-wide estimates. Here ya go: tsa.fromthetraytable.com




You can now schedule recurring cloud-based tasks on Claude Code. Set a repo (or repos), a schedule, and a prompt. Claude runs it via cloud infra on your schedule, so you don’t need to keep Claude Code running on your local machine.
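The shape of such a recurring task is simple enough to sketch. Below is an illustrative stand-in in Python — not Claude Code's actual interface; the task fields, `due` check, and `run_task` dispatch are all hypothetical — showing the repo/schedule/prompt triple and a poll loop that fires it:

```python
import time
from datetime import datetime

# Hypothetical task spec: a repo, a daily schedule slot, and a prompt.
# The real feature is configured inside Claude Code's cloud infra,
# not via a script like this.
TASKS = [
    {"repo": "org/api-server", "hour": 6,
     "prompt": "Triage overnight issues and open draft fixes."},
]

def due(task, now):
    """True when the task's daily slot matches the current minute."""
    return now.hour == task["hour"] and now.minute == 0

def run_task(task):
    """Placeholder for handing the prompt to a cloud agent run."""
    print(f"run {task['repo']}: {task['prompt']}")

def scheduler_loop():
    """Poll once a minute; nothing needs to stay open locally."""
    while True:
        now = datetime.now()
        for task in TASKS:
            if due(task, now):
                run_task(task)
        time.sleep(60)
```

The point of running this server-side is the last comment: the poll loop lives in the cloud, so your laptop can sleep.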







Introducing AutoVoiceEvals

I've applied the @karpathy autoresearch loop to voice AI agents. It's open source.

Your voice agent has a system prompt. That prompt determines how it handles every call: bookings, complaints, edge cases, background noise, long pauses, people trying to trick it. Most teams write it once, test manually, and hope for the best.

autovoiceevals makes it a loop. One artifact (the system prompt), one metric (adversarial eval score): keep what improves it, revert what doesn't. Run it overnight. Wake up to a better agent.

> How it works:

You describe your agent in a config file: what it does, its services, policies, and what it should never do. You don't write test cases. You don't define attack vectors.

provider: vapi  # or smallest ai
assistant:
  id: "your-agent-id"
  description: |
    Voice receptionist for a hair salon. Maria does coloring only.
    Jessica does cuts only. $25 cancellation fee under 24 hours notice.
    Cannot advise on skin conditions. Closed Sundays.

From that description alone, Claude generates adversarial caller personas, each with an attack strategy, a voice profile (accents, background noise, mumblers, interrupters), a multi-turn caller script, and pass/fail evaluation criteria. The eval suite is generated once and held fixed for the entire run, like a validation set.

> The loop:

1. Read the agent's current prompt from the platform
2. Generate the adversarial eval suite from your description
3. Run a baseline
4. Claude proposes ONE surgical change to the prompt
5. Push the modified prompt to the agent via API
6. Run all scenarios against the updated agent
7. Score improved? Keep. Same score but shorter prompt? Keep. Otherwise revert.
8. Go to 4.

Run until Ctrl+C. The system sees its own experiment history: when a change fails, the next proposal knows what was tried and why it didn't work.

We ran 20 experiments on a live Vapi dental scheduling agent. 0 human intervention.
> Score: 0.728 → 0.969 (+33%)
> CSAT: 45 → 84
> Pass rate: 25% → 100%
> 9 kept, 10 discarded
> Prompt: 1191 → 1139 chars (better AND shorter)

You describe your agent. It figures out how to break it.
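The keep/revert loop above is greedy hill-climbing over a single prompt against a fixed eval suite. A minimal sketch in Python, with toy stand-ins for both the eval suite and Claude's proposal step (these stand-ins are hypothetical, not the project's actual code — a real run scores live adversarial calls, not keyword coverage):

```python
def eval_score(prompt, suite):
    """Toy stand-in for the fixed adversarial eval suite: here, just the
    fraction of required policies the prompt actually states."""
    return sum(kw.lower() in prompt.lower() for kw in suite) / len(suite)

def propose_edit(prompt, suite, history):
    """Toy stand-in for Claude proposing ONE surgical change, informed by
    past experiments (here: add the first still-missing policy)."""
    for kw in suite:
        if kw.lower() not in prompt.lower():
            return prompt + f" Policy: {kw}."
    return prompt.rstrip()  # nothing missing: try trimming instead

def optimize(prompt, suite, steps=10):
    best = eval_score(prompt, suite)              # step 3: run a baseline
    history = []
    for _ in range(steps):
        candidate = propose_edit(prompt, suite, history)  # step 4
        score = eval_score(candidate, suite)      # steps 5-6: re-run suite
        # step 7: keep on improvement, or same score with a shorter prompt
        kept = score > best or (score == best and len(candidate) < len(prompt))
        history.append({"score": score, "kept": kept})
        if kept:
            prompt, best = candidate, score
        # otherwise revert: prompt stays unchanged
    return prompt, best, history

suite = ["$25 cancellation fee", "Maria colors only", "closed Sundays"]
final, score, history = optimize("Voice receptionist for a hair salon.", suite)
```

With this toy scorer the loop keeps three edits (one per missing policy) and then rejects every no-op proposal, which is the same keep/revert asymmetry that produced "9 kept, 10 discarded" in the real run.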



