Akshay Ramaswamy

5.1K posts

Akshay Ramaswamy banner
Akshay Ramaswamy

Akshay Ramaswamy

@TheRealAk914

Staff PM @elise_ai | Formerly CEO @ Omni, acquired by @Coinbase | @Stanford and @ycombinator alum | #blacklivesmatter 🦇🔊

Brooklyn, NY Katılım Ağustos 2011
790 Takip Edilen884 Takipçiler
Sabitlenmiş Tweet
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
Today, I’m excited to announce that the Omni team will be joining @coinbase in their incredible mission of democratizing economic freedom! Grateful for the support of all of our friends, family, and investors. @akshayramaswamy1/the-next-chapter-omni-joins-coinbase-5fbad7c359ed" target="_blank" rel="nofollow noopener">medium.com/@akshayramaswa
English
12
4
68
0
Willow
Willow@WillowVoiceAI·
Introducing Atlas 1. Willow's new frontier speech-to-text model. It outperforms ElevenLabs, Deepgram, OpenAI, and more by a wide margin. Built on the first scalable, human-powered transcription infrastructure ever built for real-time dictation.
English
247
152
2.3K
801.8K
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
@ryolu_ It feels like the role of design will merge with product - deciding what should exist and building the right thing sound like product problems!
English
0
0
0
54
Gordon Sun
Gordon Sun@ImGordonSun·
Introducing Simmy: the Youtube for playable stories. Today, Simmy is #3 in its US App Store category, starting with the $1B romantic fiction vertical. We believe playable stories will redefine entertainment forever. Comment for an INVITE CODE to our public beta. (1/8) THREAD 🧵
Gordon Sun tweet media
English
120
26
203
104.4K
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
@claudeai @Scobleizer This feels like a strong signal that we’re trusting AI more and more to take actions for us, exciting times!
English
0
0
0
25
Claude
Claude@claudeai·
New in Claude Code: auto mode. Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf. Safeguards check each action before it runs.
English
2.1K
2.9K
39.4K
7.5M
Mckay Wrigley
Mckay Wrigley@mckaywrigley·
looking for a handful of people to test something new... i've been using it for a few months and am prepping to share. if you're a fan of claude cowork, openclaw, manus, perplexity computer, etc then you're a perfect fit. this will self destruct in 4hrs - please dm or reply.
Mckay Wrigley@mckaywrigley

you’re like 6 prompts away from infinitely customizable personal agi. anthropic gave you a world class agentic harness for free. use it!!!

English
1K
15
769
157K
Akshay Ramaswamy retweetledi
Zach Griff
Zach Griff@_ZachGriff·
TSA wait times are absolutely wild right now. So I built a free tracker that shows live waits by checkpoint, including Precheck, Clear, and more (where available). Most tools, including the TSA’s own app, only show airport-wide estimates. Here ya go: tsa.fromthetraytable.com
English
114
317
2.6K
637.9K
Dan Shipper 📧
Dan Shipper 📧@danshipper·
I just bought $5k of Figma Very bullish on SaaS adapting to AI, their stock is getting crushed rn, and @zoink isn’t gonna miss
English
54
4
238
47K
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
seeing a number of people either deep in research, venture, or domains adjacent to AI who have strong opinions on how good the models are yet they haven’t used claude code *at all* truly the ivory tower if you actually used these coding agents and optimized with all the new releases and paradigms (subagents, planning, skills, ralph loops, etc), the future is crystal clear
English
1
0
5
257
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
Anthropic must be agentmaxxing so hard right now, this pace of development is unparalleled We're talking major releases, sometimes multiple, every day for *weeks* -- this might be my feel the AGI moment
Noah Zweben@noahzweben

You can now schedule recurring cloud-based tasks on Claude Code. Set a repo (or repos), a schedule, and a prompt. Claude runs it via cloud infra on your schedule, so you don’t need to keep Claude Code running on your local machine.

English
2
0
1
336
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
@__mikareyes filling out government forms -- complex pdfs that the models generally struggle with @reductoai is the best I've seen so far, curious what other solutions are out there
English
2
1
3
446
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
What's the best AI product for pdf form filling? Surprisingly, many of the models are terrible still with filling out complex forms, but there's so much economic value to being able to automate this reliably
English
1
0
2
163
Blake Anderson
Blake Anderson@blakeandersonw·
It will be interesting to see the progression of stateless vs temp sandbox vs dedicated VMs for AI software. OpenClaw is the first 'great' dedicated VM tool. I suspect dedicated VM products will massively outperform their shared or temporary counterparts. Year of AI employees
English
40
5
221
19.4K
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
@NovaShips @karpathy this is a great point, similar to general voice simulation testing with tools like Bluejay / Hamming / Coval costs are getting cheap enough though that the inference is worth the iteration and improvement though imo
English
1
0
1
22
Nova Ships
Nova Ships@NovaShips·
@TheRealAk914 @karpathy Accuracy improves autonomously. Cost scales autonomously too. 19 experiments means 19x the API calls. Nobody talks about what the loop costs to run.
English
1
0
0
13
Akshay Ramaswamy
Akshay Ramaswamy@TheRealAk914·
Self-improving agents are *basically* here AutoVoiceEvals takes the foundation of @karpathy's autoresearch to improve your voice agent, showcasing a real example of taking an agent from 0.728 → 0.969 accuracy It works by: - creating adversarial agents based on your prompt - suggesting PRs based on performance - running in an autonomous loop to continue making improvements This is the end-game for AI, but it honestly feels like we're almost there
Archie Sengupta@archiexzzz

Introducing AutoVoiceEvals I've applied the @karpathy autoresearch loop to voice AI agents. It's open source. Your voice agent has a system prompt. That prompt determines how it handles every call - bookings, complaints, edge cases, background noises, long pauses, people trying to trick it. Most teams write it once, test manually, and hope for the best. autovoiceevals makes it a loop. One artifact (system prompt), one metric (adversarial eval score), keep what improves it, revert what doesn't. Run it overnight. Wake up to a better agent. > How it works: You describe your agent in a config file - what it does, its services, policies, and what it should never do. You don't write test cases. You don't define attack vectors. provider: vapi / smallest ai assistant: id: "your-agent-id" description: | Voice receptionist for a hair salon. Maria does coloring only. Jessica does cuts only. $25 cancellation fee under 24 hours notice. Cannot advise on skin conditions. Closed Sundays. From that description alone, Claude generates adversarial caller personas - each with an attack strategy, a voice profile (accents, background noise, mumblers, interrupters), a multi-turn caller script, and pass/fail evaluation criteria. The eval suite is generated once and held fixed for the entire run, like a validation set. > The loop: 1. Read the agent's current prompt from the platform 2. Generate adversarial eval suite from your description 3. Run baseline 4. Claude proposes ONE surgical change to the prompt 5. Push the modified prompt to the agent via API 6. Run all scenarios against the updated agent 7. Score improved? Keep. Same score but shorter prompt? Keep. Otherwise revert. 8. Go to 4. Run until Ctrl+C. The system sees its own experiment history. When a change fails, the next proposal knows what was tried and why it didn't work. We ran 20 experiments on a live Vapi dental scheduling agent. 0 human intervention. > Score: 0.728 → 0.969 (+33%) > CSAT: 45 → 84 > Pass rate: 25% → 100% > 9 kept, 10 discarded > Prompt: 1191 → 1139 chars (better AND shorter) You describe your agent. It figures out how to break it.

English
1
0
2
227