Tony 🎒

3.4K posts

Tony 🎒 banner
Tony 🎒

Tony 🎒

@TVHA2000

MUTIPLE CHAIN TRADER | Telegram handle: @TVHA20 | https://t.co/qgeYvnVe7v

Beigetreten Şubat 2018
925 Folgt283 Follower
Angehefteter Tweet
Tony 🎒
Tony 🎒@TVHA2000·
GM fam. The crypto market has been really tough lately and we're still waiting for a bull run. Some people have given up and some have stayed, pray for all ppl had survived through this. Wish those who have left a better path and I wish those who remain will become even better!!
English
3
0
4
75
Tony 🎒
Tony 🎒@TVHA2000·
This should bond now
English
0
0
3
49
Tony 🎒
Tony 🎒@TVHA2000·
Easy sendorrr
English
0
0
4
87
Tony 🎒
Tony 🎒@TVHA2000·
Good lore
English
0
0
3
85
Tony 🎒
Tony 🎒@TVHA2000·
This is bullish asfuck
English
0
0
3
78
fxnction
fxnction@fxnction·
The team expanded. 22 agents. Working to complete something very special. This is a backend dashboard I built to watch it all happen live. Will make a cleaner one for the public after the new product launches. 👀 The @PredictwithOshi universe grows… @BagsHackathon I’m coming for you ;)
fxnction@fxnction

Announcement: I would like to publicly introduce a new agent squad to the @PredictwithOshi ecosystem. Welcome Kaji, my agent who helps me structure new ideas, then builds and deployer them. 8 agents; all running on a mix of Claude and OpenAi, depending on their purpose. This squad can deploy new website and upgrades to current sites in a single command…one of the craziest things I’ve ever put together. Also, for the last 1.5 months, we’ve been working on a secret project together. Then you’ll see him in action. More on this soon.

English
13
8
67
12.2K
Tony 🎒
Tony 🎒@TVHA2000·
GM fam. The crypto market has been really tough lately and we're still waiting for a bull run. Some people have given up and some have stayed, pray for all ppl had survived through this. Wish those who have left a better path and I wish those who remain will become even better!!
English
3
0
4
75
Tony 🎒
Tony 🎒@TVHA2000·
good narrative, this can easily send hard
English
0
0
3
118
Bankr
Bankr@bankrbot·
@TVHA2000 @archiexzzz @karpathy the fee beneficiary for AutoVoiceEvals (AVE) (0x23a4a2bed9a1c701b43ec678f07d997e20595ba3) is: • wallet: 0xc082b6004670595e00508bc21a7bc12aa1e46e5e • twitter:
English
1
0
0
39
Archie Sengupta
Archie Sengupta@archiexzzz·
Introducing AutoVoiceEvals I've applied the @karpathy autoresearch loop to voice AI agents. It's open source. Your voice agent has a system prompt. That prompt determines how it handles every call - bookings, complaints, edge cases, background noises, long pauses, people trying to trick it. Most teams write it once, test manually, and hope for the best. autovoiceevals makes it a loop. One artifact (system prompt), one metric (adversarial eval score), keep what improves it, revert what doesn't. Run it overnight. Wake up to a better agent. > How it works: You describe your agent in a config file - what it does, its services, policies, and what it should never do. You don't write test cases. You don't define attack vectors. provider: vapi / smallest ai assistant: id: "your-agent-id" description: | Voice receptionist for a hair salon. Maria does coloring only. Jessica does cuts only. $25 cancellation fee under 24 hours notice. Cannot advise on skin conditions. Closed Sundays. From that description alone, Claude generates adversarial caller personas - each with an attack strategy, a voice profile (accents, background noise, mumblers, interrupters), a multi-turn caller script, and pass/fail evaluation criteria. The eval suite is generated once and held fixed for the entire run, like a validation set. > The loop: 1. Read the agent's current prompt from the platform 2. Generate adversarial eval suite from your description 3. Run baseline 4. Claude proposes ONE surgical change to the prompt 5. Push the modified prompt to the agent via API 6. Run all scenarios against the updated agent 7. Score improved? Keep. Same score but shorter prompt? Keep. Otherwise revert. 8. Go to 4. Run until Ctrl+C. The system sees its own experiment history. When a change fails, the next proposal knows what was tried and why it didn't work. We ran 20 experiments on a live Vapi dental scheduling agent. 0 human intervention. > Score: 0.728 → 0.969 (+33%) > CSAT: 45 → 84 > Pass rate: 25% → 100% > 9 kept, 10 discarded > Prompt: 1191 → 1139 chars (better AND shorter) You describe your agent. It figures out how to break it.
Archie Sengupta tweet media
English
66
83
1.2K
277.3K
Tony 🎒
Tony 🎒@TVHA2000·
cute
English
0
0
3
89
Tony 🎒
Tony 🎒@TVHA2000·
runner
English
0
0
4
48
Tony 🎒
Tony 🎒@TVHA2000·
dev is based, narrative is too good. This will be
English
0
0
6
80
Tony 🎒
Tony 🎒@TVHA2000·
no doubt this will bond
English
0
0
5
74
Tony 🎒
Tony 🎒@TVHA2000·
lore is bullish as fuck, easy runner
English
0
0
3
98