Trickyy

71 posts

Trickyy banner
Trickyy

Trickyy

@matthewjugzy

Something Curse szn.

Katılım Mayıs 2025
26 Takip Edilen2 Takipçiler
Trickyy retweetledi
Base
Base@base·
Base + USDC The agentic lineup
Base tweet media
English
175
142
978
59.1K
Trickyy
Trickyy@matthewjugzy·
Wen ya, TGE😮‍💨
English
0
0
0
4
Trickyy retweetledi
Elon Musk
Elon Musk@elonmusk·
Grok Voice is #1!
Artificial Analysis@ArtificialAnlys

Announcing agentic performance benchmarking for Speech to Speech models on Artificial Analysis. We use 𝜏-Voice to measure tool calling and customer interaction voice agent capabilities in realistic customer service scenarios Even the strongest Speech to Speech (S2S) models today resolve only about half of realistic customer service scenarios end-to-end - a meaningful gap relative to frontier text-based agents on the same tasks. Voice channels introduce significant complexity: challenging accents, background noise, and packet loss, all while requiring fast responses, consistency across long multi-turn conversations, and reliable tool use. Performance also varies considerably by audio condition: in clean audio some models perform notably better, but realistic conditions continue to pose a challenge. Conversation duration also varies meaningfully across models, with implications for both customer experience and operational cost. About 𝜏-Voice: Our Agentic Performance benchmark is based on 𝜏-Voice (Ray, Dhandhania, Barres & Narasimhan, 2026), which extends 𝜏²-bench into the voice modality to evaluate S2S models on realistic customer service tasks. It measures multi-turn instruction following, support of a simulated customer through a complete interaction, and tool use against simulated customer service systems. The simulated user combines an LLM-driven decision model with realistic audio synthesis: diverse accents, background noise, and packet loss modelled on real network conditions. This complements our Big Bench Audio benchmark measuring intelligence and Conversational Dynamics (Full Duplex Bench subset) benchmark measuring conversational naturalness. Scores are the average of three independent pass@1 trials. We evaluate under realistic audio conditions using the 𝜏²-bench base task split across three domains: ➤ Airline (50 scenarios): e.g., changing a flight, rebooking under policy constraints ➤ Retail (114 scenarios): e.g., disputing a charge, processing a return ➤ Telecom (114 scenarios): e.g., resolving a billing issue, troubleshooting a service problem Task success is determined by deterministic checks against expected actions and final database state, consistent with the 𝜏²-bench evaluator. Key results: xAI's Grok Voice Think Fast 1.0 is the clear leader at 52.1%, averaging 5.6 minutes per conversation, the second-longest overall. OpenAI's GPT-Realtime-2 (High) (39.8%, 3.0 min) and GPT-Realtime-1.5 (38.8%, 4.8 min) follow, with Gemini 3.1 Flash Live Preview - High close behind at 37.7% (3.8 min). Speech to Speech is a fast evolving modality and we expect movement in rankings as we continue to add new models with these capabilities, and model robustness improves. Congratulations @xAI @elonmusk! See below for further detail ⬇️

English
1.8K
2.6K
13.1K
4.1M
Trickyy retweetledi
Psy Protocol
Psy Protocol@PsyProtocol·
Congratulations!!! You’ve chosen a path that was never meant to be ordinary. 💊 Now, let the adventure begin: 1️⃣ Follow @lobsternft_lol and @PsyProtocol 2️⃣ Quote repost this post and share your thoughts on why agentic private payments matter (~100 words) 3️⃣ Tag both @lobsternft_lol and @PsyProtocol in your post 4️⃣ Join the Psychonaut Incubation Program (psy.xyz/psychonaut) and drop your post in the “Red Pill Fantasy” creative challenge. Winners will be selected from valid entries.
Project Lobster@lobsternft_lol

The simulation is live. Your choice awaits: 🔵 Take the blue pill — return to the comfort of the ordinary, predictable world. 🔴 Or take the red pill — join us in shaping a trustless, next-generation blockchain where AI agents operate with real autonomy. To welcome early supporters, we’re running a whitelist draw for the next phase of the network. If you’d like to participate, see the details and enter here: psy.xyz/psychonaut @PsyProtocol

English
6
24
416
1.8K
Trickyy
Trickyy@matthewjugzy·
@PsyProtocol @lobsternft_lol Agentic private payments by @PsyProtocol & @lobsternft_lol are crucial for on-chain autonomy. They prevent tracking of AI-driven trades, ensuring bots execute private, front-run-resistant transactions. This privacy layer is the backbone of secure, decentralized financial agents..
Trickyy tweet media
English
0
0
2
32
Trickyy
Trickyy@matthewjugzy·
Agentic private payments by @PsyProtocol & @lobsternft_lol are crucial for on-chain autonomy. They prevent tracking of AI-driven trades, ensuring bots execute private, front-run-resistant transactions. This privacy layer is the backbone of secure, decentralized financial agents..
Trickyy tweet media
Psy Protocol@PsyProtocol

Congratulations!!! You’ve chosen a path that was never meant to be ordinary. 💊 Now, let the adventure begin: 1️⃣ Follow @lobsternft_lol and @PsyProtocol 2️⃣ Quote repost this post and share your thoughts on why agentic private payments matter (~100 words) 3️⃣ Tag both @lobsternft_lol and @PsyProtocol in your post 4️⃣ Join the Psychonaut Incubation Program (psy.xyz/psychonaut) and drop your post in the “Red Pill Fantasy” creative challenge. Winners will be selected from valid entries.

English
0
0
2
54
0xJuggernaut
0xJuggernaut@rdhxlzrdyy·
DOMA SEASON 1: WEEK 4 has just Arrived and Active, I only reached Rank 71😌 competition is still competition. Show your action here app.doma.xyz/join/zo1nq2mpm… Earn Badges by increasing points through trading in DOMA @domaprotocol @D3inc Own And Trade The Internet 💠
0xJuggernaut tweet media
English
4
0
10
106
0xJuggernaut
0xJuggernaut@rdhxlzrdyy·
GM CT✨ @domaprotocol @D3inc DOMA Season 1 is still live✅. Unlock new badges by continuously increasing your points on app.doma.xyz Doma Protocol, DOMA with ticker $DOMA, add liquidity, and listed are soon? 💙🚀 Gdoma Fams 💙✨ $DOMA
0xJuggernaut tweet media
English
7
2
11
209
0xJuggernaut
0xJuggernaut@rdhxlzrdyy·
"NO WORD IS DIFFICULT" Bridges on DOMA are now very easy, flexible, and efficient. They have three bridge options: -Relay -Stargate -Router Bridge from ETH to DOMA, then trade app.doma.xyz Integrated bridge, secure, fast, efficient, flexible, and low fees. Gdoma💙
0xJuggernaut tweet media
English
3
0
8
209
Doma Protocol
Doma Protocol@domaprotocol·
Today, DNS meets ENS on mainnet. DNS twins are live! Tokenized DNS domains on Doma Protocol can act as @ensdomains compatible names. Find your name twin here: doma.xyz/ens
Doma Protocol tweet media
English
83
43
185
34.9K
Trickyy
Trickyy@matthewjugzy·
• This suspended X/Twitter account has been Joining & Participating with Doma @domaprotocol since the beginning (DOMA EARLY TESTNET) • Participate, Join & Follow every Doma Discord Quest & X/Twitter Doma Quest. (EVEN THOUGH I'VE NEVER WON ONCE IN X/TWITTER QUEST)😮‍💨
Trickyy tweet media
English
0
0
1
18
Trickyy
Trickyy@matthewjugzy·
My suspended X/Twitter account has participated, joined & accompanied the Doma Project @domaprotocol since the beginning (DOMA TESTNET EARLY)😮‍💨 Very sad, the X/Twitter account has been suspended. I have a lot of memories with Doma💔
Trickyy tweet media
English
8
0
0
18