Achiever🌶️

1.7K posts

Achiever🌶️ banner
Achiever🌶️

Achiever🌶️

@flip_trinity

software engineer gaming experience 💀 || acknowledge greatness always 💫

dev Katılım Aralık 2024
726 Takip Edilen269 Takipçiler
0xOlami
0xOlami@Olamicryptt·
AlhamduliLlah 🤲 web3 jobs >>>> build your X account oooooo where are my team members ?
0xOlami tweet media0xOlami tweet media
English
145
10
324
10.5K
Elon Musk
Elon Musk@elonmusk·
Grok Voice is #1!
Artificial Analysis@ArtificialAnlys

Announcing agentic performance benchmarking for Speech to Speech models on Artificial Analysis. We use 𝜏-Voice to measure tool calling and customer interaction voice agent capabilities in realistic customer service scenarios Even the strongest Speech to Speech (S2S) models today resolve only about half of realistic customer service scenarios end-to-end - a meaningful gap relative to frontier text-based agents on the same tasks. Voice channels introduce significant complexity: challenging accents, background noise, and packet loss, all while requiring fast responses, consistency across long multi-turn conversations, and reliable tool use. Performance also varies considerably by audio condition: in clean audio some models perform notably better, but realistic conditions continue to pose a challenge. Conversation duration also varies meaningfully across models, with implications for both customer experience and operational cost. About 𝜏-Voice: Our Agentic Performance benchmark is based on 𝜏-Voice (Ray, Dhandhania, Barres & Narasimhan, 2026), which extends 𝜏²-bench into the voice modality to evaluate S2S models on realistic customer service tasks. It measures multi-turn instruction following, support of a simulated customer through a complete interaction, and tool use against simulated customer service systems. The simulated user combines an LLM-driven decision model with realistic audio synthesis: diverse accents, background noise, and packet loss modelled on real network conditions. This complements our Big Bench Audio benchmark measuring intelligence and Conversational Dynamics (Full Duplex Bench subset) benchmark measuring conversational naturalness. Scores are the average of three independent pass@1 trials. We evaluate under realistic audio conditions using the 𝜏²-bench base task split across three domains: ➤ Airline (50 scenarios): e.g., changing a flight, rebooking under policy constraints ➤ Retail (114 scenarios): e.g., disputing a charge, processing a return ➤ Telecom (114 scenarios): e.g., resolving a billing issue, troubleshooting a service problem Task success is determined by deterministic checks against expected actions and final database state, consistent with the 𝜏²-bench evaluator. Key results: xAI's Grok Voice Think Fast 1.0 is the clear leader at 52.1%, averaging 5.6 minutes per conversation, the second-longest overall. OpenAI's GPT-Realtime-2 (High) (39.8%, 3.0 min) and GPT-Realtime-1.5 (38.8%, 4.8 min) follow, with Gemini 3.1 Flash Live Preview - High close behind at 37.7% (3.8 min). Speech to Speech is a fast evolving modality and we expect movement in rankings as we continue to add new models with these capabilities, and model robustness improves. Congratulations @xAI @elonmusk! See below for further detail ⬇️

English
1.6K
2K
10.1K
3.2M
Nipherme
Nipherme@NiphermeDave·
There’s one bro out there still clicking a $0 testnet instead of max bidding on all free NFTs available rn🥀 Learn to adapt and follow the trend.
English
64
4
347
5.7K
Elon Musk
Elon Musk@elonmusk·
Try Grok Voice
X Freeze@XFreeze

Grok Voice Think Fast 1.0 ranks #1 on the Artificial Analysis τ-Voice benchmark for real-world agentic customer service resolution Absolutely outperforming GPT-Realtime-2 (High) and Gemini 3.1 Flash by a huge margin That's a massive 12%+ lead over OpenAI's best model that just released a few days ago Grok is running real-time background reasoning without the latency penalty, which is why it is already handling live Starlink phone operations autonomously at scale

English
2K
2.2K
11.9K
5.1M
Achiever🌶️
Achiever🌶️@flip_trinity·
@Bjay_eth Do more It's gonna click It's like unlocking a vault and finding the last click
English
0
0
1
2
Fabrizio Romano
Fabrizio Romano@FabrizioRomano·
Cristiano Ronaldo at full time. 💔 Al Nassr need one more win to become Saudi Pro League champions.
Fabrizio Romano tweet media
English
2.8K
4.5K
64.3K
1.8M
Anointed
Anointed@Krptonoob·
There is no difference between the Saudi Pro League and the American WWE 100% scripted
English
15
0
60
1.9K
kyros.eth
kyros.eth@0xKyros·
First May win? this just saved me God! look at my wallet (+31250%)😭 This is your cue to participate in contests. If they’re free, just do it. If you don’t win, the content you make serves as Proof of work for you. even if I don’t win the stitch contest, I got a gig from it.
kyros.eth tweet mediakyros.eth tweet mediakyros.eth tweet mediakyros.eth tweet media
kyros.eth@0xKyros

x.com/i/article/2052…

English
9
0
9
172
Adeife
Adeife@IfeKrawn·
No food has rewarm value pass Party jollof
English
33
1
43
1.1K
Achiever🌶️
Achiever🌶️@flip_trinity·
@0xKyros Yes very true , been seeing a lot of enquires requesting for POW , and like you said , already made work will do the job
English
1
0
1
20
kyros.eth
kyros.eth@0xKyros·
@flip_trinity Exactly man If it’s free and you actually understand the concept Do your best on it If you don’t win the contest, your work still serves as proof of work for you People can see it
English
1
0
1
12
bigchog
bigchog@bigchog·
as a guy, you should collect things during your time on this earth > collect memories > collect NFTs > collect pokemon cards > collect money > collect phone numbers > collect anything that makes you happy
English
40
0
79
746
stanxbt
stanxbt@standotsui·
I got followed by two accounts that open doors One opens the door to the Sei ecosystem The other opens the door to the club of bald, short men Excited to join both
stanxbt tweet media
English
112
0
217
3K
Achiever🌶️
Achiever🌶️@flip_trinity·
@0xgiwa Crypto space in a whole can be greater than some 9 - 5 jobs
English
1
0
1
34
0XGIWA
0XGIWA@0xgiwa·
it's crazy how I started crypot/web3 as a side hustle... now it's my major source of income thank you God 🙏
English
29
3
87
1.1K
Twister♦️
Twister♦️@Twisterfx001·
JUST IN 🚨🚨🚨: Nikita will be activating this new feature on x next week We all need em right?
Twister♦️ tweet media
English
25
1
34
275