Achiever🌶️

1.7K posts

Achiever🌶️

@flip_trinity

software engineer gaming experience 💀 || acknowledge greatness always 💫

dev Katılım Aralık 2024

726 Takip Edilen269 Takipçiler

Sabitlenmiş Tweet

Achiever🌶️@flip_trinity·5d

x.com/i/article/2052…

ZXX

169

Achiever🌶️@flip_trinity·4h

@Olamicryptt Congrats champ The journey too sweet

English

279

0xOlami@Olamicryptt·4h

AlhamduliLlah 🤲 web3 jobs >>>> build your X account oooooo where are my team members ?

English

145

324

10.5K

Achiever🌶️@flip_trinity·5h

@elonmusk Made possible by super grok

English

Elon Musk@elonmusk·5h

Grok now has skills

Tech Dev Notes@techdevnotes

Skills in Grok Web can be used by typing /

English

2.9K

2.6K

21.7K

7.4M

Achiever🌶️@flip_trinity·5h

@elonmusk Implementing already

English

Elon Musk@elonmusk·5h

Grok Voice is #1!

Artificial Analysis@ArtificialAnlys

Announcing agentic performance benchmarking for Speech to Speech models on Artificial Analysis. We use 𝜏-Voice to measure tool calling and customer interaction voice agent capabilities in realistic customer service scenarios Even the strongest Speech to Speech (S2S) models today resolve only about half of realistic customer service scenarios end-to-end - a meaningful gap relative to frontier text-based agents on the same tasks. Voice channels introduce significant complexity: challenging accents, background noise, and packet loss, all while requiring fast responses, consistency across long multi-turn conversations, and reliable tool use. Performance also varies considerably by audio condition: in clean audio some models perform notably better, but realistic conditions continue to pose a challenge. Conversation duration also varies meaningfully across models, with implications for both customer experience and operational cost. About 𝜏-Voice: Our Agentic Performance benchmark is based on 𝜏-Voice (Ray, Dhandhania, Barres & Narasimhan, 2026), which extends 𝜏²-bench into the voice modality to evaluate S2S models on realistic customer service tasks. It measures multi-turn instruction following, support of a simulated customer through a complete interaction, and tool use against simulated customer service systems. The simulated user combines an LLM-driven decision model with realistic audio synthesis: diverse accents, background noise, and packet loss modelled on real network conditions. This complements our Big Bench Audio benchmark measuring intelligence and Conversational Dynamics (Full Duplex Bench subset) benchmark measuring conversational naturalness. Scores are the average of three independent pass @1 trials. We evaluate under realistic audio conditions using the 𝜏²-bench base task split across three domains: ➤ Airline (50 scenarios): e.g., changing a flight, rebooking under policy constraints ➤ Retail (114 scenarios): e.g., disputing a charge, processing a return ➤ Telecom (114 scenarios): e.g., resolving a billing issue, troubleshooting a service problem Task success is determined by deterministic checks against expected actions and final database state, consistent with the 𝜏²-bench evaluator. Key results: xAI's Grok Voice Think Fast 1.0 is the clear leader at 52.1%, averaging 5.6 minutes per conversation, the second-longest overall. OpenAI's GPT-Realtime-2 (High) (39.8%, 3.0 min) and GPT-Realtime-1.5 (38.8%, 4.8 min) follow, with Gemini 3.1 Flash Live Preview - High close behind at 37.7% (3.8 min). Speech to Speech is a fast evolving modality and we expect movement in rankings as we continue to add new models with these capabilities, and model robustness improves. Congratulations @xAI @elonmusk! See below for further detail ⬇️

English

1.6K

10.1K

3.2M

Achiever🌶️@flip_trinity·5h

@FabrizioRomano @Ali_alabdallh1 @SPL Bro just wants the spotlight We see the reaction lil bro ✌️🫩

English

393

Achiever🌶️@flip_trinity·5h

@_lordskid @fortytwonetwork What's this about?

English

Lordskid (❖,❖)@_lordskid·5h

My @fortytwonetwork inference node activity is now over 100 hours strong. Locked in 🔒

English

Achiever🌶️@flip_trinity·5h

@NiphermeDave That one guy who thinks he's non chalant

English

Nipherme@NiphermeDave·5h

There’s one bro out there still clicking a $0 testnet instead of max bidding on all free NFTs available rn🥀 Learn to adapt and follow the trend.

English

347

5.7K

Achiever🌶️@flip_trinity·5h

@elonmusk Do I get paid for trying it?

English

Elon Musk@elonmusk·5h

Try Grok Voice

X Freeze@XFreeze

Grok Voice Think Fast 1.0 ranks #1 on the Artificial Analysis τ-Voice benchmark for real-world agentic customer service resolution Absolutely outperforming GPT-Realtime-2 (High) and Gemini 3.1 Flash by a huge margin That's a massive 12%+ lead over OpenAI's best model that just released a few days ago Grok is running real-time background reasoning without the latency penalty, which is why it is already handling live Starlink phone operations autonomously at scale

English

2.2K

11.9K

5.1M

Achiever🌶️@flip_trinity·5h

@Bjay_eth Do more It's gonna click It's like unlocking a vault and finding the last click

English

Bjay 🥷@Bjay_eth·5h

Entered two bounties and I lost both 🙂

Bjay 🥷@Bjay_eth

I used to think paying for Netflix was normal. Then I realized I was also the product being sold. Something is very wrong with how streaming works. Let me show you what I mean 🧵

English

Achiever🌶️@flip_trinity·5h

@FabrizioRomano Lol , bro is actually sad because he didn't do anything reasonable today

English

231

Fabrizio Romano@FabrizioRomano·5h

Cristiano Ronaldo at full time. 💔 Al Nassr need one more win to become Saudi Pro League champions.

English

2.8K

4.5K

64.3K

1.8M

Achiever🌶️@flip_trinity·5h

@Krptonoob Ya know , Arab money does numbers

English

152

Anointed@Krptonoob·5h

There is no difference between the Saudi Pro League and the American WWE 100% scripted

English

1.9K

Achiever🌶️@flip_trinity·5h

@0xKyros Thanks fam really appreciate 🙏 WAGMI

English

kyros.eth@0xKyros·5h

@flip_trinity Yeah Goodluck man I hope you win soon too

English

kyros.eth@0xKyros·6h

First May win? this just saved me God! look at my wallet (+31250%)😭 This is your cue to participate in contests. If they’re free, just do it. If you don’t win, the content you make serves as Proof of work for you. even if I don’t win the stitch contest, I got a gig from it.

kyros.eth@0xKyros

x.com/i/article/2052…

English

172

Achiever🌶️@flip_trinity·5h

@IfeKrawn No, I mean in terms of preserving and then rewarming

English

Adeife@IfeKrawn·5h

@flip_trinity Frozen food tastes better?

English

Adeife@IfeKrawn·6h

No food has rewarm value pass Party jollof

English

1.1K

Achiever🌶️@flip_trinity·5h

@0xKyros Yes very true , been seeing a lot of enquires requesting for POW , and like you said , already made work will do the job

English

kyros.eth@0xKyros·5h

@flip_trinity Exactly man If it’s free and you actually understand the concept Do your best on it If you don’t win the contest, your work still serves as proof of work for you People can see it

English

Achiever🌶️@flip_trinity·6h

@beejay0x Consider it done

English

B33JAY🪽@beejay0x·6h

I need a 5 figs win.

匚HIDI@Cyber_chidi

i need a 4 figs win.

English

4.4K

Achiever🌶️@flip_trinity·6h

@bigchog Not me collecting games for my future son 🫩

English

bigchog@bigchog·7h

as a guy, you should collect things during your time on this earth > collect memories > collect NFTs > collect pokemon cards > collect money > collect phone numbers > collect anything that makes you happy

English

746

Achiever🌶️@flip_trinity·6h

@banditxbt @standotsui Proof of not being bald

English

banditxbt@banditxbt·7h

@standotsui

QME

278

stanxbt@standotsui·19h

I got followed by two accounts that open doors One opens the door to the Sei ecosystem The other opens the door to the club of bald, short men Excited to join both