Eric.Clauss_Officiel 🇫🇷

36.2K posts

@Eric_Clauss_fr

"France first"

Neuilly sur Seine · Joined November 2011
2.7K Following · 1.4K Followers
Eric.Clauss_Officiel 🇫🇷 retweeted
Elon Musk
Elon Musk@elonmusk·
Grok Voice is #1!
Artificial Analysis@ArtificialAnlys

Announcing agentic performance benchmarking for Speech to Speech models on Artificial Analysis. We use 𝜏-Voice to measure tool calling and customer interaction voice agent capabilities in realistic customer service scenarios.

Even the strongest Speech to Speech (S2S) models today resolve only about half of realistic customer service scenarios end-to-end - a meaningful gap relative to frontier text-based agents on the same tasks. Voice channels introduce significant complexity: challenging accents, background noise, and packet loss, all while requiring fast responses, consistency across long multi-turn conversations, and reliable tool use. Performance also varies considerably by audio condition: in clean audio some models perform notably better, but realistic conditions continue to pose a challenge. Conversation duration also varies meaningfully across models, with implications for both customer experience and operational cost.

About 𝜏-Voice: Our Agentic Performance benchmark is based on 𝜏-Voice (Ray, Dhandhania, Barres & Narasimhan, 2026), which extends 𝜏²-bench into the voice modality to evaluate S2S models on realistic customer service tasks. It measures multi-turn instruction following, support of a simulated customer through a complete interaction, and tool use against simulated customer service systems. The simulated user combines an LLM-driven decision model with realistic audio synthesis: diverse accents, background noise, and packet loss modelled on real network conditions. This complements our Big Bench Audio benchmark measuring intelligence and Conversational Dynamics (Full Duplex Bench subset) benchmark measuring conversational naturalness. Scores are the average of three independent pass@1 trials.

We evaluate under realistic audio conditions using the 𝜏²-bench base task split across three domains:
➤ Airline (50 scenarios): e.g., changing a flight, rebooking under policy constraints
➤ Retail (114 scenarios): e.g., disputing a charge, processing a return
➤ Telecom (114 scenarios): e.g., resolving a billing issue, troubleshooting a service problem

Task success is determined by deterministic checks against expected actions and final database state, consistent with the 𝜏²-bench evaluator.

Key results: xAI's Grok Voice Think Fast 1.0 is the clear leader at 52.1%, averaging 5.6 minutes per conversation, the second-longest overall. OpenAI's GPT-Realtime-2 (High) (39.8%, 3.0 min) and GPT-Realtime-1.5 (38.8%, 4.8 min) follow, with Gemini 3.1 Flash Live Preview - High close behind at 37.7% (3.8 min).

Speech to Speech is a fast evolving modality and we expect movement in rankings as we continue to add new models with these capabilities, and model robustness improves. Congratulations @xAI @elonmusk! See below for further detail ⬇️

English
2.4K
5.4K
24K
8.5M
Eric.Clauss_Officiel 🇫🇷 retweeted
SpaceX
SpaceX@SpaceX·
Launch rehearsal complete. During a flight-like countdown, more than 5,000 metric tonnes (11+ million pounds) of propellant were loaded on the fully stacked Starship and Super Heavy V3 vehicles for the first time.
SpaceX tweet media (4 images)
English
1.2K
5K
25.8K
2.1M
Eric.Clauss_Officiel 🇫🇷 retweeted
SpaceX
SpaceX@SpaceX·
Full duration and full thrust 33-engine static fire with Super Heavy V3
English
2.1K
5.5K
33.7K
34.5M
Eric.Clauss_Officiel 🇫🇷 retweeted
Elon Musk
Elon Musk@elonmusk·
ZXX
6.4K
16.1K
135.1K
33.4M
Eric.Clauss_Officiel 🇫🇷 retweeted
X Freeze
X Freeze@XFreeze·
Elon Musk once revealed in 2019 that only around 5% of SpaceX resources were focused on Starship at that time. The other ~95% were running Falcon 9 and Crew Dragon - one of the most successful orbital rocket programs in history. A small dedicated team built Starship (starting in tents at Boca Chica), while the rest of the company kept launching astronauts and landing boosters. Today, that same program has scaled massively - with roughly 3,400–4,000+ people focused on it, out of SpaceX’s ~13,000–15,000 total employees. The Starship program has shifted from a “side project” to the central pillar of SpaceX's long-term goal: making life multi-planetary. Now it's like watching sci-fi become reality.
X Freeze tweet media
English
500
1.6K
8.1K
1.2M
Eric.Clauss_Officiel 🇫🇷 retweeted
Elon Musk
Elon Musk@elonmusk·
ZXX
10.1K
16K
120.2K
29.4M
Eric.Clauss_Officiel 🇫🇷 retweeted
X Freeze
X Freeze@XFreeze·
Elon Musk sees a path where the cost to orbit drops below commercial air freight. Ultimately Starship will be able to fly across the globe cheaper per ton than a Boeing 747. That opens up a massive range of possibilities for global logistics.
English
459
1.8K
6K
797.7K
Marc Vanguard
Marc Vanguard@marc_vanguard·
👉 Another amusing illustration: the average brevet exam score by type of middle-school name ⬇️
Marc Vanguard tweet media
French
27
297
1.3K
43.8K
Marc Vanguard
Marc Vanguard@marc_vanguard·
🔴 Academic standards are COLLAPSING to a degree you can't even imagine. You won't be the same person after seeing these staggering figures 🧵⬇️
Marc Vanguard tweet media
French
644
4.1K
8.6K
1.2M
franceinfo
franceinfo@franceinfo·
🔴⚡BREAKING NEWS Le Canon français: racist remarks and gestures resembling Nazi salutes observed at a "giant banquet" in Caen ➡️ l.franceinfo.fr/dA9
franceinfo tweet media
French
1.1K
454
1.1K
161.8K
Cela
Cela@cela_nad·
@mtwit75 City centers need to be reinvented because people buy online rather than in shops, watch Netflix rather than go to the cinema, work from home, and have low-end meals delivered, because EVERYTHING has become TOO EXPENSIVE! That's the reality!
French
1
0
6
706
Enzo Morel
Enzo Morel@mtwit75·
In Paris, the businesses booming the most are now sit-down fast food (+579), nail salons (+217), discount phone shops (+201), and thrift shops (+99), while women's ready-to-wear (-678), bank branches (-205), travel agencies (-179), and traditional French restaurants (-139) are shutting down. In just 5 years.
Enzo Morel tweet media
French
270
1.2K
2.8K
347.9K
Adèle Blanc-Bardam
Adèle Blanc-Bardam@blancbardamadel·
Giving oral sex without material considerations is better. When her guy goes down on her, is he thinking about the evening meal he is being spared? This sketch makes you want to have lovers you suck and mistresses you lick, but not to be in a couple; it's disastrous🤣
French
21
22
284
235.8K
Le Goldenretriever
Le Goldenretriever@Goldenretour·
When I compare this with Platini's French national football team of the 80s, I want to cry..
Le Goldenretriever tweet media
French
49
9
109
6.4K
Mamba🐍
Mamba🐍@Mamba0205·
He has a complex, so he humiliates them
French
193
114
2.2K
212.8K
Perseus
Perseus@PerseusLeGrand·
In your opinion, what is the worst verbal tic? The one that irritates you and should be banned immediately.
Perseus tweet media
French
1.5K
40
370
440.6K
Eric.Clauss_Officiel 🇫🇷 retweeted
Armée de Terre
Armée de Terre@armeedeTerre·
“They were here fewer than sixty against an entire army; its mass crushed them. Life rather than courage abandoned these French soldiers on April 30, 1863. In their memory, the homeland erected this monument.” #Camerone2026 🫡🇫🇷 @LegionEtrangere
French
150
668
2.5K
176.6K