Alex Avery

1.6K posts

@alavery2

Building: @SierraPlatform Learning: 🏄🏼‍♂️👨🏼‍💻🥖🧘🏼‍♂️🪚 Past: Founder @gotgather (acquired) @stanford @StanfordGSB

San Francisco, CA Joined July 2009

1.1K Following 663 Followers

Pinned Tweet

Alex Avery@alavery2·5d

Voice agents feel magical…until you benchmark them. In τ-Voice (read "Tau Voice"), voice agents perform significantly worse than text agents with the same tools. Here's what's going wrong...

Sierra@SierraPlatform

Voice is quickly becoming the default way we interact with agents, but natural conversation brings a new set of challenges like interruptions, background noise, and accents. Voice agents need to handle all of this while still completing real tasks. Existing benchmarks only measure these skills in isolation – not whether voice agents can do both at once, under realistic conditions. τ-voice tests agents in realistic voice conversations – and it's surfacing real, fast progress on real tasks. In just 8 months, pass rates jumped from ~30% to ~67%, and voice agents now retain ~79% of text capability, up from ~45%. Learn more: sierra.ai/blog/tau-voice…

1.6K

Alex Avery retweeted

Tesla Semi@tesla_semi·18h

With fewer moving parts and no exhaust system, Semi doesn't need oil changes or engine repairs, requiring far less maintenance than its diesel-powered counterparts

277

3.6K

63.5K

Alex Avery@alavery2·1d

The name of the game

Neil Rahilly@neilrahilly

x.com/i/article/2051…

Alex Avery retweeted

Clay Bavor@claybavor·2d

Sierra is raising $950 million from new and existing investors, led by Tiger Global and GV, at a valuation of over $15 billion. We now have more than $1 billion to invest in becoming the global standard for companies wanting to transform their customer experiences with AI. Two years ago, most of our customers’ agents were limited to support — tracking orders, troubleshooting devices, and resetting passwords. Fast forward to today, and AI agents built on Sierra are powering all parts of the customer life cycle, from purchase consideration to product discovery to retention and more. We are so excited for what’s ahead, and are deeply grateful to our customers and partners for being on this journey with us. sierra.ai/blog/better-cu…

16.2K

Alex Avery retweeted

Bret Taylor@btaylor·2d

Sierra is raising $950 million from new and existing investors, led by Tiger Global and GV, at a valuation of over $15 billion. We now have more than $1 billion to invest in becoming the global standard for companies wanting to transform their customer experiences with AI. We’ve never had such conviction in the opportunity for Sierra and our customers. Just a couple of years ago, we had four design partners. Now, Sierra is serving over 40% of the Fortune 50, and agents built on our platform are powering billions of customer interactions — everything from refinancing homes to processing insurance claims, returning orders, and helping people raise millions in fundraisers. We’re deeply grateful to our customers for helping show what’s possible. If you’re not yet using Sierra, we’d love to partner with you. sierra.ai/blog/better-cu…

285.9K

Alex Avery retweeted

Matt Mullin@matthewwmullin·3d

NASA HAS RELEASED OVER 12,000 IMAGES OF THE ARTEMIS II MISSION. Unbelievable perspectives captured by the Crew! The aurora on the eclipse is incredible.

278

8.9K

61.7K

Alex Avery retweeted

Joshua Kushner@JoshuaKushner·3d

the industrial revolution made goods abundant. ai will do the same for services

116

210

2.5K

299.5K

Alex Avery@alavery2·4d

@broblas @levelsio @Jason @airthings @NorbertDragan link?

Nick@broblas·4d

@levelsio @Jason @airthings @NorbertDragan Guys buy calibrated sensors why are you messing with this garbage

1.4K

@levelsio@levelsio·4d

I got the @airthings Plus (unaffiliated, I just like the product), I think @norbertdragan recommended it
I been through so many air sensors, and I really think this is the best one so I bought one for living room and then another one for bedroom, and will get another one for my coworking
Why so many? Well one of the sensors I bought turned out to be a fake random number generator 😂 Another one kept phoning home to Chinese servers, kinda dodgy. Another one had values that made sense but turned out to be based on kinda estimating from other sensor values, so it didn't actually HAVE the sensor it displayed about (this is common to save money)
Why the Airthings is so great:
- The device is just super thoughtful and non-invasive, the screen is e-ink (I think?), no backlit, no LEDs shining at you, just black and white, it looks like a paper screen, beautiful, it knows its place!
- It measures A LOT of things: AQI (PM2.5+PM10), CO2 (!), VOC, Radon (!), humidity and temperature, and it actually has sensors for all!
- You don't need to pair it to WiFi, it just works by itself! (why is this great? So I remember getting that Awair sensor and I was in a hotel nomading and I couldn't even set it up cause captive hotel portal, such an Internet of Shit design to not be able to set up without WiFi)
- But when you do pair it with WiFi, it easily connects to your Home Assistant and sends your sensor data to HA without any issue, that lets you automate stuff based on your air quality
I love it :D

Vadym 🇺🇦🇩🇪🇪🇺@voituk

@levelsio Which sensor you are using? Would you recommend it? P.S. Totally got the same dilemma

1.7K

Alex Avery@alavery2·4d

@XFreeze For those that want to learn more about where it still has room to improve x.com/alavery2/statu…

Alex Avery@alavery2

Voice agents feel magical…until you benchmark them. In τ-Voice (read "Tau Voice"), voice agents perform significantly worse than text agents with the same tools. Here's what's going wrong...

490

X Freeze@XFreeze·5d

Grok Voice brutally dominates the top of the τ-voice Bench
Grok scores 67.3%, while Gemini sits at 43.8% and GPT Realtime at 35.3%
This is a massive lead over the competitors and it's not even close
The best real-time reasoning voice agent out there

188

205

1.1K

11.8M

Alex Avery@alavery2·5d

@Keller sadly the ones i’ve heard around town already have squeaky brakes. hopefully they can ramp up on hyundai quick to leave this bad decision in the past.

297

Keller Cliffton@Keller·5d

Wow, I just learned that Waymo's next-gen robotaxi is built on top of a Chinese vehicle platform. Is Google's vision really to build a national autonomous rideshare across the US on top of a Chinese car? This seems insane. Is this common knowledge?

108

978

83.4K

Alex Avery@alavery2·5d

@david__booth @levie At @SierraPlatform, we have an Acceleration team

146

David Booth@david__booth·5d

ok help me out here team. i want to talk to people who are this role at their company..👇👇
@levie's tweet has the cleanest definition, but i'm still struggling what to call it. what do you put in the JD?
- "internal FDE, whose job it is to wire up internal systems and get agents working with them effectively."
- @tkkong says "leverage engineering"
- @EricFriedman says "outcome engineers"
- have also seen "agent operator", "director of agents"
i like "ops engineer" ? maybe it doesn't need a title, it's just "head of operations" and/or "bizops but good at AI stuff" ?
DM me pls i / founders tag your "person" who is thinking about this stuff, i wanna chat to you about something 👀

Aaron Levie@levie

Starting to hire and retrain for new agent engineering roles for *internal* functions to help get more powerful agents working well on critical business processes. I expect this type of role to be a very big deal over time at Box and other companies. It looks something like an internal FDE, whose job it is to wire up internal systems and get agents working with them effectively. The person will be extremely technical and capable of building secure, governed agents for internal workflows that connect to business systems (like Box, Salesforce, Workday, etc.), and codify workflows in skills. In some cases this person may understand the business process well enough to do it fully, but in most cases I expect them to work with the business directly in an embedded fashion. Ironically, that may introduce another new role on the business side that is more akin to agent product management for internal processes. The key is that you need technical + process people that can span multiple teams or functions in an organization. It’s not about bringing automation to a job, but bringing automation to a process. This is going to be a very big trend in most companies going forward. Fun to watch the early innings of what this will look like.

18.5K

Alex Avery@alavery2·5d

@STLChrisH Good stuff. The gap between slick demo and production agent continues to exist. x.com/alavery2/statu…

Alex Avery@alavery2

Voice agents feel magical…until you benchmark them. In τ-Voice (read "Tau Voice"), voice agents perform significantly worse than text agents with the same tools. Here's what's going wrong...

409

Chris Hoffmann@STLChrisH·5d

Several companies doing this — with enterprise values of these individual startups already reaching hundreds of millions based on latest fundraising rounds. But voice agents alone are boring and will become commoditized. An end-to-end agentic AI powered system — from intake thru capacity management, dispatching, and marketing— now THAT is interesting and will push EVs of the category leaders well above $1B And beware of the many many garbage voice agents that will show you a slick product in a sales demo.

Codie Sanchez@Codie_Sanchez

One of the most overlooked AI opportunities in the next 24 months is voice agents for boring businesses. Every HVAC, plumbing, and pest control company in America is sending calls to voicemail after 5pm. One AI voice agent fixes it overnight. Most owners have never even heard of this technology, let alone know how to implement it. The person who packages this up and sells it to 500 of them is going to be very rich...very quietly.

30.3K

Alex Avery@alavery2·5d

@MarcusSpillane @Codie_Sanchez Even the voice agent itself has room to improve. But agree that's just the beginning. x.com/alavery2/statu…

Alex Avery@alavery2

Voice agents feel magical…until you benchmark them. In τ-Voice (read "Tau Voice"), voice agents perform significantly worse than text agents with the same tools. Here's what's going wrong...

109

Marcus@MarcusSpillane·5d

@Codie_Sanchez The voice agent is the easy part. The hard part is integrating with whatever janky software the plumber actually runs. Last mile of integration is where these deals die.

7.1K

Codie Sanchez@Codie_Sanchez·5d

302

1.7K

348.7K

Alex Avery@alavery2·5d

@Codie_Sanchez Still a lot of value left to provide. x.com/alavery2/statu…

Alex Avery@alavery2

Voice agents feel magical…until you benchmark them. In τ-Voice (read "Tau Voice"), voice agents perform significantly worse than text agents with the same tools. Here's what's going wrong...

Alex Avery@alavery2·5d

And before that, @OfficialLoganK at @GeminiApp held the top spot x.com/OfficialLoganK…

Logan Kilpatrick@OfficialLoganK

Our latest Live model is # 1 on Tau Voice Bench! Excited to see this new frontier of voice models cross the chasm of usability in production.

Alex Avery@alavery2·5d

Just last week, @grok Voice Think Fast 1.0 set a new record with a +29 pp jump. x.com/xai/status/204…

xAI@xai

Introducing Grok Voice Think Fast 1.0
A state-of-the-art voice model built for complex, multi-step workflows with snappy responses and high accuracy. It takes the top spot on the Tau Voice Bench and handles real-world messiness like noise, accents, and interruptions better than any other model in the world. x.ai/news/grok-voic…

169

Alex Avery@alavery2·5d

Voice agents feel magical…until you benchmark them. In τ-Voice (read "Tau Voice"), voice agents perform significantly worse than text agents with the same tools. Here's what's going wrong...

Sierra@SierraPlatform

1.6K
