Zach Koch

3.4K posts

Zach Koch banner
Zach Koch

Zach Koch

@zachk

cofounder & ceo @ultravox_dot_ai // making AIs communicate like humans // jack of some trades

Seattle Katılım Ekim 2006
550 Takip Edilen977 Takipçiler
Sabitlenmiş Tweet
Zach Koch
Zach Koch@zachk·
Incredibly excited for this one! The team has worked incredibly hard over the last couple of months to not just bridge the gap with OpenAI, but actually exceed when it comes to speech understanding. Small teams can do amazing shit.
Ultravox AI@ultravox_dot_ai

Today we're releasing Ultravox v0.5, the next iteration of our open-weight speech language model With this release, we've closed the gap with proprietary models. Ultravox now outperforms GPT-4o Realtime & Gemini 1.5 Flash on key benchmarks for speech understanding 🧵

English
3
2
21
1.8K
Zach Koch retweetledi
Ultravox AI
Ultravox AI@ultravox_dot_ai·
We're hosting dinner in SF for builders and leaders working in Voice AI. If you're interested in joining us on April 9, please request a seat here: go.ultravox.ai/voice-ai-leade…
English
0
1
5
245
Zach Koch retweetledi
Ultravox AI
Ultravox AI@ultravox_dot_ai·
New to voice AI? We created a list of key terms and concepts you're likely to encounter. Notice something missing? Let us know! ultravox.ai/voice-ai/voice…
English
0
1
3
171
Gustavo Garcia
Gustavo Garcia@anarchyco·
@ultravox_dot_ai @zachk Nice post. I think what we need there is the ability to run multiple LLMs in parallel. I usually call it the "conversational LLM" that can be a S2S model like Ultravox and the "background LLM" that can be running tasks in parallel and interacting with the other LLM.
English
1
0
1
33
Ultravox AI
Ultravox AI@ultravox_dot_ai·
Agentic use cases are quickly beginning to dominate the world of text-based LLMs, so why are voice AI systems stuck in a world of deterministic flows and node builders? @zachk on what we need to build truly *agentic* voice experiences: ultravox.ai/blog/what-we-n…
English
1
1
6
614
Zach Koch retweetledi
Ultravox AI
Ultravox AI@ultravox_dot_ai·
3 non-negotiable properties of agentic voice systems: - Fast - Fluent - Fluid Smarter models alone aren't enough to bridge the gap - voice agents need a harness built for real-time interactions. ultravox.ai/blog/what-we-n…
English
0
1
2
204
Zach Koch retweetledi
Ultravox AI
Ultravox AI@ultravox_dot_ai·
Speech-to-speech models are poised to displace the voice AI component stack — and benchmarks like the new AIEWF eval from @kwindla prove it. ultravox.ai/blog/why-speec…
English
1
3
8
485
Zach Koch
Zach Koch@zachk·
@elevenlabs Do you have a technical report sharing how you did the evaluation?
English
0
0
0
182
ElevenLabs
ElevenLabs@ElevenLabs·
Today we’re introducing Scribe v2: the most accurate transcription model ever released. While Scribe v2 Realtime is optimized for ultra low latency and agents use cases, Scribe v2 is built for batch transcription, subtitling, and captioning at scale.
English
94
264
2K
554.9K
Zach Koch
Zach Koch@zachk·
I think Dylan is right over the next 2-3 years but wrong over the next 5-10
Dylan Field@zoink

Thoughts: 1. In the future, the probability something is generated entirely by AI will be inversely proportional to its intended lifespan. 2. For conceptually simple artifacts that are intended to have short lifespans, humans will still be involved just at a different level of abstraction. For example, I'm super excited about @Weavy_ai (Figma Weave) because it shows what's possible when you treat AI generation like clay to shape rather than the final output. Workflow building is a new skill to explore and learn. 3. If you intend for an artifact to have a long lifespan (ex: software, a novel, a movie), then AI might still aid you in your creative process. But you will bring great intention to the work. You will think through many different approaches. You will care about the smallest of details. You will lean into the craft. Because if you don't, it won't be good enough to last. It won't be noticed. It won't be loved. It won't matter. 4. Focusing just on software now... people don't like it when software changes. Everyone who has shipped a redesign knows this! So you might be generating new content within a piece of software frequently but of course you wouldn't redesign the fundamental UX of the software all the time. Users would hate it. As a grounding metaphor, consider a house. Yes, you might change the photos and papers and magnets stuck to your fridge a few times a week. Once in a while, you reorganize stuff or move furniture around. After living in the house for a while, you maybe notice issues around how you use the space and — with great intention — embark on a remodel. Some parts of the house, like the fridge, change a lot. But the overall structure of the house changes less. When asking what will be generated by AI, don't confuse the whole for the parts, the long lasting for the ephemeral. 5. It's intellectually interesting to think about whether a brand might want to adapt their software on a user by user basis. (Certainly individuals will be able to make more software for themselves if they are so inclined. For example, see Figma Make.) That said, my strong gut right now is that we will not end up in a world where brands customize software on a per user basis. People learn how to use software from other humans. Snapchat is a great example. For a new user, Snapchat is kind of confusing. You can see this as a design issue or an advantage... I argue it's an advantage. By leaning into custom patterns and a learnable (but arguably non-intuitive) interface, the resulting network is a more intentional space. If you're young, you'll learn how to use Snapchat by watching your friends use Snapchat. And if you're older, well, you might not be the intended demographic. 6. To wrap up... we are in a world where the amount of software is growing at an exponential rate. If you want to win, design is the differentiator. Invest in design, craft, storytelling and a bold point of view. Use AI as a tool, but don't expect it to build the next big thing for you on its own. Don't expect it to make something that no one has ever seen or imagined before. That's your job.

English
0
0
1
151
Adriana Porter Felt
Adriana Porter Felt@__apf__·
I don't know how to get other people as excited about watching rainfall totals as I am
English
2
0
8
1.3K
Adriana Porter Felt
Adriana Porter Felt@__apf__·
(sports commentator voice) look at the 2026 water year coming out of the tunnel like a house on fire, already clocking 13.9 inches! the early 25-26 season stands at 173% and the historical averages are wondering what just hit 'em
Adriana Porter Felt tweet mediaAdriana Porter Felt tweet media
English
1
0
6
3.3K
Josh Larson
Josh Larson@jplhomer·
Big changes afoot
Josh Larson tweet media
Français
2
0
8
586
Zach Koch
Zach Koch@zachk·
@JustJake It's nice of them to help you with the comparison math
English
0
0
1
199
Jake
Jake@JustJake·
Horrifying achievement tbh What has my life become?
Jake tweet media
English
3
0
75
5.3K
Zach Koch
Zach Koch@zachk·
@oanaolt I call this phase "post-FOMO", and it is a step on the path to enlightenment
English
0
0
1
25
Ryan Florence
Ryan Florence@ryanflorence·
@FlorinPop17 - first big thing for me was options at a startup that went public ($250k) - bought a duplex with that and rented it out, rent + growth (+200k) - primary residence growth (+300k buying/selling 5 homes) - high income from self-employment and buying VOO
English
7
0
130
11.4K
Zach Koch
Zach Koch@zachk·
@nbaschez Number 2 is so true, especially as a founder. But kids are the best thing on this earth and I wouldn't give them up for anything!
English
0
0
1
72
Nathan Baschez
Nathan Baschez@nbaschez·
Two things I’ve observed since becoming a dad: 1. Getting help is massive, and it’s important to find creative ways within your budget to make life easier 2. Part of my brain will always whisper that I’m not doing enough at work anymore, and no amount of help will fix it
Nichole Wischoff@NWischoff

For those of you with demanding careers and little kids, please share the amount of care you have (babies, cleaning, food). Have a 16mo and a newborn coming in January and overwhelmed by it all. Currently have coverage 7-4pm but it isn’t even close to manageable.

English
6
0
39
5.6K
Zach Koch
Zach Koch@zachk·
@zachtratar I dig it! Highly usable. Different than the same looking stuff as everyone else.
English
1
0
2
55
Zach Koch
Zach Koch@zachk·
Beware Voice AI claims that are <1s (they are probably lying). But prove me wrong with demo links!
English
0
0
1
100
Zach Koch
Zach Koch@zachk·
I should note: this is true, end-to-end latency in production for customers (i.e., from user finishes speaking to speech output from Ultravox)
English
1
0
2
106
Zach Koch
Zach Koch@zachk·
Ultravox inference just got about 20% faster overnight. Average response time across thousands of calls (find me a faster solution!):
Zach Koch tweet media
English
1
1
5
677