matt brown

4K posts

matt brown banner
matt brown

matt brown

@mnbbrown

engineering @ Elyos. ex @GoCardless, @bcgdv. pilot. redhead.

London, England شامل ہوئے Mart 2010
2.6K فالونگ1.4K فالوورز
andy
andy@b1rdmania·
London tech and ai scene heatmap. Adding more now. Will sync to office spaces next week. londonmaxxxing.com
English
95
65
739
172K
matt brown
matt brown@mnbbrown·
This is the classic min/max tweet that adds nothing to the conversation about voice agents. The real headline: NVIDIA released an iteration on speech to speech agents. We're still a long way from these being useful in production. The voices suck, tool calling with be bad, etc
Aakash Gupta@aakashgupta

The part most people will skip: NVIDIA just made every voice AI API a commodity. OpenAI charges $0.06/min input and $0.24/min output for Realtime API. Gemini Live bills 25 tokens/second of audio. Every startup building voice agents is hemorrhaging cash on per-minute API fees to run what is fundamentally a pipeline problem: ASR → LLM → TTS, three models stitched together with latency at every seam. PersonaPlex replaces that entire pipeline with one 7B model. Runs on a single A100. Open weights, MIT license, commercial use permitted. Response latency: 0.170 seconds for turn-taking, 0.240 seconds for interruptions. It scores higher on dialog naturalness than Gemini (2.95 vs 2.80 MOS) and handles interruptions better than every commercial system they benchmarked. This tells you everything about NVIDIA’s playbook. They don’t need to charge for the model. They need you to buy the GPU. Every company that self-hosts PersonaPlex instead of paying OpenAI per-minute is another A100/H100 sale. Every voice agent startup that drops their API dependency is another enterprise GPU contract. NVIDIA open-sourced the fishing rod because they sell the lake. Built on the Moshi architecture from Kyutai, fine-tuned with under 5,000 hours of data. The voice AI margin is migrating from the application layer to the hardware layer. And NVIDIA is the only company that profits no matter which model wins. 330,000 downloads in the first month. That’s infrastructure capture disguised as generosity.

English
0
0
0
39
matt brown
matt brown@mnbbrown·
I wish claude code had a way to have tools emit progress bars..
English
1
0
0
45
matt brown
matt brown@mnbbrown·
Real world benchmarking … @DeepgramAI @covaldev @cekuraAi @livekit
Panos Stravopodis@pstrav

At @Elyos_AI We benchmarked 13 STT providers on 100 real customer calls from the trades businesses. Not synthetic lab data. Real calls with: - Background noise & multiple speakers - UK postcodes & addresses - Regional accents (England, Scotland, Ireland) - Short confirmations to long explanations Top performers: 🥇 @DeepgramAI Flux - 15.9% WER 🥈 @soniox_ai - 16.9% WER 🥉 @Speechmatics - 17.7% WER @OpenAI Whisper? 39.8% WER - wouldn't recommend for production voice AI. What's your experience with STT models? Are we there yet?

English
0
0
0
67
matt brown
matt brown@mnbbrown·
Yo @sama how do we get OpenAI support to respond to our support requests :( we’re being left on read.
English
1
0
0
32
matt brown
matt brown@mnbbrown·
@FlintCasey 2. Scaling to support complex support flows really hurt. We ended up with 1000s of custom fields, a couple of hundred custom forms and maintaining them was painful.
English
1
0
0
33
Casey Flint
Casey Flint@FlintCasey·
What support ticket tooling do people prefer to use these days? Zendesk, Freshdesk, Intercom etc?
English
5
0
2
732
matt brown
matt brown@mnbbrown·
@alanchanguk Or we can just stop developing new aircraft but that sounds boring.
English
1
0
0
53
matt brown
matt brown@mnbbrown·
@alanchanguk We can either develop new Jet A1 aircraft, or new electric aircraft. Both about the same development cost. Electric significantly cheaper to operate and maintain. Better ROI. 100% cost driven.
English
2
0
0
71
Alan Chang
Alan Chang@alanchanguk·
I don’t understand why there are so many start ups working on electric or hydrogen aviation, it’s a basic physics limitation, energy density of battery ~50X lower, ~4X lower for hydrogen
English
2
0
9
1.2K
matt brown
matt brown@mnbbrown·
@alanchanguk I remember reading that a new power train requires $1bil USD to get certified, same again for an airframe.
English
1
0
0
78
matt brown
matt brown@mnbbrown·
@alanchanguk Exactly - long haul will require an order of magnitude improvement. No startup (that I know of at least) is tackling that.. and press releases from airbus, etc are nothing but vapour. The real work is being done in short haul but the upfront capital req is massive.
English
1
0
0
94
matt brown
matt brown@mnbbrown·
@dsp_ @samuelcolvin why do the python SDKs for MCP and PydanticAI use docstrings + method args instead of pydantic models/dataclasses for defining tool, resource, etc arguments? Was there a strategy there or just the way the cookie crumbled? CC @ghaidar0
English
0
0
1
55
matt brown ری ٹویٹ کیا
Tom Blomfield
Tom Blomfield@t_blom·
Getting real IBM Watson vibes from all these Salesforce AgentForce ads
English
15
11
385
28.4K
matt brown
matt brown@mnbbrown·
Why is iOS speech to text so much better at numbers, addresses and postcodes than @DeepgramAI, @OpenAI whisper, etc? It nails them - the rest are mediocre at best.
English
0
0
0
126