Stephan (@stephangazarov) - Twitter Profili | Zamantika Mersobahis Locabet

Sabitlenmiş Tweet

Stephan@stephangazarov·1 May

Before, coding agents had no way to actually verify if the voice agents they ship work. I built a CLI that lets your agent place real calls to voice agents on Vapi, LiveKit, and other platforms so they can autonomously fix broken logic with better context. > Runs on Haiku 4.5 and speaks via Deepgram (configurable persona + 7 languages) > Forwards every event to stdout - STT transcript, tool calls, transfers, costs, provider warnings > Measures call from inside - mouth-to-ear latency at p50-p99, per-turn TTS/STT/LLM, audio quality (clipping, SNR, drops) All open source. npx vent-hq@latest init - no setup needed. Your agent auto-authenticates and generates an access token.

English

2

1

6

773

Stephan@stephangazarov·3d

@xandercogan Send 2-3 years San Francisco and forget

English

1

0

1

23

Xander Cogan@xandercogan·4d

“Everyone had a plan until they get punched in the face.” Throwback to my fight a few years ago. Sometimes, it’s as easy as just not giving up.

English

1

0

3

56

Stephan@stephangazarov·5d

@ArtemKozlovets 50/50, a lot of bugs with audio buffers in turn taking specifically - each platform has a different way of connecting and managing audio flows

English

0

12

Artem K@ArtemKozlovets·5d

@stephangazarov was it hard to build a CLI tool?

English

1

0

14

Stephan@stephangazarov·1 May

Before, coding agents had no way to actually verify if the voice agents they ship work. I built a CLI that lets your agent place real calls to voice agents on Vapi, LiveKit, and other platforms so they can autonomously fix broken logic with better context. > Runs on Haiku 4.5 and speaks via Deepgram (configurable persona + 7 languages) > Forwards every event to stdout - STT transcript, tool calls, transfers, costs, provider warnings > Measures call from inside - mouth-to-ear latency at p50-p99, per-turn TTS/STT/LLM, audio quality (clipping, SNR, drops) All open source. npx vent-hq@latest init - no setup needed. Your agent auto-authenticates and generates an access token.

English

2

1

6

773

Stephan@stephangazarov·5d

@ArtemKozlovets Yes, of course. With STT and TTS

English

1

0

1

22

Artem K@ArtemKozlovets·6d

@stephangazarov looks interesting. does it actually runs an audio during the call?

English

1

0

1

23

Stephan@stephangazarov·13 May

@xandercogan Best marketing

English

0

1

37

Xander Cogan@xandercogan·12 May

These people can’t be serious

English

1

0

2

56

Stephan@stephangazarov·9 May

@charlieholtz Might be worth building a browser inside Conductor so it’d be easier to work on web projects (just like Cursor has it)

English

0

5

330

Charlie Holtz@charlieholtz·9 May

I like this idea. It got me wondering, since we have an option to preview Markdown, why not also preview HTML! Will be in next release 🫡

Thariq@trq212

HTML is the new markdown. I've stopped writing markdown files for almost everything and switched to using Claude Code to generate HTML for me. This is why.

English

28

5

372

42.2K

Stephan@stephangazarov·8 May

CI for voice AI is essentially done. Parallel LiveKit calls now work - as for every other platform adapter. Iterate 10x faster and load test from your editor.

Stephan@stephangazarov

Before, coding agents had no way to actually verify if the voice agents they ship work. I built a CLI that lets your agent place real calls to voice agents on Vapi, LiveKit, and other platforms so they can autonomously fix broken logic with better context. > Runs on Haiku 4.5 and speaks via Deepgram (configurable persona + 7 languages) > Forwards every event to stdout - STT transcript, tool calls, transfers, costs, provider warnings > Measures call from inside - mouth-to-ear latency at p50-p99, per-turn TTS/STT/LLM, audio quality (clipping, SNR, drops) All open source. npx vent-hq @latest init - no setup needed. Your agent auto-authenticates and generates an access token.

English

0

1

3

295

Stephan@stephangazarov·4 May

github.com/livekit/agents…

ZXX

0

1

122

Stephan@stephangazarov·4 May

Fix I shipped for a 6-month-old onnxruntime crash in @livekit agents silero VAD plugin just got merged. Feels nice.

English

2

0

4

169

Stephan@stephangazarov·1 May

Your coding agent places a call, reads back the full trace, patches the agent, and calls again. It keeps looping until the voice agent passes (default behavior, can opt out). All runs are also auto-persisted locally, so your agent can diff them whenever it needs to check for regressions.

English

0

3

155

Stephan@stephangazarov·1 May

A bit of engineering context. The caller has to behave like a real user, or the coding agent ends up fixing bugs that don’t matter. Per turn, Haiku picks one of four decision modes: continue, wait, close, or hang up. On the listening side, Vent’s own VAD detects when the agent stops speaking. It’s a vendored TEN VAD compiled to WebAssembly, in-process, no external service. Two filters in front: it ignores quiet background noise, and it waits for two consecutive voice frames before deciding it’s speech (a single noise blip can’t fool it). The silence threshold isn’t fixed. It adapts per turn (200–3000ms) based on how the agent responds in order to not cut the response mid sentence or inflate latency with silence.

English

1

0

2

217

Stephan

Keşfet