
Farhan H
3.7K posts

Farhan H
@FarhanSoftware
Sharing ideas, productivity tips, and thoughts on developments in AI & Tech. ex-CTO @ wAI Industries












Local native-audio voice agent running on an RTX 5090. - @NVIDIAAI Nemotron 3 Nano - audio|text ➡️ text - patched vLLM to implement complete turn prefix caching - ~125ms TTFT - @kyutai_labs Pocket TTS - text ➡️ audio - Nemotron Speech ASR - streaming audio ➡️ text - @pipecat_ai Smart Turn end-of-utterance - ~500ms total voice-to-voice latency - runs bash via tool calls If you're interested in voice and realtime multi-modal AI, come join us at the SF Voice AI Meetup on Thursday May 7th. Talk to engineers from NVIDIA, Kyutai, and Pipecat about what you're building! Links to meetup registration, code, and models on @huggingface below ...







Reply here if you want to be part of Canadian builder group chat in X 🇨🇦


Maxell is bringing back a classic, w/ their brand new Cassette Player 🥳🎉 -Wireless AND Wired 🙌 -Rechargeable ⚡️ -11 Hours of Battery 🤯 * Step back into the 80’s with Maxell *





Electromagnetic mass drivers on the Moon

Announcing TERAFAB: the next step towards becoming a galactic civilization twitter.com/i/broadcasts/1…







