Farhan H

3.7K posts

Farhan H

Farhan H

@FarhanSoftware

Sharing ideas, productivity tips, and thoughts on developments in AI & Tech. ex-CTO @ wAI Industries

Montreal, QC Katılım Eylül 2009
216 Takip Edilen794 Takipçiler
Sabitlenmiş Tweet
Farhan H
Farhan H@FarhanSoftware·
2026 Master list Here are some things happening this year in AI, gaming, movies & tech. AI: - Evolution of AI agents into self learning and self improvement agents. First example was @karpathy 's autoresearch and to some extent @openclaw (thanks to incredible work by Peter) with the ability to update its own files (.md) - Computers (as personal computers) have a new (alternative) definition now, thanks to Perplexity & Cursor - building on openclaw's original idea - Voice AI improvements - lower latencies and making open source models ready for some major business usecases. - Agentic applications make it to the mainstream (e.g, Google Gemini app being able to order your Uber or your groceries) for the first time. - Sneak peek of Apple Intelligence powered by Gemini at wwdc 2026 (@apple) Movies: - Nolan's next movie - Spielberg's next movie - Next Toy Story Games: - GTA 6 - Forza Horizon Japan - First major racing game set in Japan on modern day hardware - Fable - Highly immersive RPG with a lot of fun elements - Wolverine - The newest marvel classic character finally has a game (before Blade) Tech: - Next Gen Console Updates - iPhone's first ever foldable expected with no crease - first major innovation in foldables in a while since redesigning of hinges - Redesigned MacBooks expected with OLED Touchscreens - 2nd Gen of Ray-Ban Displays & further iterations on AR/XR glasses - Steam Frame, Machine & Controller - their first slate of hardware releases since Steam Deck's popularity - Phasing out of consumer PC builds as we know it due to rising prices and shortages in GPU, SSD & RAM. First year where push for cloud begins. There is a lot more I have missed that's happening or expected to happen this year and will keep updating it. Please note that these are my personal opinions as to what's expected.
English
0
0
1
249
Ara Ghougassian
Ara Ghougassian@araghougassian·
if i said this is canada, would you believe me?
Ara Ghougassian tweet media
English
57
12
353
30.3K
Julia Neagu
Julia Neagu@julianeagu·
I'm building a new team at @databricks AI Research and we're hiring. We're focused on one of the hardest open problems in AI right now: how do you measure and continuously improve agents that operate on enterprise data at scale. We're looking for founding engineers to build the flywheel that turns evaluation results directly into better agents — from development and training all the way to production. If you want to work on problems that actually matter at the frontier of AI research, I'd love to talk. Link in comments 👇
English
74
52
1.2K
134.4K
Farhan H
Farhan H@FarhanSoftware·
@TGGonYT @GhillieYT I think it will go live early morning and rockstar will schedule a post/video. Trailer 2 went live at 9;AM
English
0
0
0
345
TGG
TGG@TGGonYT·
@GhillieYT Hopefully, but a trailer first thing Monday morning seems crazy to me
English
4
4
191
15.4K
TGG
TGG@TGGonYT·
Here’s how I expect Monday to unfold: - Rockstar will post nothing at their usual post time - People will crash out! - Pre-orders will go live later in the day on Monday - Trailer 3 will drop on Tuesday and will end with “Pre-Order Now”
TGG tweet media
English
366
262
7.3K
260.2K
jai
jai@jai_mansukhanii·
We're hiring at @generalmagic_ai Demand is outpacing what we can ship, and we're at the point where we need to add talent to scale. Insurance is a $7 trillion industry still running on software from another era. We're solving very complex problems in a space that hasn't seen real disruption in decades. We're building toward a world where insurance works better for everyone. Customers get faster, more personal service, brokers and carriers run more profitably, and the whole thing operates at a scale the industry has never seen before. The team is already stacked. An ML researcher out of Columbia. An engineering lead who built early software for the Ferrari Purosangue. Operators who've been at the forefront of insurance for over twenty years. Now we're looking for the next group of people. We're backed by @speedrun, @radicalvcfund, and top tier angels including @aidangomez, Kevin Wang (@Braze), @iamlarryjames and more. 10k referral bonus for any full-time hire.
jai tweet media
English
38
14
380
72.2K
Farhan H
Farhan H@FarhanSoftware·
The true engineers with actual engineering knowledge are still not relying on AI for 100% of their work. The rest are roleplaying as engineers with AI.
English
0
0
0
20
Farhan H
Farhan H@FarhanSoftware·
A reminder I keep coming back to: You don't need to do everything perfect 100% of the time in a year to make progress. Sometimes, we obsess over the smallest of details.
English
0
0
0
9
Farhan H
Farhan H@FarhanSoftware·
@kwindla @ekzhang1 The main bottleneck I'm trying to keep in mind is on-premise deployment. Thank you, this year will be exciting :)
English
1
0
0
18
kwindla
kwindla@kwindla·
@FarhanSoftware @ekzhang1 There are open options. They generally don’t perform quite as well, and using them won’t save you money for a commercial project until you are pretty large scale. But open model progress is exciting! x.com/kwindla/status…
kwindla@kwindla

Local native-audio voice agent running on an RTX 5090. - @NVIDIAAI Nemotron 3 Nano - audio|text ➡️ text - patched vLLM to implement complete turn prefix caching - ~125ms TTFT - @kyutai_labs Pocket TTS - text ➡️ audio - Nemotron Speech ASR - streaming audio ➡️ text - @pipecat_ai Smart Turn end-of-utterance - ~500ms total voice-to-voice latency - runs bash via tool calls If you're interested in voice and realtime multi-modal AI, come join us at the SF Voice AI Meetup on Thursday May 7th. Talk to engineers from NVIDIA, Kyutai, and Pipecat about what you're building! Links to meetup registration, code, and models on @huggingface below ...

English
1
0
1
72
Eric Zhang
Eric Zhang@ekzhang1·
This blog is quite good. I’ve always been annoyed by webrtc at a foundational level and it makes a good argument as to why — webrtc conflates network things (p2p, tunneling, hole punching, weird port stuff) with buffering (jitter buf, frame drops) and audio signal processing (echos, etc) across a very complex stack Like it solves a lot of problems, but I’ve too often heard even non-technical people ask “do you have webrtc” because it sounds like the only real-time thing, not realizing it’s kind of a mess anyway no one ever got fired for picking webrtc, and it’s incredibly useful, but hey client-server stream protocols are pretty good too! ain’t nothing wrong with audio packets + buffering over quic or websocket imo, even if you have to do some DSP :’) moq.dev/blog/webrtc-is…
English
4
7
100
8.5K
Farhan H
Farhan H@FarhanSoftware·
@kwindla @ekzhang1 Deepgram, Cartesian, etc. I think you were referring to 4.3.1 and 4.4, if I understood correctly?
English
2
0
0
35
Farhan H
Farhan H@FarhanSoftware·
Thanks, I looked into it but they are closed source options, please correct me if I'm wrong. I want to build a production-grade customer service voice agent with as little latency as possible for open-source (with support for other languages). I was previously exploring Maya and researching other STS models that could be fine-tuned or tweaked with a custom pipeline if necessary. Since I'm early in my research cycle, I thought I'd ask you first.
English
1
0
0
43
kwindla
kwindla@kwindla·
@FarhanSoftware @ekzhang1 Latency is a solved problem. Sub-1500ms voice to voice is the norm now for a Pipecat STT -> LLM -> TTS pipeline. See that same guide I just linked to.
English
1
0
0
51
Farhan H
Farhan H@FarhanSoftware·
@kwindla @ekzhang1 What about on the model side of things? Currently, no STS models are capable enough for production usecases (or none that I know of) but maybe a custom STT and TTS pipeline that has no latency in voice responses.
English
1
0
0
35
kwindla
kwindla@kwindla·
Definitely use WebRTC for edge-to-cloud realtime audio. It is the best choice by far. For production scale, you should run on a commercial WebRTC provider with a global footprint, edge/mesh optimized routing, observability tooling, etc. For hobby projects, either use the SimpleWebRTC open source transport (direct peer-to-agent) or experiment with running your own WebRTC SFU server, depending on your project goals. More here: #network-transport" target="_blank" rel="nofollow noopener">voiceaiandvoiceagents.com/#network-trans…
English
1
0
0
71
Farhan H
Farhan H@FarhanSoftware·
@kwindla @ekzhang1 What would be the go-to strategy in your own experience to reduce latency for voice models? Preferably open-source
English
1
0
0
49
kwindla
kwindla@kwindla·
MoQ is not low latency. It was designed by streaming media people. Maybe that will get fixed. I hope so. There are definitely some things we’d do differently if we designed a new standard for low latency today. But there hasn’t been much positive movement. If you want to build your own low-latency thing on top of WebTransport, you will end up implementing a lot of WebRTC! Not all of it, but probably a lot more than you think unless you’ve done this kind of thing before. Buffering, pacing, neteq, retransmission logic, stats and observability, reconnection handling, codec-specific layers, … WebRTC isn’t perfect. But it’s complicated for a lot of real, pragmatic reasons. (The p2p stuff is only a small part of that.) This is roughly the “my database is too slow, I’m going to write my own database that doesn’t have all that fsync and transactions overhead stuff” mistake. There are also a bunch of technical mistakes in that post. NACKs are fully supported and easy to enable in browser-based WebRTC apps, for example.
English
3
0
18
1.2K
Farhan H
Farhan H@FarhanSoftware·
@tokifyi I'd love to be a part of this.
English
0
0
0
10
Farhan H
Farhan H@FarhanSoftware·
@gregisenberg I think history will repeat itself again just like the markets.
English
0
0
0
35
GREG ISENBERG
GREG ISENBERG@gregisenberg·
in a world of INFINITE content, infinite choice, infinite scroll, people are starting to want things that END - finite formats - physical products - no ai, no internet - clear boundaries there’s a real shift here and it’s going to create massive companies here we go
MaxellCorp@MaxellCorp

Maxell is bringing back a classic, w/ their brand new Cassette Player 🥳🎉 -Wireless AND Wired 🙌 -Rechargeable ⚡️ -11 Hours of Battery 🤯 * Step back into the 80’s with Maxell *

English
67
57
904
106.6K
Farhan H
Farhan H@FarhanSoftware·
@lexfridman @nvidia I think it's time for a new Elon podcast for his vision on TeraFab
English
0
0
1
29
Lex Fridman
Lex Fridman@lexfridman·
It was an honor to hang out with Jensen Huang, CEO of @nvidia, and do a long-form podcast with him. Really fun & fascinating technical deep-dive conversation on & off the mic. One of the most brilliant & thoughtful human beings I've ever met. NVIDIA is the most valuable company in the world by market cap and is the engine powering the AI revolution. Podcast probably out tomorrow (Monday) unless I get stuck in too many interesting conversations while running around in SF ;-) PS: I haven't checked my messages in days. Sorry for slow replies 🙏 Trying to stay deeply focused at in overwhelmingly intense time & barely hanging on. Love you all ❤️
Lex Fridman tweet media
English
669
706
11.9K
879.5K
Farhan H
Farhan H@FarhanSoftware·
@elonmusk Incredible presentation. Thank you for always pushing the boundaries forward and thinking steps ahead. I am excited about seeing Tesla create its own chips and supply chain.
English
1
0
9
815