Max K

1.6K posts

Max K banner
Max K

Max K

@max_does_tech

building @visionagents_ai. dev advocate @getstream_io, ex: @IBM, @Vonage public speaker, OS maintainer. python, vision AI, APIs

Katılım Aralık 2021
259 Takip Edilen364 Takipçiler
Max K retweetledi
Vision Agents
Vision Agents@visionagents_ai·
Real-time avatars are now available in Vision Agents with @Anam__ai to bring custom, responsive experiences to the world! Here's one cleverly deciding not to trust @stefanjblos with all its company's money... write up in 🧵
English
2
4
10
290
Max K retweetledi
Vision Agents
Vision Agents@visionagents_ai·
Let’s build a vision + voice agent with the new Gemini 3.1 Flash Live model 🔥 Following this tutorial, you'll build a multimodal agent that helps you sell your used items! Check out the full tutorial on the @googledevs YouTube channel! youtube.com/watch?v=8lA6bF…
YouTube video
YouTube
English
2
2
12
1.9K
Max K
Max K@max_does_tech·
Now this one I'm proud of...
Vision Agents@visionagents_ai

Using @roboflow's Neural Architecture Search to make a video moderation bot with under 1.8ms average inference time! In this demo we're able to moderate video coming in on a video call so quickly that you almost don't see the offending content before it's censored 🙌

English
0
0
2
43
Max K retweetledi
Vision Agents
Vision Agents@visionagents_ai·
Changelog update time! We've added: - @huggingface object detection support - @AssemblyAI support, including diarization Upgraded: - K8s deployment example - Real time transcript buffering and handling - SFU error handling Imminent: - Non-Stream video options
English
1
2
7
342
Max K retweetledi
Vision Agents
Vision Agents@visionagents_ai·
here's @XiaomiMiMo v2-omni being very very polite to me
English
1
1
4
167
Max K
Max K@max_does_tech·
It was really satisfying to see my vibe coded frontend actually work
Vision Agents@visionagents_ai

Here's @NVIDIAAIDev Nemotron-3-Super-49B used in a real-time Vision Agents application as a fraud assistant! You can see every action it takes, and it's all happening in real-time 😮 Using the Nemotron model hosted on @baseten for reliability.

English
0
0
2
71
Max K retweetledi
Thariq
Thariq@trq212·
We just added /btw to Claude Code! Use it to have side chain conversations while Claude is working.
English
1.2K
1.6K
25.9K
2.8M
Max K
Max K@max_does_tech·
actually pretty impressive how responsive this one is
Vision Agents@visionagents_ai

This is @GoogleDeepMind Gemini 3.1 Flash-Lite responding in real time in a Vision Agents app. It's able to handle a lot of different video understanding questions much more quickly than the previous gen... and this is on release day, when everyone's hitting the API! 😆

English
0
0
0
52
Max K
Max K@max_does_tech·
Not bad for a realtime VLM that's only 4.5GB!
Vision Agents@visionagents_ai

Running the new @Alibaba_Qwen 3.5 2B parameter model LOCALLY here in a Vision Agents app. This is all in realtime and it can understand my handwriting and respond to questions... this wouldn't have been possible even MONTHS ago!

English
0
0
1
47