Max K

1.6K posts

@max_does_tech

building @visionagents_ai. dev advocate @getstream_io, ex: @IBM, @Vonage. public speaker, OS maintainer. python, vision AI, APIs

Joined December 2021
258 Following · 371 Followers
Max K retweeted
Vision Agents @visionagents_ai
Changelog update time!

We've added:
- @huggingface object detection support
- @AssemblyAI support, including diarization

Upgraded:
- K8s deployment example
- Real time transcript buffering and handling
- SFU error handling

Imminent:
- Non-Stream video options
Max K retweeted
Vision Agents @visionagents_ai
here's @XiaomiMiMo v2-omni being very very polite to me
Max K @max_does_tech
It was really satisfying to see my vibe coded frontend actually work
Vision Agents @visionagents_ai

Here's @NVIDIAAIDev Nemotron-3-Super-49B used in a real-time Vision Agents application as a fraud assistant! You can see every action it takes, and it's all happening in real-time 😮 Using the Nemotron model hosted on @baseten for reliability.

Max K retweeted
Thariq @trq212
We just added /btw to Claude Code! Use it to have side chain conversations while Claude is working.
Max K @max_does_tech
actually pretty impressive how responsive this one is
Vision Agents @visionagents_ai

This is @GoogleDeepMind Gemini 3.1 Flash-Lite responding in real time in a Vision Agents app. It's able to handle a lot of different video understanding questions much more quickly than the previous gen... and this is on release day, when everyone's hitting the API! 😆

Max K @max_does_tech
Not bad for a realtime VLM that's only 4.5GB!
Vision Agents @visionagents_ai

Running the new @Alibaba_Qwen 3.5 2B parameter model LOCALLY here in a Vision Agents app. This is all in realtime and it can understand my handwriting and respond to questions... this wouldn't have been possible even MONTHS ago!

Max K @max_does_tech
i honestly didn't know if it would be able to read my handwriting at all
Vision Agents @visionagents_ai

The new @Alibaba_Qwen 3.5 Flash's vision understanding capabilities are really impressive, it can even read this handwriting! Available immediately for your vision AI app with Vision Agents 🇳🇱🙌

Max K @max_does_tech
i'm not THAT bald
Vision Agents @visionagents_ai

@Alibaba_Qwen this is really accurate! in our case, painfully accurate 😅 here's Qwen 3.5 running in an app built with our Vision Agents SDK that's designed to roast us as hard as possible...

Max K retweeted
Vision Agents @visionagents_ai
@claudeai Sonnet 4.6 is so creative and responsive! Here it is used in an app built with our Vision Agents SDK. This is one take, all in real-time! 😮 It's super responsive and its level of comprehension is really high.