Sam

49 posts

Sam banner
Sam

Sam

@shar_dev7

Developer | Open Source Contributor | ML Engineer Building, learning, and shipping ideas 🚀

New Delhi Katılım Mart 2026
21 Takip Edilen3 Takipçiler
Sam
Sam@shar_dev7·
Our paper "Do Thought Streams Matter?" got featured in a YouTube video 👀 Quick watch, simple breakdown, and some cool insights on Gemini video reasoning. Watch here: youtube.com/watch?v=8HN19n…
YouTube video
YouTube
Sam tweet media
English
1
1
5
53
Sam
Sam@shar_dev7·
Can video AI really reason about what it sees, or does it just sound confident? We explored this in our new paper on Gemini vision-language models for video scene understanding. Some interesting results came out of it 👀
English
2
0
6
62
Sam
Sam@shar_dev7·
This matters because a lot of people assume more reasoning = better answers. But for video models, the story may be more nuanced. Our paper looks at this closely and helps make sense of how reasoning works in real video understanding settings.
English
1
0
5
30
Sam
Sam@shar_dev7·
@ashu_trv Interesting perspective on reasoning efficiency in VLMs.
English
1
0
4
30
Ashu
Ashu@ashu_trv·
We just released a new benchmark looking inside the "black box" of Gemini 2.5 reasoning for video understanding. Does "thinking more" always lead to better results? The answer is more nuanced than you’d think 💭
English
5
5
17
459
Sam retweetledi
Min Choi
Min Choi@minchoi·
It's happening. New leak from Anthropic appears they are building a full-stack app builder inside Claude. They are coming for everything.
Min Choi tweet mediaMin Choi tweet mediaMin Choi tweet mediaMin Choi tweet media
English
142
113
1.1K
114.9K
Sam
Sam@shar_dev7·
No logs No screenshots No guessing Just context This is what happens when agents can see + hear + remember Feels like debugging with memory
English
1
0
5
24
Sam
Sam@shar_dev7·
Tried something new today 👀 My auth flow broke Instead of checking logs I asked: “what was I doing before this?”
Sam tweet media
English
1
0
9
32
Sam
Sam@shar_dev7·
Videos → searchable Videos → interactive Videos → usable This is how video should work
English
1
0
4
16
Sam
Sam@shar_dev7·
What if your video wasn’t just something you watch… But something you can talk to That’s Agentic Videos 👇
Sam tweet media
English
1
1
6
58
Sam
Sam@shar_dev7·
Now featured on Agent Community Making agents understand video, screens, and real-world context Check it out 👇 agentcommunity.org/m/videodb
English
0
0
5
11
Sam
Sam@shar_dev7·
AI agents can read text But what if they could see and hear too? That’s where VideoDB comes in Building the perception layer for agents 👇
Sam tweet media
English
1
0
5
24