Tanmay Garg

607 posts

Tanmay Garg banner
Tanmay Garg

Tanmay Garg

@garg10may

I am randomness in chaos

Katılım Temmuz 2009
372 Takip Edilen61 Takipçiler
Tanmay Garg
Tanmay Garg@garg10may·
@ChatGPTapp tata, bye bye - FillApp, FillMate, Fillify, Fill Hero, AgentFillAI, VeloFill, WebFill, FillAI.
English
0
0
0
10
ChatGPT
ChatGPT@ChatGPTapp·
Paperwork is better when you can just talk through it. With Images in ChatGPT and voice mode, you can upload a form, say what to fill in, and get back a completed version.
English
310
538
7.9K
2.6M
The AI SuperApp
The AI SuperApp@aisuperapp·
AI Super App Rankings 1. Codex 2. Claude Desktop App 3. Cursor 4. Gemini
English
15
11
170
62.4K
Tanmay Garg
Tanmay Garg@garg10may·
Education is the foundation of all progress. It currently have all world knowledge to guide you but currently it's limited by post trained text output upto a particular length. Can it output a small book as per user requirement with proper index, chapters, diagrams, animations, like a interactive book, curated to the level of the user? Would need a different post train and different harness
English
0
0
0
408
Sam Altman
Sam Altman@sama·
what problem do you most hope AI will solve in the future? maybe we can help!
English
14.9K
732
12.6K
3.5M
Tanmay Garg
Tanmay Garg@garg10may·
7/7 So yes, Omni is cool. But the real Google I/O story is not video. It’s permission. Who gets to act for you, where they act, and how much of your digital life they can touch without feeling creepy. That’s the agent race.
English
0
0
0
21
Tanmay Garg
Tanmay Garg@garg10may·
6/7 My actual question: who gets trusted first? The model that gives the best answer? Or the ecosystem that already has your inbox, files, calendar, browser, and payment context? That answer probably decides more than benchmark charts.
English
1
0
0
9
Tanmay Garg
Tanmay Garg@garg10may·
5/7 This is why Gemini Spark matters more than it looks. A 24/7 personal agent sounds like marketing until you connect it to: calendar email browser search payments shopping workspace files then it becomes less like a chatbot and more like an operating layer.
English
1
0
0
19
Tanmay Garg
Tanmay Garg@garg10may·
4/7 OpenAI wants ChatGPT to become the super app. Anthropic is winning a lot of serious workplace trust. Google’s counter is different: put the agent inside Search, Chrome, Android, Gmail, Docs, shopping, coding, YouTube. boring distribution beats beautiful demos.
English
1
0
0
29
Tanmay Garg
Tanmay Garg@garg10may·
3/7 The model race is still real. But “best model” is becoming a temporary advantage. The more durable fight is: who owns the place where AI actually does the work?
English
1
0
0
6
Tanmay Garg
Tanmay Garg@garg10may·
2/7 Look at the pattern: Search agents Daily Brief Gemini Spark Universal Cart Antigravity That’s not “we made a smarter chatbot.” That’s “AI now gets a workspace, memory, tools, and permission to act.”
English
1
0
0
44
Tanmay Garg
Tanmay Garg@garg10may·
1/7 🧵 Google I/O takeaway: everyone is yelling about Omni video. fair. it’s flashy. but I think the bigger move is quieter: Google is trying to turn the whole internet into an agent runtime.
English
1
0
1
14
Tanmay Garg retweetledi
Tanmay Garg
Tanmay Garg@garg10may·
Small thing I think people are underweighting: The public video leaderboards are already crowded: Seedance, Kling, Veo, Sora-style models all trade blows on first-render quality. Omni’s interesting bet is different. Not “can it make a good 10 sec clip?” More: can it survive the 5th edit?
English
1
1
1
1.6K
Tanmay Garg
Tanmay Garg@garg10may·
@GoogleDeepMind So the benchmark I’d love to see is not just blind A/B: “which clip looks better?” It’s: give every model the same starting ref make 5 natural-language edits then judge whether the final video still holds together That’s probably where the next real moat is.
English
0
0
0
48
Tanmay Garg
Tanmay Garg@garg10may·
That’s the part creators actually care about. Same character after a few changes. Same scene logic. Camera still makes sense. Physics doesn’t quietly break. The reference you gave it still matters. A lot of AI video looks great on attempt #1, then falls apart when you try to direct it.
English
1
0
0
64
Google DeepMind
Google DeepMind@GoogleDeepMind·
We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵
English
403
1.3K
8.4K
1.2M
David K 🎹
David K 🎹@DavidKPiano·
Been using Codex much more than Claude Code lately It's wild that Claude became popular *because* it had the best-in-class models for coding... and all it had to do was hold the lead
English
118
18
1.3K
240.2K
ChatGPT
ChatGPT@ChatGPTapp·
ChatGPT is now available as an add-on in Excel and Google Sheets. It can help analyze messy data, write formulas, update spreadsheets, and explain what it’s doing along the way—without leaving your spreadsheet. Powered by GPT-5.5. chatgpt.com/apps/spreadshe…
English
226
623
6.6K
954.2K
Tanmay Garg
Tanmay Garg@garg10may·
@sama Put the same in codex, fine tuned for coding, it doesn't understand tech stack specific terms. Like how do you even pronounce shadcn
English
0
0
0
122
Sam Altman
Sam Altman@sama·
people are really starting to use voice to interact with AI, especially when they have a lot of context to dump. GPT-Realtime-2 comes to the API today; it is a pretty big step forward. (we are working on improvements to voice in chat.)
English
876
290
7.1K
486.8K
Tanmay Garg
Tanmay Garg@garg10may·
@Dimillian If it connects to unity and creates the same, that would be game changer
English
0
0
0
571
Thomas Ricouard
Thomas Ricouard@Dimillian·
I've asked Codex for simple polygons but it seems like it cooked way beyond that goal
Thomas Ricouard tweet media
English
80
26
1.1K
169.2K