Tanmay Garg

607 posts

Tanmay Garg

@garg10may

I am randomness in chaos

Katılım Temmuz 2009

372 Takip Edilen61 Takipçiler

Tanmay Garg@garg10may·2d

@ChatGPTapp tata, bye bye - FillApp, FillMate, Fillify, Fill Hero, AgentFillAI, VeloFill, WebFill, FillAI.

English

ChatGPT@ChatGPTapp·2d

Paperwork is better when you can just talk through it. With Images in ChatGPT and voice mode, you can upload a form, say what to fill in, and get back a completed version.

English

310

538

7.9K

2.6M

Tanmay Garg@garg10may·3d

@thsottiaux @aisuperapp log

Tibo@thsottiaux·3d

@aisuperapp Linear or log scale?

English

176

11.5K

The AI SuperApp@aisuperapp·4d

AI Super App Rankings 1. Codex 2. Claude Desktop App 3. Cursor 4. Gemini

English

170

62.4K

Tanmay Garg@garg10may·3d

@thsottiaux Bruh,

English

310

Tibo@thsottiaux·3d

Dark magic. Codex.

OpenAI Developers@OpenAIDevs

Codex anywhere and everywhere, all the time. Now your Mac doesn’t have to be unlocked for Codex to use your computer. From your phone, Codex can securely use apps on your Mac, even when the screen is off and locked. #locked-use" target="_blank" rel="nofollow noopener">developers.openai.com/codex/app/comp…

English

184

1.7K

178.8K

Tanmay Garg@garg10may·3d

Education is the foundation of all progress. It currently have all world knowledge to guide you but currently it's limited by post trained text output upto a particular length. Can it output a small book as per user requirement with proper index, chapters, diagrams, animations, like a interactive book, curated to the level of the user? Would need a different post train and different harness

English

408

Sam Altman@sama·3d

what problem do you most hope AI will solve in the future? maybe we can help!

English

14.9K

732

12.6K

3.5M

Tanmay Garg@garg10may·5d

7/7 So yes, Omni is cool. But the real Google I/O story is not video. It’s permission. Who gets to act for you, where they act, and how much of your digital life they can touch without feeling creepy. That’s the agent race.

English

Tanmay Garg@garg10may·5d

6/7 My actual question: who gets trusted first? The model that gives the best answer? Or the ecosystem that already has your inbox, files, calendar, browser, and payment context? That answer probably decides more than benchmark charts.

English

Tanmay Garg@garg10may·5d

5/7 This is why Gemini Spark matters more than it looks. A 24/7 personal agent sounds like marketing until you connect it to: calendar email browser search payments shopping workspace files then it becomes less like a chatbot and more like an operating layer.

English

Tanmay Garg@garg10may·5d

4/7 OpenAI wants ChatGPT to become the super app. Anthropic is winning a lot of serious workplace trust. Google’s counter is different: put the agent inside Search, Chrome, Android, Gmail, Docs, shopping, coding, YouTube. boring distribution beats beautiful demos.

English

Tanmay Garg@garg10may·5d

3/7 The model race is still real. But “best model” is becoming a temporary advantage. The more durable fight is: who owns the place where AI actually does the work?

English

Tanmay Garg@garg10may·5d

2/7 Look at the pattern: Search agents Daily Brief Gemini Spark Universal Cart Antigravity That’s not “we made a smarter chatbot.” That’s “AI now gets a workspace, memory, tools, and permission to act.”

English

Tanmay Garg@garg10may·5d

1/7 🧵 Google I/O takeaway: everyone is yelling about Omni video. fair. it’s flashy. but I think the bigger move is quieter: Google is trying to turn the whole internet into an agent runtime.

English

Tanmay Garg retweetledi

Tanmay Garg@garg10may·5d

Small thing I think people are underweighting: The public video leaderboards are already crowded: Seedance, Kling, Veo, Sora-style models all trade blows on first-render quality. Omni’s interesting bet is different. Not “can it make a good 10 sec clip?” More: can it survive the 5th edit?

English

1.6K

Tanmay Garg@garg10may·5d

@GoogleDeepMind So the benchmark I’d love to see is not just blind A/B: “which clip looks better?” It’s: give every model the same starting ref make 5 natural-language edits then judge whether the final video still holds together That’s probably where the next real moat is.

English

Tanmay Garg@garg10may·5d

That’s the part creators actually care about. Same character after a few changes. Same scene logic. Camera still makes sense. Physics doesn’t quietly break. The reference you gave it still matters. A lot of AI video looks great on attempt #1, then falls apart when you try to direct it.

English

Google DeepMind@GoogleDeepMind·5d

We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵

English

403

1.3K

8.4K

1.2M

Tanmay Garg@garg10may·12 May

@bcherny @DavidKPiano Opus dies after 2 request in normal plan

English

Boris Cherny@bcherny·12 May

@DavidKPiano Hey, Boris from the team here. What can we do better?

English

507

1.4K

233.5K

David K 🎹@DavidKPiano·11 May

Been using Codex much more than Claude Code lately It's wild that Claude became popular *because* it had the best-in-class models for coding... and all it had to do was hold the lead

English

118

1.3K

240.2K

Tanmay Garg@garg10may·9 May

@ChatGPTapp For PPTs also pls

English

ChatGPT@ChatGPTapp·6 May

ChatGPT is now available as an add-on in Excel and Google Sheets. It can help analyze messy data, write formulas, update spreadsheets, and explain what it’s doing along the way—without leaving your spreadsheet. Powered by GPT-5.5. chatgpt.com/apps/spreadshe…

English

226

623

6.6K

954.2K

Tanmay Garg@garg10may·8 May

@sama Put the same in codex, fine tuned for coding, it doesn't understand tech stack specific terms. Like how do you even pronounce shadcn

English

122

Sam Altman@sama·7 May

people are really starting to use voice to interact with AI, especially when they have a lot of context to dump. GPT-Realtime-2 comes to the API today; it is a pretty big step forward. (we are working on improvements to voice in chat.)

English

876

290

7.1K

486.8K

Tanmay Garg@garg10may·6 May

And they said we have hit a wall

Alexander Whedon@alex_whedon

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.

English