Roopal Garg

915 posts

Roopal Garg

Roopal Garg

@roopalgarg

Sn Staff Research Engineer @GoogleDeepMind | MultiModal & i18n Research for Gemini models | 🦋@roopalgarg | Views are my own

Austin, TX Se unió Mart 2010
1.2K Siguiendo761 Seguidores
Tweet fijado
Roopal Garg
Roopal Garg@roopalgarg·
📢 Excited to unveil our latest research, ImageInWords (IIW)! 🚀We're pushing the boundaries of image descriptions with a new seeded, sequential, human-in-the-loop approach producing SOTA, articulate, hyper-detailed descriptions. arXiv: arxiv.org/abs/2405.02793 🧵1/12
Roopal Garg tweet media
English
5
30
138
51K
Roopal Garg retuiteado
Roopal Garg retuiteado
Demis Hassabis
Demis Hassabis@demishassabis·
You can vibe design some incredible interfaces with @stitchbygoogle
Google Labs@GoogleLabs

Introducing the new @stitchbygoogle, Google’s vibe design platform that transforms natural language into high-fidelity designs in one seamless flow. 🎨Create with a smarter design agent: Describe a new business concept or app vision and see it take shape on an AI-native canvas. ⚡️ Iterate quickly: Stitch screens together into interactive prototypes and manage your brand with a portable design system. 🎤 Collaborate with voice: Use hands-free voice interactions to update layouts and explore new variations in real-time. Try it now (Age 18+ only. Currently available in English and in countries where Gemini is supported.) → stitch.withgoogle.com

English
78
164
2K
242.7K
Roopal Garg retuiteado
Google AI Developers
Google AI Developers@googleaidevs·
Try 👁 Agentic Vision with Gemini 3 Flash in @GoogleAIStudio or Vertex AI. This new capability enables the model to effectively use code and reasoning to improve performance for common vision tasks. See Agentic Vision in action: goo.gle/3Z05KxK
English
24
113
855
170.2K
Roopal Garg
Roopal Garg@roopalgarg·
Wishing everyone a happy new year!!
English
0
0
0
49
Roopal Garg retuiteado
Sundar Pichai
Sundar Pichai@sundarpichai·
We’re back in a Flash ⚡ Gemini 3 Flash is our latest model with frontier intelligence built for lightning speed, and pushing the Pareto Frontier of performance and efficiency. It outperforms 2.5 Pro while being 3x faster at a fraction of the cost. With this release, Gemini 3’s next-generation intelligence is now rolling out to everyone across our products including @Geminiapp + AI Mode in Search. Devs can build with it in the Gemini API @GoogleAIStudio, Gemini CLI, and Google @antigravity and enterprises can get it in Vertex AI and Gemini Enterprise.
Sundar Pichai tweet media
English
342
630
7.1K
531.6K
Roopal Garg retuiteado
Josh Woodward
Josh Woodward@joshwoodward·
Nano Banana Pro can: Spell Spell in 10+ languages Spell in 10+ languages grounded in world knowledge Spell in 10+ languages grounded in world knowledge and do precise edits Spell in 10+ languages grounded in world knowledge and do precise edits w/ 14 reference images Spell in 10+ languages grounded in world knowledge and do precise edits w/ 14 reference images and it's all watermarked with SynthID I'm still blown away by it, it just doesn't get old
English
78
59
747
127K
Roopal Garg retuiteado
Sundar Pichai
Sundar Pichai@sundarpichai·
1/ Some are saying they’ll always remember the day our Gemini 2.5 Flash Image (aka nano 🍌) was released - that was the biggest thing to happen on August 26th, right?:) Try it if you haven’t yet: gemini.google.com  More from an exciting week:
English
199
222
4K
457.3K
Roopal Garg retuiteado
Oriol Vinyals
Oriol Vinyals@OriolVinyalsML·
A mere ~16 point jump on livebench.ai. Such a good model! Gemini 2.5 Pro ♊️
Oriol Vinyals tweet media
English
8
25
351
45.5K
Roopal Garg retuiteado
Arena.ai
Arena.ai@arena·
BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer Query, and Multi-Turn! Massive congrats to @GoogleDeepMind for this incredible Arena milestone! 🙌 More highlights in thread👇
Arena.ai tweet media
Google DeepMind@GoogleDeepMind

Think you know Gemini? 🤔 Think again. Meet Gemini 2.5: our most intelligent model 💡 The first release is Pro Experimental, which is state-of-the-art across many benchmarks - meaning it can handle complex problems and give more accurate responses. Try it now → goo.gle/4c2HKjf

English
72
399
2.3K
467.3K
Roopal Garg retuiteado
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Introducing Gemini 2.5 Pro, the world's most powerful model, with unified reasoning capabilities + all the things you love about Gemini (long context, tools, etc) Available as experimental and for free right now in Google AI Studio + API, with pricing coming very soon!
Logan Kilpatrick tweet media
English
263
439
4K
333K
Roopal Garg retuiteado
Sundar Pichai
Sundar Pichai@sundarpichai·
We have liftoff! After a successful launch this weekend, the first FireSat satellite is now orbiting Earth 🛰️ It’s the first of a 50+ satellite constellation that will help detect + track wildfires as small as 5x5 meters, using AI. Huge thanks to partners @MuonSpace @EarthFireAll @MooreFound, and special thanks to @SpaceX for the ride! Here’s a look at the satellite on the launch pad (it’s behind the yellow rectangle).
Sundar Pichai tweet media
English
199
441
4K
206.1K
Roopal Garg
Roopal Garg@roopalgarg·
@simi_97k @gneubig interesting use case.. thank you @simi_97k clearly more work to do on this axis 👍...stay tuned side note: the un-cut version of`tamagoyaki` aka Japanese rolled omelette is what it tried perhaps to link to the `dosa` which seems closer.
English
1
0
1
78
Simran Khanuja
Simran Khanuja@simi_97k·
final output! not perfect I know, but this was so fun to play around with :) I don't know much at all about japanese food but @gneubig says that you don’t dip tamagoyaki in sauce and one sees crepes in Japan, but the presentation seems more european 🇪🇺 If y'all have any inputs on what the model should generate / how we should prompt it, feel free to lmk! (fin)
Simran Khanuja tweet media
English
1
0
1
428
Simran Khanuja
Simran Khanuja@simi_97k·
Playing around with Gemini 2.0 Flash for image editing! Some things I noticed: Image editing significantly improves when you ask the model to retrieve a natural image of what its trying to generate! Its very chatty though, also defensive and overly-apologetic :') A thread on my attempts at image transcreation using the model 🧵(1/n)
Oriol Vinyals@OriolVinyalsML

Gemini 2.0 Flash debuts native image gen! Create contextually relevant images, edit conversationally, and generate long text in images. All totally optimized for chat iteration. Try it in AI Studio or Gemini API. Blog: developers.googleblog.com/en/experiment-…

English
1
1
15
4.6K
Roopal Garg retuiteado
Demis Hassabis
Demis Hassabis@demishassabis·
One of the features I've mosted wanted in AI Studio for a long time! Just paste a YouTube link into the command line and ask Gemini 2.0 questions about it - it's multimodal understanding is kind of mindblowing. Try it here: aistudio.google.com
Logan Kilpatrick@OfficialLoganK

Introducing YouTube video 🎥 link support in Google AI Studio and the Gemini API. You can now directly pass in a YouTube video and the model can usage its native video understanding capabilities to use that, with just a link! 🚢

English
233
334
2.9K
663.2K
Roopal Garg retuiteado
Radu Soricut
Radu Soricut@RSoricut·
The 3rd 🚀 today, the one we sweated a lot over -- Native image generation with Gemini 2.0 Flash available to all developers as an experimental release in the Gemini API and Google AI Studio: developers.googleblog.com/en/experiment-…
English
1
5
35
1.8K
Roopal Garg retuiteado
Robert Riachi
Robert Riachi@robertriachi·
some cool examples with Gemini 2.0 native image output 🧵
Robert Riachi tweet media
English
65
182
3.8K
481.2K
Roopal Garg retuiteado
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Native image generation with Gemini 2.0 Flash is now available to all developers via an experimental release in the Gemini API and Google AI Studio!! The chat based image editing and creation is so much fun to play with 🧵
GIF
English
287
174
1.4K
1M