Google AI

3.2K posts

Google AI

@GoogleAI

Making AI helpful for everyone. Show thinking ↓

Mountain View, CA 参加日 Nisan 2009

30 フォロー中2.4M フォロワー

固定されたツイート

Google AI@GoogleAI·15 Nis

Today we launched Gemini 3.1 Flash TTS, our most expressive and controllable text-to-speech model yet. This launch [excitement] includes audio tags! 🗣🏷 Audio tags [explanatory] are a seamless way to guide vocal style, pace, and delivery using natural language commands embedded directly in your text. Want a different tempo or tone? [amazement] Just tag the audio to steer the AI-speech output! The model supports 70+ languages (24 of which are high-quality evaluated languages, including: Japanese, Hindi, and Arabic). Watch the audio tags in action in the demo below ↓

English

113

307

2.3K

195.4K

Google AI@GoogleAI·4d

Our teams have been busyyy! Here are some key updates from the past week: — @GoogleCloud unveiled a suite of AI innovations at our Cloud Next event, including our eighth generation TPUs (TPUt for inference + TPUi for reasoning), Gemini Enterprise Agent Platform, Agentic Data cloud, Workspace Intelligence, and beyond — Gemini Embedding 2, our natively multimodal embedding model, became generally available via the Gemini API and in Gemini Enterprise Agent Platform (the evolution of Vertex AI) — @StitchbyGoogle open-sourced the draft specification for DESIGN.md, so it can be used across any single tool or platform — New autonomous search agents, Deep Research and Deep Research Max, launched to bring MCP support, native visualizations, and unprecedented analytical quality to research workflows across the web or custom sources — Google AI Pro and Ultra subscribers now get increased usage limits and access to Nano Banana Pro and Gemini Pro models in @GoogleAIStudio — @GoogleDeepMind introduced Decoupled DiLoCo, our new resilient and flexible way to train advanced AI models across multiple data centers

English

415

47.7K

Google AI@GoogleAI·5d

Last week, we launched Gemini 3.1 TTS, our latest and best text-to-speech model. This new model introduces [awe] audio tags, an intuitive way to guide vocal style, pace, and delivery. Here are some tips on the best ways to use audio tags in your prompts: 1. All inline tags must be enclosed in square brackets, such as [screams] or [whispers] 2. Insert these tags exactly where you want the transition to occur and make sure to avoid placing tags directly next to each other 3. Use tags like [slow] or [fast] to control the pace of the delivery, or even [short pause] or [long pause] to ramp up the anticipation in dramatic moments 4. The model also offers granular control over vocalizations, allowing you to direct the delivery with cues like [cackles] or [whispers] 5. An ideal audio tag formula could look something like: [encouraging] Let’s try that last sentence again to make sure that you nailed it. [slow] "L'oiseau s'est envolé." [short pause] Perfect! [laughs] You're a natural. No matter what you’re developing — from [scholarly] a language learning tool, to [mysterious] an interactive podcast app, to [friendly] more adaptive customer service offerings, and beyond — these prompting tips will equip you to start building with Gemini 3.1 TTS.

English

643

51.8K

Google AI@GoogleAI·21 Nis

@GoogleAIStudio Check out our blog post to learn more: blog.google/innovation-and…

English

24K

Google AI@GoogleAI·21 Nis

Calling all builders 🛠️📣 Google AI Pro and Ultra subscribers will now get increased usage limits and access to Nano Banana Pro and Gemini Pro models in @GoogleAIStudio — no API key required. Sign in with your subscriber account today to take your ideas from prototype to production.

English

141

1.5K

90.7K

Google AI@GoogleAI·20 Nis

Beyond generating high-fidelity visuals, we wanted to test the limits of what Nano Banana Pro can do. We worked with design partners Porto Rocha to build out a hypothetical brand called YOYOYO to see how the model would handle the task. Here’s what we found: 🎨Brand consistency: Across logos, colors, and typography, the model maintained a strict, cohesive brand identity (even for wildly diverse concepts) 🛍️Environmental realism: We asked to see the products in storefront and studio mockups. It nailed accurate lighting, shadows, and physical proportions - even when upscaled for massive retail displays 🪀Spatial accuracy: We tested spatial volumes for physical packaging. The generated proportions were so precise that we were able to 3D-print the functional yo-yo How have you been pushing the limits of Nano Banana Pro? Let us know in the replies below!

English

110

1.3K

116.8K

Google AI@GoogleAI·17 Nis

What a week! Here’s everything we shipped: — Gemini 3.1 Flash TTS, our latest text-to-speech model, featuring native multi-speaker dialogue and improved controllability and audio tags for more natural, expressive voices in 70+ languages — Gemini Robotics-ER 1.6 by @GoogleDeepMind, an upgrade designed to help robots reason about the physical world — The @GeminiApp for Mac desktop (tip: Use Option + Space to access the app via shortcut) — Personal Intelligence in @GeminiApp has new integrations with @GooglePhotos and Nano Banana 2, making it easier to create relevant, personalized images. Available for AI Pro, Plus, and Ultra subscribers in the US — A couple fun additions in @GoogleAIStudio to make building easier, including Design previews and tab tab tab functionality — Skills in @GoogleChrome, which let you save and reuse your most helpful Gemini prompts and run them in your browser with a single click

English

871

97.6K

Google AI@GoogleAI·15 Nis

@theoledgers All audio generated by Gemini 3.1 Flash TTS is watermarked with SynthID!

English

3.1K

Theo Ledger@theoledgers·15 Nis

@GoogleAI Great. Do you have watermarks in the audio to help identify for harmful use?

English

3.1K

Google AI@GoogleAI·15 Nis

English

113

307

2.3K

195.4K

Google AI@GoogleAI·15 Nis

@HoodyAndShorts Placeholders? No these are just examples of how the audio tags work 🙂

English

173

Google AI@GoogleAI·15 Nis

Gemini 3.1 Flash TTS is rolling out in Google Vids and is available today in preview via the Gemini API and in @GoogleAIStudio. Whether you’re creating a pitch deck or recording a passion project, transform your scripts into studio-quality narration: blog.google/innovation-and…

English

106

23.2K

Google AI@GoogleAI·13 Nis

Prefer watching on @YouTube while scrolling through the comment section? We get it: youtu.be/b1Pvt072wKQ?si…

YouTube

English

34.9K

Google AI@GoogleAI·13 Nis

Got a doodle for your next project laying around? Turn it into working software using @GoogleAIStudio and Nano Banana. Watch us vibe code a weather-responsive outfit selector app from a single, hand-drawn sketch:

English

224

45.4K

Google AI@GoogleAI·10 Nis

TGIF! Here are some of our favorite updates from the past week: — Notebooks in @GeminiApp, an integration with @NotebookLM that enables you to retrieve context from your private notebooks or convert your active chats into grounded sources for new research — The @GeminiApp on web now generates customizable interactive visualizations, including 2D and 3D models, directly in your chat to help you deconstruct complex concepts — The new AI-powered @GoogleFinance tool is shipping to 100+ countries, delivering market research, advanced charting, expanded real-time data, and more

English

278

50.6K

Google AI@GoogleAI·10 Nis

Share your Gemma 4 builds or the model variants you’re training in the replies below!

English

18.4K

Google AI@GoogleAI·10 Nis

A playable, digital piano built with Gemma 4 E2B (2B). x.com/0xCVYH/status/…

CV.YH@0xCVYH

Google lancou "Agent Skills" junto com o Gemma 4 Um app Android onde voce importa skills e o Gemma 4 E2B (2B) roda localmente no celular, raciocinando e usando as skills 962 likes em poucas horas. Isso e IA agentica rodando no bolso, sem cloud, sem API, sem custo O modelo de 2B parametros e suficiente pra tarefas praticas quando tem tools bem definidas Ja ta na Play Store. O futuro dos agentes nao e so desktop — e mobile-first

English

31.1K

Google AI@GoogleAI·10 Nis

We love seeing what you’ve built with Gemma 4, the open model family that we released last week. Here are a few fun examples, described by the builders in their own words (🧵):

English

699

100.9K

Google AI@GoogleAI·8 Nis

And if you'd rather watch on YouTube, here’s the link: youtube.com/watch?v=tYhgWR…

YouTube

English

30.5K

Google AI@GoogleAI·8 Nis

Curious about vibe coding? Or are you already shipping apps and just want an easier way to explain your new favorite hobby to your friends, parents, grandparents, etc.? Either way, this video is for you ⏯️⤵️

English

205

37.5K

ディスカバー

@GoogleCloud @StitchbyGoogle @GoogleAIStudio @GoogleDeepMind @GeminiApp @GooglePhotos @GoogleChrome @theoledgers