
Roshan Sharma
177 posts

Roshan Sharma
@RoshanSSharma2
Research Scientist @GoogleDeepMind | PhD @CMU_ECE | #SpeechProc #NLProc | Previously @AIatMeta @Qualcomm





It’s LIVE😏!! We heard your feedback on function calling, conversation quality, and handling background noise and interruptions. Our latest native audio model is out on preview🔥🔥 Go build with it and send us your feedback!






Gemini 2.5 Flash Preview now supports native audio output via the Live API for seamless and natural spoken interactions. With support for 30+ voices, build conversational AI agents and experiences that feel more intuitive and natural → #native-audio-output" target="_blank" rel="nofollow noopener">ai.google.dev/gemini-api/doc…

Gemini 2.5 Flash Preview now supports native audio output via the Live API for seamless and natural spoken interactions. With support for 30+ voices, build conversational AI agents and experiences that feel more intuitive and natural → #native-audio-output" target="_blank" rel="nofollow noopener">ai.google.dev/gemini-api/doc…

💬 Smarter dialogue: Gemini-powered native audio means Project Astra has better context and customizable accents. 🕹️ Takes action: Computer control lets it open and engage with apps at your direction. 🤝 Personalized help: Integrates with your @Gmail, @GoogleCalendar and more behind the scenes.





After 40 months of excellent research, on Jan 15th @Umberto_Senpai successfully completed his PhD journey. Umberto was definitely among our top students with several high-level publications and collaborations with top-notch labs. Congratulations Umberto🎉🎉🎉@FBK_research
