Azade Nova

21 posts

Azade Nova

@Azade_na

Research Scientist at Google DeepMind

Katılım Ocak 2013

94 Takip Edilen222 Takipçiler

Azade Nova retweetledi

Rasool Fakoor@rasoolfa·27 Nis

Too many RL ideas die at the edge of the LLM/VLM/VLA training stack. Not anymore. With FeynRL, new algorithms ideas do not have to fight the whole stack 🚀. Focus on the alg while still training very large models. github.com/FeynRL-project… Try it, 🌟 it, send feedback.

English

171

Azade Nova@Azade_na·17 May

Happy to share what I've been working on for the past few months. Check out updated Gemini 1.5 report. goo.gle/GeminiV1-5

Jeff Dean@JeffDean

Gemini 1.5 Model Family: Technical Report updates now published In the report we present the latest models of the Gemini family – Gemini 1.5 Pro and Gemini 1.5 Flash, two highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. Our latest report details notable improvements in Gemini 1.5 Pro within the last four months. Our May release demonstrates significant improvement in math, coding, and multimodal benchmarks compared to our initial release in February. Furthermore, the 1.5 Pro Model is now stronger than 1.0 Ultra. The latest Gemini 1.5 Pro is now our most capable model for text and vision understanding tasks, surpassing 1.0 Ultra on 16 of 19 text benchmarks and 18 of 21 of the vision understanding benchmarks. The table below highlights the improvement in average benchmark performance for different categories in 1.5 Pro since Feb, and also shows the strength of the model relative to the 1.0 Pro and 1.0 Ultra models. The 1.5 Flash model also compares very well against the 1.0 Pro and 1.0 Ultra models. One clear example of this can be seen on MMLU On MMLU we find that 1.5 Pro surpasses 1.0 Ultra in the regular 5-shot setting scoring 85.9% versus 83.7%. However with additional inference compute, via majority voting on top of multiple language model samples, we can get a performance of 91.7% versus Ultra’s 90.0%, which extends the known performance ceiling of this task. @OriolVinyalsML and I are very proud of the whole Gemini team, and it’s fantastic to see this progress and to share these highlights from our Gemini Model Family. Read the updated report here: goo.gle/GeminiV1-5

English

2.5K

Azade Nova@Azade_na·25 Tem

If you want to learn about efficient inference in LLMs, stop by to our poster #733 on Wed (July 26) at 11am icml.cc/virtual/2023/p…, #ICML23

Azade Nova@Azade_na

Check out our gradient-free structural pruning approach for large models. No need for retraining or labeled data! In just a few minutes on one GPU, reduces up to 40% of the original FLOPs at minimal loss of accuracy. w/ @hanjundai, Dale Schuurmans paper: arxiv.org/abs/2303.04185

English

655

Azade Nova@Azade_na·9 Mar

English

4.3K

Azade Nova retweetledi

Mimee // smart casual dark and academic@MimeeXu·5 Ara

Truly an honor. Didn’t sleep much but we did it!

Mimee // smart casual dark and academic tweet media

Jeff Dean@JeffDean

Thank you to everyone on the organizing and steering committee for putting together a great ML for Systems workshop at NeurIPS!

English

Azade Nova@Azade_na·3 Ara

Organizers @martin_maas @Azade_na @BenoitSteiner Neel Kant @DZhang50 Steering @annadgoldie @Azaliamirh @jonathanrraiman @miladhash @kswersk

English

Azade Nova@Azade_na·3 Ara

Join us tomorrow, Dec 3 at #NeurIPS2022 for the 6th edition of the ML for Systems workshop. We will kick off with a keynote presentation from Google SVP Jeff Dean (@JeffDean). Tune in tomorrow at 8:30am Central Time to see what’s new at Google AI. (Room 396) #mlforsystems

English

Azade Nova retweetledi

Google AI@GoogleAI·1 Ara

Interested in learning more about ML for Systems research at Google? Stop by the booth today at 10:30 am to hear @martin_maas discuss the latest in ML-driven systems! And if you want to learn more about the area, join the ML for Systems workshop on Saturday, December 3rd.

English

104

Azade Nova retweetledi

Martin Maas@martin_maas·1 Ara

Join us on Saturday, December 3 at #NeurIPS2022 for the 6th edition of the ML for Systems workshop. We have an exciting program with 5 invited talks and 19 accepted papers, as well as plenty of opportunities to chat with researchers in the area. mlforsystems.org

English

Azade Nova retweetledi

Mimee // smart casual dark and academic@MimeeXu·2 Ara

As a 4th time organizer, I must say we have an amazing program this year at ML for Systems at NeurIPS. Before I forget — Organizers @martin_maas @Azade_na @BenoitSteiner Neel Kant @DZhang50 Steering @annadgoldie @Azaliamirh @jonathanrraiman @miladhash @kswersk + PC. 🙏

English

Azade Nova retweetledi

Hanie Sedghi@HanieSedghi·17 Kas

🔥 LLMs can do OOD reasoning:We show that we can teach algorithms to LLMs with only three examples and it generalizes to much longer input length as much as the context length allows! We can also teach multiple algorithms,compose them to teach complex ones & use them as tools! 🔥

Hattie Zhou@oh_that_hat

“LLMs can’t even do addition” 📄🚨We show that they CAN add! To teach algos to LLMs, the trick is to describe the algo in enough detail so that there is no room for misinterpretation w/ @Azade_na @hugo_larochelle @AaronCourville @bneyshabur @HanieSedghi arxiv.org/abs/2211.09066

English

Azade Nova retweetledi

Anna Goldie@annadgoldie·9 Haz

Excited to share that our work has been published in Nature! Our RL agent generates chip layouts in just a few hours, whereas human experts can take months. These superhuman AI-generated layouts were used in Google's latest AI accelerator (TPU-v5)! nature.com/articles/s4158…

English

322

Azade Nova retweetledi

Azalia Mirhoseini@Azaliamirh·9 Haz

Thrilled to announce that our work on RL for chip floorplanning was published in Nature & used in production to design next generation Google TPUs, with potential to save thousands of hours of engineering effort for each next generation ASIC: nature.com/articles/s4158… (1/7)

English

232

1.5K

Azade Nova retweetledi

Hanjun Dai@hanjundai·12 Tem

Autoregressive graph generation is powerful but slow. Our recent work reduces its complexity from O(V^2) to O((E+V) log V), with sublinear mem cost and training parallelism. Paper: arxiv.org/abs/2006.15502 Code: github.com/google-researc… w/ @Azade_na @liyuajia @daibond_alpha Dale

GIF

English

141

Azade Nova retweetledi

Alex Smola@smolix·7 Nis

Check out TraDE, our new transformer based density estimator arxiv.org/abs/2004.02441. It beats NAF, BNAF, TAN, FFJORD, NSF, AEM on reference datasets. Works well for #bumblebee, too (left: original, right: estimate). @rasoolfa @pratikac and Jonas Mueller are the real stars here.

English

Azade Nova@Azade_na·3 Ara

@Uber_Support I contacted you via in-and and sent you DM here. My luggage was in the trunk of the car. Uber driver left without opening the trunk and giving my things. All my things in the Uber car. I haven't heard back from you, can u update?

English

Azade Nova retweetledi

Google DeepMind@GoogleDeepMind·23 Kas

Today we are excited to release video recordings of lectures from "Advanced Deep Learning and Reinforcement Learning", a course on deep RL taught at @UCL earlier this year by DeepMind researchers: youtube.com/playlist?list=… Enjoy!

English

1.6K

4.4K

Azade Nova retweetledi

WIRED@WIRED·12 Eki

We've seen @BostonDynamics' infamous robot dog open doors, and climb stairs. But now it can navigate complicated construction sites, and do detailed work inspections too 👀: wired.trib.al/0CcVEmR

English

318

551

Azade Nova retweetledi

Jeff Dean@JeffDean·10 Eki

If you want to do ML research, consider applying for the 2019 Google AI Residency program! You'll have the opportunity to conduct cutting-edge research working in a wide variety of areas, and this year we're expanding to host residents in even more locations.

Google AI@GoogleAI

Applications for the 2019 Google AI Residency program are now open! Visit g.co/airesidency/ap… for more information on how to apply. To learn more about the accomplishments of the recently graduated second class of residents, visit ↓ goo.gl/5QZbsF

English

132

436

Azade Nova retweetledi

Kyunghyun Cho@kchonyc·12 Eki

the paper behind BERT is now online: arxiv.org/abs/1810.04805 BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova arxiv.org/abs/1810.04805

English

205

Keşfet

@hanjundai @martin_maas @BenoitSteiner @DZhang50 @annadgoldie @Azaliamirh @jonathanrraiman @miladhash