Azade Nova

21 posts

Azade Nova

Azade Nova

@Azade_na

Research Scientist at Google DeepMind

Katılım Ocak 2013
94 Takip Edilen222 Takipçiler
Azade Nova retweetledi
Rasool Fakoor
Rasool Fakoor@rasoolfa·
Too many RL ideas die at the edge of the LLM/VLM/VLA training stack. Not anymore. With FeynRL, new algorithms ideas do not have to fight the whole stack 🚀. Focus on the alg while still training very large models. github.com/FeynRL-project… Try it, 🌟 it, send feedback.
English
1
1
3
171
Azade Nova
Azade Nova@Azade_na·
Happy to share what I've been working on for the past few months. Check out updated Gemini 1.5 report. goo.gle/GeminiV1-5
Jeff Dean@JeffDean

Gemini 1.5 Model Family: Technical Report updates now published In the report we present the latest models of the Gemini family – Gemini 1.5 Pro and Gemini 1.5 Flash, two highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. Our latest report details notable improvements in Gemini 1.5 Pro within the last four months. Our May release demonstrates significant improvement in math, coding, and multimodal benchmarks compared to our initial release in February. Furthermore, the 1.5 Pro Model is now stronger than 1.0 Ultra. The latest Gemini 1.5 Pro is now our most capable model for text and vision understanding tasks, surpassing 1.0 Ultra on 16 of 19 text benchmarks and 18 of 21 of the vision understanding benchmarks. The table below highlights the improvement in average benchmark performance for different categories in 1.5 Pro since Feb, and also shows the strength of the model relative to the 1.0 Pro and 1.0 Ultra models. The 1.5 Flash model also compares very well against the 1.0 Pro and 1.0 Ultra models. One clear example of this can be seen on MMLU On MMLU we find that 1.5 Pro surpasses 1.0 Ultra in the regular 5-shot setting scoring 85.9% versus 83.7%. However with additional inference compute, via majority voting on top of multiple language model samples, we can get a performance of 91.7% versus Ultra’s 90.0%, which extends the known performance ceiling of this task. @OriolVinyalsML and I are very proud of the whole Gemini team, and it’s fantastic to see this progress and to share these highlights from our Gemini Model Family. Read the updated report here: goo.gle/GeminiV1-5

English
0
1
12
2.5K
Azade Nova
Azade Nova@Azade_na·
Check out our gradient-free structural pruning approach for large models. No need for retraining or labeled data! In just a few minutes on one GPU, reduces up to 40% of the original FLOPs at minimal loss of accuracy. w/ @hanjundai, Dale Schuurmans paper: arxiv.org/abs/2303.04185
Azade Nova tweet media
English
0
3
16
4.3K
Azade Nova
Azade Nova@Azade_na·
Join us tomorrow, Dec 3 at #NeurIPS2022 for the 6th edition of the ML for Systems workshop. We will kick off with a keynote presentation from Google SVP Jeff Dean (@JeffDean). Tune in tomorrow at 8:30am Central Time to see what’s new at Google AI. (Room 396) #mlforsystems
English
1
0
13
0
Azade Nova retweetledi
Google AI
Google AI@GoogleAI·
Interested in learning more about ML for Systems research at Google? Stop by the booth today at 10:30 am to hear @martin_maas discuss the latest in ML-driven systems! And if you want to learn more about the area, join the ML for Systems workshop on Saturday, December 3rd.
Google AI tweet media
English
4
23
104
0
Azade Nova retweetledi
Martin Maas
Martin Maas@martin_maas·
Join us on Saturday, December 3 at #NeurIPS2022 for the 6th edition of the ML for Systems workshop. We have an exciting program with 5 invited talks and 19 accepted papers, as well as plenty of opportunities to chat with researchers in the area. mlforsystems.org
English
0
3
11
0
Azade Nova retweetledi
Hanie Sedghi
Hanie Sedghi@HanieSedghi·
🔥 LLMs can do OOD reasoning:We show that we can teach algorithms to LLMs with only three examples and it generalizes to much longer input length as much as the context length allows! We can also teach multiple algorithms,compose them to teach complex ones & use them as tools! 🔥
Hattie Zhou@oh_that_hat

“LLMs can’t even do addition” 📄🚨We show that they CAN add! To teach algos to LLMs, the trick is to describe the algo in enough detail so that there is no room for misinterpretation w/ @Azade_na @hugo_larochelle @AaronCourville @bneyshabur @HanieSedghi arxiv.org/abs/2211.09066

English
1
7
64
0
Azade Nova retweetledi
Anna Goldie
Anna Goldie@annadgoldie·
Excited to share that our work has been published in Nature! Our RL agent generates chip layouts in just a few hours, whereas human experts can take months. These superhuman AI-generated layouts were used in Google's latest AI accelerator (TPU-v5)! nature.com/articles/s4158…
English
46
322
2K
0
Azade Nova retweetledi
Azalia Mirhoseini
Azalia Mirhoseini@Azaliamirh·
Thrilled to announce that our work on RL for chip floorplanning was published in Nature & used in production to design next generation Google TPUs, with potential to save thousands of hours of engineering effort for each next generation ASIC: nature.com/articles/s4158… (1/7)
Azalia Mirhoseini tweet media
English
46
232
1.5K
0
Azade Nova retweetledi
Alex Smola
Alex Smola@smolix·
Check out TraDE, our new transformer based density estimator arxiv.org/abs/2004.02441. It beats NAF, BNAF, TAN, FFJORD, NSF, AEM on reference datasets. Works well for #bumblebee, too (left: original, right: estimate). @rasoolfa @pratikac and Jonas Mueller are the real stars here.
Alex Smola tweet media
English
0
7
34
0
Azade Nova
Azade Nova@Azade_na·
@Uber_Support I contacted you via in-and and sent you DM here. My luggage was in the trunk of the car. Uber driver left without opening the trunk and giving my things. All my things in the Uber car. I haven't heard back from you, can u update?
English
0
0
0
0
Azade Nova retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
Today we are excited to release video recordings of lectures from "Advanced Deep Learning and Reinforcement Learning", a course on deep RL taught at @UCL earlier this year by DeepMind researchers: youtube.com/playlist?list=… Enjoy!
English
45
1.6K
4.4K
0
Azade Nova retweetledi
WIRED
WIRED@WIRED·
We've seen @BostonDynamics' infamous robot dog open doors, and climb stairs. But now it can navigate complicated construction sites, and do detailed work inspections too 👀: wired.trib.al/0CcVEmR
English
41
318
551
0
Azade Nova retweetledi
Jeff Dean
Jeff Dean@JeffDean·
If you want to do ML research, consider applying for the 2019 Google AI Residency program! You'll have the opportunity to conduct cutting-edge research working in a wide variety of areas, and this year we're expanding to host residents in even more locations.
Google AI@GoogleAI

Applications for the 2019 Google AI Residency program are now open! Visit g.co/airesidency/ap… for more information on how to apply. To learn more about the accomplishments of the recently graduated second class of residents, visit ↓ goo.gl/5QZbsF

English
8
132
436
0