alex zook

3.1K posts

alex zook

@zookae

machine learning engineer at NVIDIA (prev Unity, Blizzard; AI PhD @ Georgia Tech) all tweets my own

Katılım Eylül 2012

145 Takip Edilen873 Takipçiler

alex zook retweetledi

Andrew White 🐦‍⬛@andrewwhite01·5 Haz

At FutureHouse, we’ve noticed scientific agents are good at applying average intelligence across tasks. They always seem to make the obvious choices, which is good, but discovery sometimes requires more intuition and insight than average. We’ve made the first step today towards superhuman insight by training a reasoning model for a specific domain of science: designing drug-like molecules. We’re releasing a 24B open-weights reasoning model called 𝚎𝚝𝚑𝚎𝚛𝟶. 𝚎𝚝𝚑𝚎𝚛𝟶 has been trained with reinforcement learning to exceed frontier and human experts across a range of molecular design tasks. 𝚎𝚝𝚑𝚎𝚛𝟶 takes in natural language, reasons in English, and outputs a new molecule. 𝚎𝚝𝚑𝚎𝚛𝟶 is now a tool for our chemistry design agent, Phoenix, which can call upon it to design molecules. Training a reasoning model for a scientific domain like chemistry, rather than math or programming, required a number of small technical advances. For example, we developed an iterative method of split specialist models and aggregation of reasoning traces. Another example is we used LLMs to rewrite questions that were partially solved. A major finding from this work is that we can train with >10x efficiency per experimental measurement when using a reasoning model, rather than fine-tuning. We also found that reasoning models can learn new tasks, developed specifically for this paper and not in pretraining corpora. We even saw a task have 0% performance until 100 steps into RL, at which it randomly solved once. This, along with our change in modality from natural language to molecules, bodes well for applying reasoning models far from natural language. Reasoning models in science are the future. Scientific tasks are naturally verifiable rewards: the physical world is the ultimate arbiter of accuracy, rather than human contractors. The data efficiency gain and ability to exceed frontier models with relatively few parameters/compute mean that we should expect more scientific reasoning models soon. Congrats to team @SidN137, James, @Ryan__Rhys, Albert, @GWellawatte , @maykcaldas , @ludomitch , and @SGRodriques. Thanks to @VoltagePark @nvidia and @huggingface for supporting us, and huge thanks to @ericschmidt for funding @FutureHouseSF The model weights, reward model, and new benchmark are open source. You can also read more about scientific reasoning models in our exclusive with Nature.

English

411

80.4K

alex zook@zookae·21 May

amazing progress in ai directly contributing novel scientific discoveries!

Sam Rodriques@SGRodriques

Today, we’re announcing the first major discovery made by our AI Scientist with the lab in the loop: a promising new treatment for dry AMD, a major cause of blindness. Our agents generated the hypotheses, designed the experiments, analyzed the data, iterated, even made figures for the paper. The resulting manuscript is a first-of-a-kind in the natural sciences, in which everything that needed to be done to write the paper was done by AI agents, apart from actually conducting the physical experiments in the lab and writing the final manuscript. We are also introducing Robin, the first multi-agent system that fully automates the in-silico components of scientific discovery, which made this discovery. This is the first time that we are aware of that hypothesis generation, experimentation, and data analysis have been joined up in closed loop, and is the beginning of a massive acceleration in the pace of scientific discovery that will be driven by these agents. We will be open-sourcing the code and data next week. Robin is a multi-agent system that uses Crow, Falcon, and Finch, the agents on our platform, to generate novel hypotheses, plan experiments, and analyze data. We asked Robin to find a new treatment for dry age-related macular degeneration. Robin considered the disease mechanisms associated with dry AMD, proposed a specific experimental assay that could be used to evaluate hypotheses in the wet lab, and proposed specific molecules we could test in that assay. We tested the molecules and gave it the resulting data, which it analyzed before proposing more experiments. In the end, it identified Ripasudil, a Rho Kinase inhibitor (ROCK inhibitor) that is approved in Japan for several other diseases, which seems very promising as potential treatment for dry AMD. It also identified specific molecular mechanisms that might underlie the effects of Ripasudil in RPE cells, from an RNA sequencing experiment it proposed. To be clear, no one has proposed using ROCK inhibitors to treat dry AMD in the literature before, as far as we can find, and I think it would have been very difficult for us to come up with this hypothesis without the agents. We have also run the proposed treatment by several experts in AMD, who confirm that it is interesting and novel. Moreover, this project was fast: with Robin in hand, the entire project took about 10 weeks, which is way shorter than it would have taken if we had been doing all of the in-silico components ourselves. Important caveats: We are real biologists at FutureHouse, so I want to be clear that although the discovery here is exciting, we are not claiming that we have cured dry AMD. Fully validating this hypothesis as a treatment for dry AMD will take human trials, which will take much longer. Also, this discovery is cool, but it is not yet a "move 37"-style discovery. At the current rate of progress, I'm sure we will get to that level soon. Congratulations to the team. Congratulations in particular to Robin, which generated the hypotheses, proposed the experiments, analyzed the data and generated the figures. And major congratulations also to the human team, which built Robin: @MichaelaThinks, @agreeb66, @benjamin0chang, @ludomitch, Mo Razzak, Kiki Szostkiewicz, and Angela Yiu.

English

166

alex zook retweetledi

Frost Giant Studios@Frost_Giant·3 Nis

One of our friends announced his new indie game today--give it a look! MoteMancer is an automation game (think Factorio🏭) in a magical fantasy setting.🧙‍♂️ 🎁Wishlist here! store.steampowered.com/app/3320980/Mo…

English

5.2K

alex zook@zookae·3 Mar

new workshop opportunity to get your work out on games research!

Ekta Prashnani@ekta_prashnani

The 2nd workshop on Computer Vision for Videogames will be organized at CVPR 2025: this is a great venue for gaming-related research (think AI, genAI, graphics, RL, agents, HCI — with applications to videogames). There is still time to submit: sites.google.com/view/cv2-2025/ #CVPR2025

English

105

alex zook retweetledi

RL_Conference@RL_Conference·2 Ara

The call for papers for RLC is now up! Abstract deadline of 2/14, submission deadline of 2/21! Please help us spread the word. rl-conference.cc/callforpapers.…

English

24.1K

alex zook@zookae·29 Eki

@hardmaru perhaps related

Tania Babina 🇺🇸 🇺🇦@TaniaBabina

This cool paper shows that robot adoption in Japan led to INCREASED employment! Nice new evidence on the debate about the effects of robots on employment.

English

hardmaru@hardmaru·28 Eki

“On average, 50% of global respondents (64% in America) feel nervous [about AI]. Few Japanese reckon AI will hasten the apocalypse: just 12% of Japanese think that it will make the future worse, the 2nd-lowest share in a study of 21 countries” No paywall: archive.is/BrW6G

English

7.5K

hardmaru@hardmaru·28 Eki

Japan is remarkably open to AI, but slow to make use of it @TheEconomist “The land of Doraemon embraces the new technology in theory but not in practice.” economist.com/asia/2024/10/2…

English

24.1K

alex zook@zookae·28 Eyl

exciting potential for robotics and simulations more broadly. of particular interest is getting foundation models to produce output aligned with details of the input prompt (not just “vaguely right”)

Fan-Yun Sun@sunfanyun

Training RL/robot policies requires extensive experience in the target environment, which is often difficult to obtain. How can we “distill” embodied policies from foundational models? Introducing FactorSim! #NeurIPS2024 We show that by generating prompt-aligned simulations and training a policy on them without collecting any experience in the target environment, we can achieve zero-shot performance close to policies trained on millions of target environment experiences in many classic RL environments. You can generate RL simulations on our project website: cs.stanford.edu/~sunfanyun/fac… More in 🧵 1/7

English

374

alex zook retweetledi

Fan-Yun Sun@sunfanyun·28 Eyl

English

211

35.2K

alex zook@zookae·26 Tem

@rndmcnlly of course!! (and congrats ;)

English

Adam M. Smith (史亚当)@rndmcnlly·22 Haz

Oh, hey, I got tenure.

English

104

5.5K

alex zook@zookae·8 Tem

we've extended the AIIDE doctoral consortium submission deadline to JULY 26 we're seeking applicants at one of two stages in their degree process a) assembling a thesis committee b) preparing for final defense and seeking career guidance sites.google.com/gcloud.utah.ed… please share!

English

141

alex zook@zookae·8 Haz

@Scholars_Stage curious: which Chinese accent is the closest match to the British accent?

English

132

T. Greer@Scholars_Stage·7 Haz

In fact I often give advice to Chinese learning English that they copy British or Australian accents because 1) those accents are high prestige in America—in particular people assume people with British accents are smart irregardless of actually intelligence 2) having such an accent will distract away from general flaws in English pronunciation

English

8.3K

T. Greer@Scholars_Stage·7 Haz

I actually think this is terrible advice. On the one hand you can’t avoid having *some* accent—the people who learned in Taiwan sound like parodies of that accent too. On the other hand, whenever I find non-native English speakers with a regional English accent—say, the Chinese kid who learned to speak English in Texas or Australia—it almost always makes them sound *more* fluent to my English speaking ears.

Nik Stankovic@nikstankovic_

BTW, as someone who learned his Mandarin in Beijing, one of the more annoying things is listening to people--Chinese included--who try to fake the Beijing "r" rolls into their Mandarin. It just sounds so fake. Don't do it. It's like me, who learned his English in the US, trying to fake an Irish accent. It just does not work. I was once in an amateur theater troupe in San Jose, California called The Mostly Irish Theater Company. And they were mostly first and second generations Irish in it too, except me and a couple of other Americans. And we put up a play where my role was suppose to have an Irish accent. I just could not do it. So we had to change the script where I was the American doctor.

English

161

38.8K

alex zook@zookae·31 Ara

@RiotIksar @HardcoreHistory @AcquiredFM @djrosent @gilbert @gamecraftpod @blakeir @mitchlasky highest density of information and insight: @cowenconvos

English

August Dean Ayala@RiotIksar·29 Ara

What is the best podcast you listen to? I started with games, branched out to entertainment, business, now just any podcast that has a great storyteller. Some I highly recommend: @HardcoreHistory @AcquiredFM (@djrosent, @gilbert) @gamecraftpod (@blakeir, @mitchlasky)

English

8.2K

alex zook@zookae·14 Ara

Anita’s done some stellar work taking AI from a tool for offline inspiration to a live interactive brush for working directly in 3d environments. it’s amazing how things change when you can go from material ideas to painting with it in an environment in real time.

Masha Shugrina@_shumash

So proud of my team for presenting the first interactive #texture #painting with #AI at #SIGGRAPHAsia2023 Real-Time Live. Well done Anita Hu and team!! We want the artist to stay in control 🎨🖌️🤗 developer.nvidia.com/blog/nvidia-re…

English

299

alex zook retweetledi

Masha Shugrina@_shumash·14 Ara

English

5.3K

alex zook@zookae·8 Eki

how is it already 10 years?! don’t miss it!

AIIDE Conference@AIIDEconference

#AIIDE23 kicks off tomorrow with the Experimental AI in Games workshop! Now in its 10th year, @exag20xx has become a mainstay of our workshop series, and emphasizes showing, teaching, and inventing, alongside traditional paper presentations. Check it out! exag.org/schedule

English

1.7K

alex zook@zookae·29 Ağu

@MaxCRoser is there a regional breakdown of these estimates? would love to compare post-Rome Europe with post-Han China, for example. wondering if the average is hiding regional progress that gets swamped by global trends

English

234

Max Roser@MaxCRoser·28 Ağu

For a very long time, global average land use for agriculture remained largely unchanged. In the last decades, this changed dramatically. It became possible to reduce agricultural land use per person.

English

247

819

274.8K

alex zook retweetledi

NVIDIA AI Developer@NVIDIAAIDev·9 Ağu

Thank you @SIGGRAPH and Real-Time Live. We greatly appreciate the award for 🏆 Best in Show for #SIGGRAPH2023 for our Text2Materials demo by the #NVIDIAResearch team. 👀 See the demo: nvda.ws/45i2rmH

English

103

51K

alex zook@zookae·3 Ağu

@Gaiazelle any chance i could get a code?

English

alex zook retweetledi

AI and Games School 2026@GameAISchool·31 Mar

We are thrilled to announce our sponsorship with @Sony SIE! They are offering scholarships (tickets and a small travel allowance included). To celebrate, a few tickets will be available for students. See you there! Apply now: #scholarship" target="_blank" rel="nofollow noopener">school.gameaibook.org/#scholarship #sony #airesearch

English

4.6K

alex zook retweetledi

Matthew Guzdial@MatthewGuz·30 May

Tired of AI hype but interested in reading about how machine learning (generative AI if you like) can be used to generate content in games? Well do I have the book for you!

English

465

50.4K

Keşfet

@SidN137 @Ryan__Rhys @GWellawatte @maykcaldas @ludomitch @SGRodriques @VoltagePark @nvidia