Matt Redlon

567 posts

@mattredlon

Chair AI Program & VP Digital Biology @MayoClinic. Thinking @timberwolfai. Co-founder @clarioanalytics. Lecturer @UMNTLI. Geek for Bio/ML/AI. Views are my own.

Minneapolis · Joined November 2008

270 Following · 573 Followers

Matt Redlon@mattredlon·10 Mar

I immediately thought of DeepMind’s FunSearch paper when I saw @karpathy’s post. The abstraction of this pattern is so powerful. I tried to implement in AutoGPT when the paper came out but the models weren’t there yet. deepmind.google/blog/funsearch…

Andrej Karpathy@karpathy

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then:
- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)

The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autor…

Part code, part sci-fi, and a pinch of psychosis :)

Matt Redlon@mattredlon·26 Jan

@patrickc To be fair one of my favorite Tolkien characters…

Patrick Collison@patrickc·26 Jan

It's going to be tough for startups when all the Lord of the Rings names are taken and the only thing left is something like Bombadil AI.

Matt Redlon@mattredlon·12 Aug

@nalidoust @tahoe_ai Congrats, Nima! Look forward to meeting you this week.

Nima Alidoust@nalidoust·11 Aug

We’ve raised $30M to build the foundational dataset for Virtual Cell Models: 1Bn single-cell datapoints, mapping 1M drug-patient interactions, to be shared with one partner. Our goal: Move the frontier - From models to precision medicines that help patients. @tahoe_ai 🧵

Matt Redlon@mattredlon·19 Jul

@NeelyTamminga Congratulations, Neely! What is the focus of the degree?

Neely@NeelyTamminga·19 Jul

News Update: I’m starting a PhD program at Gonzaga… 🤓

Matt Redlon@mattredlon·26 May

@deedydas

GIF

Deedy@deedydas·26 May

Heard a crazy rumor that Anthropic's corporate social responsibility is run by this megalomaniac called Phil who insists on calling his division... Philanthropic.

Matt Redlon@mattredlon·18 May

@emollick Advertising that is well executed and highly targeted does not feel interruptive. Instagram does it better than any other platform.

Ethan Mollick@emollick·18 May

Huh. People don’t actually mind ads on Facebook, at all.

AEA Journals@AEAjournals

Forthcoming in AER: Insights: "The Consumer Welfare Effects of Online Ads: Evidence from a 9-Year Experiment" by Erik Brynjolfsson, Avinash Collis, Daniel Deisenroth, Haritz Garro, Daley Kutzman, Asad Liaqat, and Nils Wernerfelt. aeaweb.org/articles?id=10…

Matt Redlon@mattredlon·18 Dec

@kalomaze @Dorialexander Just read perfect example of this yesterday: huggingface.co/spaces/Hugging…

kalomaze@kalomaze·18 Dec

@Dorialexander what makes me sad about small models right now is that they probably only suck because they are very "all or nothing"

a 500m model could go much, much farther if it was sometimes allowed to repeat computation for 10+ passes

kalomaze@kalomaze·18 Dec

mentioning "LLMs" in the context of an advertisement for something with 8GB of VRAM is the biggest scam of all time

Haider.@haider1

🚨 NVIDIA Introduces Jetson Nano Super
> compact AI computer capable of 70-T operations per second
> designed for robotics, it supports advanced models, including LLMs, and costs $249

Matt Redlon@mattredlon·23 Nov

Me too.

Ian Johnson 🔬🤖@enjalot

I am obsessed with Sparse Autoencoders! SAEs unpack so much existing value and unlock exciting new capabilities. It's happening in text, images and even proteins. This is a long thread with lots of links and quote tweets of the projects, articles and code that made me 🤯

Matt Redlon@mattredlon·13 Nov

@jxmnop I really enjoyed working through @rasbt’s “Build a Large Language Model (From Scratch)”. Not a textbook per se, but it could be taught from.

Matt Redlon@mattredlon·19 Sep

@Rainmaker1973 Time to retrain all of the self driving algorithms.

Massimo@Rainmaker1973·19 Sep

This truck has been adapted to give drivers behind a view of the road ahead through a large LED screen. This helps other vehicles increase their visibility when overtaking [📹 Denis Shvetsov]

Matt Redlon@mattredlon·14 Jul

@rasbt The Manning site is killing me, Sebastian! Keeps stalling out during purchase process. Anywhere else I can purchase ebook?

Sebastian Raschka@rasbt·16 Jun

If you are looking for a resource to understand the instruction fine-tuning process in LLMs, I've uploaded a notebook to implement the fine-tuning process from scratch: github.com/rasbt/LLMs-fro…

It explains
1. how to format the data into 1100 instruction-response pairs
2. how to apply a prompt-style template
3. and how to use masking.

Of course, this also includes a section on implementing an LLM-based automated process for evaluation. Happy coding!

Matt Redlon@mattredlon·14 Jun

@Supersam331 @natiakourdadze @AliHussein_20 It's in Queries.csv, Sam.

Sam@Supersam331·14 Jun

@natiakourdadze @AliHussein_20 Thanks Natia! i only got these files but didn't see a keywords. it's probably because i don't have any keywords setup. by keyword do you mean the keyword in google ads? so that means i need to create some ads campaign first?

Natia Kurdadze@natiakourdadze·13 Jun

If you are a startup founder and hate marketing, this is how you can get leads organically 🦄
1. Go to Google Search Console
2. Download CSV file and export all keywords
3. Look at the column called "Position"
4. See what you are almost ranking for
5. Create separate pages for each keyword
#buildinpublic
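Editor's sketch of steps 3 and 4 above, under stated assumptions: the export is the Queries.csv file mentioned in the reply, its query column is headed "Top queries", and "almost ranking" is read as an average position between 11 and 20 (roughly page two of results). The thread states none of these, the rows in SAMPLE_CSV are invented for illustration, and almost_ranking is a hypothetical helper name.

```python
import csv
import io

# Invented sample rows; a real export would be read from Queries.csv instead.
SAMPLE_CSV = """
Top queries,Clicks,Impressions,CTR,Position
invoice template,120,900,13.33%,3.2
free invoice generator,4,610,0.66%,12.4
invoice app for freelancers,1,280,0.36%,17.9
accounting software,0,150,0%,45.1
"""


def almost_ranking(csv_text, low=11.0, high=20.0):
    """Return (query, position) pairs whose average position is in [low, high].

    The 11-20 window is an assumption standing in for "almost ranking";
    adjust it to whatever threshold fits the site.
    """
    rows = csv.DictReader(io.StringIO(csv_text.strip()))
    hits = [
        (row["Top queries"], float(row["Position"]))
        for row in rows
        if low <= float(row["Position"]) <= high
    ]
    # Best-placed candidates first: they need the least movement to reach page one.
    return sorted(hits, key=lambda hit: hit[1])


if __name__ == "__main__":
    for query, position in almost_ranking(SAMPLE_CSV):
        print(f"{position:5.1f}  {query}")
```

To run it against a real export, read the file's text and pass it in place of SAMPLE_CSV. Each returned query is a candidate for its own page, as in step 5.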

Matt Redlon@mattredlon·26 Jan

@ChloeCondon @GoogleCloudTech @lifeatgoogle Amazing! Love it.

Chloe Condon@ChloeCondon·26 Jan

Sensible business woman attire that silently screams "Have you heard of Google dot com?" 💙❤️💛💚 @GoogleCloudTech ☁️

Matt Redlon@mattredlon·25 Jan

@OpenAI @AnthropicAI @Meta @GoogleAI @MistralAI "Supervised Fine Tuning (SFT)" papers cover techniques for aligning the model after pretraining using curated questions from a human labeler. An important step, but it seems to be losing out in research lately to its "big brother RL[H/AI]F" - at least based on Twitter/X activity!

Matt Redlon@mattredlon·25 Jan

"Pretrain" tends to be papers put out by the big players building foundation models (@OpenAI , @AnthropicAI, @Meta, @GoogleAI, @MistralAI, etc.). Also overlaps with breakthroughs in multimodal LMs.

Matt Redlon@mattredlon·21 Jan

Everyone: "AI is going to take over the world" AI:

Matt Redlon@mattredlon·21 Jan

@francoisfleuret I just started @AxlerLinear's book Linear Algebra Done Right and thought this same thing on page 2. I kept thinking of @thtrieu_'s recent video on AlphaGeometry though where he mentions famous proofs being a byproduct of "pulling a rabbit out of a hat". youtube.com/watch?v=TuZhU1…

François Fleuret@francoisfleuret·21 Jan

Is there an intuitive rationale for the necessity of the complex numbers to exist? Saying "we needed to solve x^2=-1" is a bit short, why not "x+1=x" ?

Matt Redlon@mattredlon·19 Jan

@karpathy @lateinteraction Another example of the flow used in FunSearch, AlphaGeometry, and @rao2z's LLM-Modulo approach. LLM generates ideas and external verifier checks them. While Andrej says "answer is constructed iteratively" you could say prompt is what is iteratively refined.
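Editor's sketch of the generate-then-verify flow described above, as a toy rather than the method of FunSearch, AlphaGeometry, or LLM-Modulo. The names toy_generator, verifier, and generate_and_verify are hypothetical, and the "LLM" is a deterministic stand-in that guesses an integer whose square is 1764. The verifier's feedback is appended to the prompt each round, so the prompt is the thing being iteratively refined.

```python
from typing import Callable, List, Optional, Tuple


def toy_generator(prompt: List[str]) -> int:
    """Hypothetical stand-in for an LLM call.

    Reads the feedback lines accumulated in the prompt, narrows the search
    range accordingly, and proposes the midpoint. A real flow would send the
    prompt to a language model here.
    """
    low, high = 0, 100
    for line in prompt:
        kind, value = line.split()
        if kind == "too_low":
            low = max(low, int(value) + 1)
        elif kind == "too_high":
            high = min(high, int(value) - 1)
    return (low + high) // 2


def verifier(candidate: int, target_square: int = 1764) -> Tuple[bool, str]:
    """External checker: an exact test plus one feedback line for the prompt."""
    square = candidate * candidate
    if square == target_square:
        return True, "ok"
    kind = "too_low" if square < target_square else "too_high"
    return False, f"{kind} {candidate}"


def generate_and_verify(
    generate: Callable[[List[str]], int],
    verify: Callable[[int], Tuple[bool, str]],
    max_rounds: int = 20,
) -> Tuple[Optional[int], List[str]]:
    """Loop until the verifier accepts a candidate or the round budget runs out."""
    prompt: List[str] = []
    for _ in range(max_rounds):
        candidate = generate(prompt)
        accepted, feedback = verify(candidate)
        if accepted:
            return candidate, prompt
        prompt.append(feedback)  # the prompt grows with verifier feedback
    return None, prompt


if __name__ == "__main__":
    answer, history = generate_and_verify(toy_generator, verifier)
    print(answer, history)
```

The generator never checks its own output. Correctness comes only from the external verifier, which is the property the tweet highlights.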

Andrej Karpathy@karpathy·18 Jan

Prompt engineering (or rather "Flow engineering") intensifies for code generation. Great reading and a reminder of how much alpha there is (pass@5 19% to 44%) in moving from a naive prompt:answer paradigm to a "flow" paradigm, where the answer is constructed iteratively.