Daswin de Silva

223 posts

Daswin de Silva

@dswnds

Academic. Artificial Intelligence. Analytics. Automation. All Noir. Alliterations

Melbourne, Wurundjeri Katılım Aralık 2020

448 Takip Edilen75 Takipçiler

Daswin de Silva retweetledi

The Conversation - Australia + New Zealand@ConversationEDU·14h

The industry is rife with AI non-events that were wildly promoted and highly anticipated, but failed to deliver. @dswnds @latrobe theconversation.com/in-a-sea-of-hy…

English

559

Daswin de Silva retweetledi

Erik Brynjolfsson@erikbryn·1 Ağu

Congrats to @ThankYourNiceAI, @sj_manning, @manlikemishap and @danielrock for their careful work on AI's labor market effects!

Ethan Mollick@emollick

Interestingly, an economics paper that came out in 2023 predicting which jobs would overlap most with AI turned out to be right. A new Microsoft study of actual AI use by workers (more on that in another post) found a 90% correlation between real world overlap & the predictions.

English

10.9K

Daswin de Silva retweetledi

The Conversation - Australia + New Zealand@ConversationEDU·27 Tem

More autonomous AI systems that can use tools and work in teams are becoming increasingly common. @dswnds @latrobe theconversation.com/ai-agents-are-…

English

1.4K

Daswin de Silva retweetledi

Andrej Karpathy@karpathy·19 Haz

Nice - my AI startup school talk is now up! Chapters: 0:00 Imo fair to say that software is changing quite fundamentally again. LLMs are a new kind of computer, and you program them *in English*. Hence I think they are well deserving of a major version upgrade in terms of software. 6:06 LLMs have properties of utilities, of fabs, and of operating systems => New LLM OS, fabbed by labs, and distributed like utilities (for now). Many historical analogies apply - imo we are computing circa ~1960s. 14:39 LLM psychology: LLMs = "people spirits", stochastic simulations of people, where the simulator is an autoregressive Transformer. Since they are trained on human data, they have a kind of emergent psychology, and are simultaneously superhuman in some ways, but also fallible in many others. Given this, how do we productively work with them hand in hand? Switching gears to opportunities... 18:16 LLMs are "people spirits" => can build partially autonomous products. 29:05 LLMs are programmed in English => make software highly accessible! (yes, vibe coding) 33:36 LLMs are new primary consumer/manipulator of digital information (adding to GUIs/humans and APIs/programs) => Build for agents! Thank you again for the invite @ycombinator and congrats again on an awesome events! I'll post some links/references in the reply.

Y Combinator@ycombinator

Andrej Karpathy's (@karpathy) keynote yesterday at AI Startup School in San Francisco.

English

222

1.2K

8.9K

1.3M

Daswin de Silva retweetledi

Andrej Karpathy@karpathy·11 May

We're missing (at least one) major paradigm for LLM learning. Not sure what to call it, possibly it has a name - system prompt learning? Pretraining is for knowledge. Finetuning (SL/RL) is for habitual behavior. Both of these involve a change in parameters but a lot of human learning feels more like a change in system prompt. You encounter a problem, figure something out, then "remember" something in fairly explicit terms for the next time. E.g. "It seems when I encounter this and that kind of a problem, I should try this and that kind of an approach/solution". It feels more like taking notes for yourself, i.e. something like the "Memory" feature but not to store per-user random facts, but general/global problem solving knowledge and strategies. LLMs are quite literally like the guy in Memento, except we haven't given them their scratchpad yet. Note that this paradigm is also significantly more powerful and data efficient because a knowledge-guided "review" stage is a significantly higher dimensional feedback channel than a reward scaler. I was prompted to jot down this shower of thoughts after reading through Claude's system prompt, which currently seems to be around 17,000 words, specifying not just basic behavior style/preferences (e.g. refuse various requests related to song lyrics) but also a large amount of general problem solving strategies, e.g.: "If Claude is asked to count words, letters, and characters, it thinks step by step before answering the person. It explicitly counts the words, letters, or characters by assigning a number to each. It only answers the person once it has performed this explicit counting step." This is to help Claude solve 'r' in strawberry etc. Imo this is not the kind of problem solving knowledge that should be baked into weights via Reinforcement Learning, or least not immediately/exclusively. And it certainly shouldn't come from human engineers writing system prompts by hand. It should come from System Prompt learning, which resembles RL in the setup, with the exception of the learning algorithm (edits vs gradient descent). A large section of the LLM system prompt could be written via system prompt learning, it would look a bit like the LLM writing a book for itself on how to solve problems. If this works it would be a new/powerful learning paradigm. With a lot of details left to figure out (how do the edits work? can/should you learn the edit system? how do you gradually move knowledge from the explicit system text to habitual weights, as humans seem to do? etc.).

English

712

10.4K

1.5M

Daswin de Silva retweetledi

MIT CSAIL@MIT_CSAIL·23 Mar

The encoder of a transformer, as explained by @rfeers.

English

199

1.1K

107.8K

Daswin de Silva retweetledi

Richard Sutton@RichardSSutton·7 Mar

I am pretty happy with this 30-minute summary of my views on the current state of AI and alignment. youtube.com/watch?v=w177Ov…

YouTube

English

103

724

122.7K

Daswin de Silva retweetledi

Richard Sutton@RichardSSutton·5 Mar

awards.acm.org/about/2024-tur… Machines that learn from experience were explored by Alan Turing almost eighty years ago, which makes it particularly gratifying and humbling to receive an award in his name for reviving this essential but still nascent idea.

English

155

336

2.8K

227.4K

Daswin de Silva retweetledi

Andrej Karpathy@karpathy·5 Şub

New 3h31m video on YouTube: "Deep Dive into LLMs like ChatGPT" This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use them in practical applications. We cover all the major stages: 1. pretraining: data, tokenization, Transformer neural network I/O and internals, inference, GPT-2 training example, Llama 3.1 base inference examples 2. supervised finetuning: conversations data, "LLM Psychology": hallucinations, tool use, knowledge/working memory, knowledge of self, models need tokens to think, spelling, jagged intelligence 3. reinforcement learning: practice makes perfect, DeepSeek-R1, AlphaGo, RLHF. I designed this video for the "general audience" track of my videos, which I believe are accessible to most people, even without technical background. It should give you an intuitive understanding of the full training pipeline of LLMs like ChatGPT, with many examples along the way, and maybe some ways of thinking around current capabilities, where we are, and what's coming. (Also, I have one "Intro to LLMs" video already from ~year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a lot more comprehensive version of this topic. They can still be combined, as the talk goes a lot deeper into other topics, e.g. LLM OS and LLM Security) Hope it's fun & useful! youtube.com/watch?v=7xTGNN…

YouTube

English

767

2.9K

20.2K

2.4M

Daswin de Silva retweetledi

The Conversation - Australia + New Zealand@ConversationEDU·19 Ara

Tighter regulation and a lack of high-quality, authentic training data are just some of the problems AI developers will be facing, according to @dswnds from @latrobe theconversation.com/ai-will-contin…

English

1.4K

Daswin de Silva retweetledi

MIT CSAIL@MIT_CSAIL·6 Ara

Happy birthday to Geoffrey Hinton, whose work in neural networks has led many to call him the "Godfather of AI." These pioneering efforts recently won him the Nobel Prize: bit.ly/3Bf92Vx Image v/Aaron Vincent Elkaim & @nytimes

English

4.4K

Daswin de Silva retweetledi

Tim Dettmers@Tim_Dettmers·12 Kas

This is the most important paper in a long time . It shows with strong evidence we are reaching the limits of quantization. The paper says this: the more tokens you train on, the more precision you need. This has broad implications for the entire field and the future of GPUs🧵

Tanishq Kumar@tanishqkumar07

[1/7] New paper alert! Heard about the BitNet hype or that Llama-3 is harder to quantize? Our new work studies both! We formulate scaling laws for precision, across both pre and post-training arxiv.org/pdf/2411.04330. TLDR; - Models become harder to post-train quantize as they are overtrained on lots of data, so that eventually more pretraining data can be actively harmful if quantizing post-training! - The effects of putting weights, activations, or attention in varying precisions during pretraining are consistent and predictable, and fitting a scaling law suggests that pretraining at high (BF16) and next-generation (FP4) precisions may both be suboptimal design choices! Joint work with @ZackAnkner @bfspector @blake__bordelon @Muennighoff @mansiege @CPehlevan @HazyResearch @AdtRaghunathan.

English

468

2.9K

696.5K

Daswin de Silva@dswnds·11 Kas

AlphaFold3 is now open source. Thanks @GoogleDeepMind @demishassabis openread.academy/en/paper/readi… github.com/google-deepmin…

English

Daswin de Silva retweetledi

Fermat's Library@fermatslibrary·6 Kas

"One must divide one's time between politics and equations. But our equations are much more important to me, because politics is for the present, while our equations are for eternity." Albert Einstein

English

114

552

46.5K

Daswin de Silva retweetledi

nature@Nature·4 Kas

Tools such as Rosetta and AlphaFold have redefined the protein-engineering landscape. But some problems remain out of reach — for now go.nature.com/3YT7zgN

English

127

403

39.2K

Daswin de Silva retweetledi

CEDA@ceda_news·16 Eki

Are prompt repositories the new frontier for #AI at work? Daswin De Silva (@dswnds), Professor of AI and Analytics at @latrobe, explores how a repository of #prompts can help organisations inform training programs, detect productivity gaps, and more. ceda.com.au/newsandresourc…

English

368

Daswin de Silva retweetledi

Jim Fan@DrJimFan·8 Eki

Hitchhiker's guide to rebranding: - Machine learning -> statistical mechanics - Loss function -> energy functional - Optimize the model -> minimize free energy - Trained model -> reached equilibrium distribution - KL divergence -> free energy difference - Gaussian noise -> random thermal fluctuations - Random step -> Brownian motion - SGD -> directional Brownian motion - GPU -> simulated particle accelerator - Diffusion models -> Langevin dynamics - Reinforcement learning -> control theory - Robotics -> physical computation - Audio learning -> 1D signal processing - Image learning -> 2D signal processing - Video learning -> 3D signal processing - Multimodal models -> multidimensional signal processing - Sora -> learned physics engine You're welcome

English

109

708

4.7K

499.4K

Daswin de Silva retweetledi

nature@Nature·8 Eki

BREAKING: This year's #NobelPrize in Physics has been awarded to John J Hopfield and Geoffrey E Hinton for "foundational discoveries and inventions that enable machine learning with artificial neural networks" Stay tuned for more from @nature

English

240

826

161.4K

Daswin de Silva retweetledi

Andrej Karpathy@karpathy·6 Eki

Not fully sure why all the LLMs sound about the same - over-using lists, delving into “multifaceted” issues, over-offering to assist further, about same length responses, etc. Not something I had predicted at first because of many independent companies doing the finetuning.

English

521

352

7.5K

3.5M

Daswin de Silva retweetledi

The Conversation - Australia + New Zealand@ConversationEDU·16 Şub

With work at our fingertips at all hours of the day, it can be hard to disconnect. AI assistants may solve the problem – at least, that’s what big tech wants us to think, writes @dswnds (@latrobe). #Echobox=1708056820-1" target="_blank" rel="nofollow noopener">theconversation.com/drowning-in-di…

English

2.2K

Keşfet

@latrobe @ThankYourNiceAI @sj_manning @manlikemishap @danielrock @ycombinator @nytimes @GoogleDeepMind