Dwrakseh Petal (@phildunphag) - Twitter Profile

@dwarkesh_sp @michael_nielsen I share my live trading alerts (entry and exit points) on WhatsApp. Join for free! ✅ ➡️ Copy the search query and reply with "555" to WhatsApp: + 13026663796 👉 🔗: api.whatsapp.com/send/?phone=13… 🎥 - Daily Live Trading 📖 - Trading Recap ☢️ - Personal Strategy

0

6

Dwrakseh Petal@phildunphag·30 Apr

@dwarkesh_sp @michael_nielsen My key strategy！ ⤵️！！

1

0

126

Dwarkesh Patel@dwarkesh_sp·30 Apr

Why @michael_nielsen disagrees with the view that science will keep getting harder and harder as low-hanging fruit is picked:

4

7

65

57.1K

Dwrakseh Petal@phildunphag·29 Apr

@dwarkesh_sp @reinerpope Free sharing of trading strategies and the latest market updates. Add me on WhatsApp ✅ ➡️ Copy search input Reply “333” to WhatsApp: + 13026663796 + My WhatsApp api.whatsapp.com/send/?phone=13…

0

42

Dwrakseh Petal@phildunphag·29 Apr

@dwarkesh_sp @reinerpope My key strategy！ ⤵️！！

1

0

479

Dwarkesh Patel@dwarkesh_sp·29 Apr

Wrote up some flashcards and practice problems to help myself retain what @reinerpope taught. Hope it's helpful to you too! Suggest more below and I'll add them. reiner-flashcards.vercel.app

Dwarkesh Patel@dwarkesh_sp

Did a very different format with @reinerpope – a blackboard lecture where he walks through how frontier LLMs are trained and served.

It's shocking how much you can deduce about what the labs are doing from a handful of equations, public API prices, and some chalk.

It’s a bit technical, but I encourage you to hang in there - it’s really worth it.

There are less than a handful of people who understand the full stack of AI, from chip design to model architecture, as well as Reiner. It was a real delight to learn from him.

Recommend watching this one on YouTube so you can see the chalkboard.

0:00:00 – How batch size affects token cost and speed
0:31:59 – How MoE models are laid out across GPU racks
0:47:02 – How pipeline parallelism spreads model layers across racks
1:03:27 – Why Ilya said, “As we now know, pipelining is not wise.”
1:18:49 – Because of RL, models may be 100x over-trained beyond Chinchilla-optimal
1:32:52 – Deducing long context memory costs from API pricing
2:03:52 – Convergent evolution between neural nets and cryptography

35

149

2.1K

237.9K

Dwrakseh Petal@phildunphag·29 Apr

@dwarkesh_sp @reinerpope I share my live trading alerts (entry and exit points) on WhatsApp. Join for free! ✅ ➡️ Copy the search query and reply with "555" to WhatsApp: + 13026663796 👉 🔗: api.whatsapp.com/send/?phone=13… 🎥 - Daily Live Trading 📖 - Trading Recap ☢️ - Personal Strategy

0

15

Dwrakseh Petal@phildunphag·29 Apr

@dwarkesh_sp @reinerpope My key strategy！ ⤵️！！

1

0

132

Dwarkesh Patel@dwarkesh_sp·29 Apr

Did a very different format with @reinerpope – a blackboard lecture where he walks through how frontier LLMs are trained and served.

It's shocking how much you can deduce about what the labs are doing from a handful of equations, public API prices, and some chalk.

It’s a bit technical, but I encourage you to hang in there - it’s really worth it.

There are less than a handful of people who understand the full stack of AI, from chip design to model architecture, as well as Reiner. It was a real delight to learn from him.

Recommend watching this one on YouTube so you can see the chalkboard.

0:00:00 – How batch size affects token cost and speed
0:31:59 – How MoE models are laid out across GPU racks
0:47:02 – How pipeline parallelism spreads model layers across racks
1:03:27 – Why Ilya said, “As we now know, pipelining is not wise.”
1:18:49 – Because of RL, models may be 100x over-trained beyond Chinchilla-optimal
1:32:52 – Deducing long context memory costs from API pricing
2:03:52 – Convergent evolution between neural nets and cryptography

152

601

6.6K

1.3M

Dwrakseh Petal@phildunphag·29 Apr

@dwarkesh_sp @AdamMarblestone I share my live trading alerts (entry and exit points) on WhatsApp. Join for free! ✅ ➡️ Copy the search query and reply with "555" to WhatsApp: + 13026663796 👉 🔗: api.whatsapp.com/send/?phone=13… 🎥 - Daily Live Trading 📖 - Trading Recap ☢️ - Personal Strategy

0

14

Dwrakseh Petal@phildunphag·29 Apr

@dwarkesh_sp @AdamMarblestone My key strategy！ ⤵️！！

1

0

72

Dwarkesh Patel@dwarkesh_sp·29 Apr

.@AdamMarblestone thinks it might cost ~low billions to map the human brain connectome. The benefit is getting answers about the brain’s secret sauce: esp. why are humans so much more sample- and energy-efficient. If labs are going to be spending trillions of dollars on compute by the end of the decade, Adam's pitch is: give him 1/100th of that to actually figure out these big questions about intelligence.

17

27

193

23.7K

Dwrakseh Petal@phildunphag·29 Apr

@dwarkesh_sp I share my live trading alerts (entry and exit points) on WhatsApp. Join for free! ✅ ➡️ Copy the search query and reply with "555" to WhatsApp: + 13026663796 👉 🔗: api.whatsapp.com/send/?phone=13… 🎥 - Daily Live Trading 📖 - Trading Recap ☢️ - Personal Strategy

0

14

Dwrakseh Petal@phildunphag·29 Apr

@dwarkesh_sp My key strategy！ ⤵️！！

1

0

169

Dwarkesh Patel@dwarkesh_sp·29 Apr

There's a quadrillion-dollar question at the heart of AI: Why are humans so much more sample efficient compared to LLMs?

There are three possible answers:

1. Architecture and hyperparameters (aka transformer vs whatever ‘algo’ cortical columns are implementing)
2. Learning rule (backprop vs whatever the brain is doing)
3. Reward function

@AdamMarblestone believes the answer is the reward function. ML likes to use pretty simple loss functions, like cross-entropy. These are easy to work with. But they might be too simple for sample-efficient learning.

Adam thinks that, in humans, the large number of highly specialised cells in the ‘lizard brain’ might actually be encoding information for sophisticated loss functions, used for ‘training’ in the more sophisticated areas like the cortex and amygdala.

Like: the human genome is barely 3 gigabytes (compare that to the TBs of parameters that encode frontier LLM weights). So how can it include all the information necessary to build highly intelligent learners? Well, if the key to sample-efficient learning resides in the loss function, even very complicated loss functions can still be expressed in a couple hundred lines of Python code.

189

168

1.9K

942.4K

Dwrakseh Petal@phildunphag·20 Apr

Sipping iced coffee, flipping through a book, and letting the world slow down—this is my kind of perfect afternoon.

0

1

0

16

Dwrakseh Petal retweeted

Derek Quick-Assistant@MadiTalbot·20 Apr

"Grad caps up, hearts full—#ClassOf2024, we did the thing! Grateful for every lecture, late-night study sesh, and inside joke. On to the next adventure! "

0

1

0

5

Dwrakseh Petal@phildunphag·18 Apr

First time reversing into a parking spot—3 tries, 1 near-miss with a cone, and finally a win! Grinning like a fool, proud of my tiny victory. #LearningToDrive #NewDriverVibes

0

1

3

Dwrakseh Petal retweeted

André Mendes@AndrMen74081447·18 Apr

"From tiny pup to adventure buddy — watch your fur baby bloom! #PetGrowth #DogMomLife"

0

1

0

11

Dwrakseh Petal@phildunphag·16 Apr

"Turned 'I can’t' into 'Watch me.' What’s YOUR next 'impossible' #ChallengeAccepted #GrowthMindset"

0

1

7

Dwrakseh Petal retweeted

謙謙@qianqia43048051·16 Apr

"Neighborly win! Shared grill, laughs, and grilled corn with Mr. Li next door—small moments make the block feel like family. Who’s your favorite neighbor "

0

1

7

Dwrakseh Petal@phildunphag·15 Apr

"10 mins daily = 1 language win. Learn 3 high-frequency phrases (e.g., 'I’m tied up', 'It’s a steal') → use them tonight! Small steps = big progress. #LingQTips #LanguageHacks"

0

2

3
