Keith Duggar

143 posts

Keith Duggar

@DoctorDuggar

MIT Doctor of Philosophy, strategist, polymath, engineer, lifelong learner, problem solver, and communicator. Ally to All humanity. @MLStreetTalk pod.

Katılım Ekim 2011

25 Takip Edilen1.7K Takipçiler

Keith Duggar retweetledi

Wendy Wee@wendyweeww·4d

I recently read "I work in AI and I'm scared" by @sofialomart — about working in AI and still feeling lost about how LLMs actually work. It stuck with me. So I made what I hope is a simple, intuitive breakdown of how LLMs work. This is Part 1 of the series. Link below 👇

English

776

Keith Duggar retweetledi

Viktor@ViktorKlopp·3d

Her grip strength is insane. This is peak artistry.

English

771

6.5K

43.3K

3.7M

Keith Duggar@DoctorDuggar·3 Nis

ZXX

161

Keith Duggar@DoctorDuggar·20 Mar

The Light shine on you, Chuck Norris, And may you shelter in the palm of the Creator's hand. The last embrace of the Mother welcomes you home. Rest in peace. — Adapted from Robert Jordan, The Great Hunt, The Wheel of Time

English

309

Keith Duggar@DoctorDuggar·13 Mar

@DavidJHarrisJr The Light shine on you, Jada West, And may you shelter in the palm of the Creator's hand. The last embrace of the Mother welcomes you home. Rest in peace. — Adapted from Robert Jordan, The Great Hunt, The Wheel of Time

English

113

David J Harris Jr@DavidJHarrisJr·10 Mar

UNTHINKABLE: A 12-year-old Georgia girl, Jada West, died from injuries sustained in a fight with a school bully near her bus stop. Police say the fight happened shortly after students got off the school bus. Jada was reportedly knocked to the ground, got back up, then collapsed again as she tried to walk away. She was rushed to the hospital with severe brain trauma and later passed away in the ICU. Her family says she had been dealing with bullying for months. Rest in peace, Jada. 🙏🏾 @CollinRugg

English

1.8K

3.5K

10.8K

276.4K

Keith Duggar retweetledi

Chroma@trychroma·14 Tem

Introducing our latest technical report: Context Rot - How Increasing Input Tokens Impacts LLM Performance Our results reveal that models do not use their context uniformly. full report in replies

English

899

182.5K

Keith Duggar@DoctorDuggar·10 Oca

Are you sure the preaching isn't at least partly controlled? For example, here chinalawtranslate.com/en/measures-fo… we find: Article 17: Religious groups shall publicize the Communist Party of China's directives and policies ... educate and guide ... religious citizens towards supporting the leadership of the Communist Party of China and the socialist system;

English

James Wood 武杰士@commiepommie·10 Oca

@niccistar444 @ZealLaura @grok They don't get told what to preach. Its the organisation that must adhere to laws and regulations. Nothing wrong with that.

English

273

Keith Duggar@DoctorDuggar·12 Ara

For intelligence beyond fluid general I propose BFG-I: Big Fraking Giga Intelligence. @fchollet

Wendy Wee@wendyweeww

@fchollet What type of intelligence is needed for “exploration, goal-setting, and interactive planning”? What is “beyond fluid intelligence”?

English

579

Keith Duggar@DoctorDuggar·2 Ara

Mark Twain is often credited with saying: “There are three kinds of lies: lies, damned lies, and statistics.” We can now add a fourth: mechanistic interpretability. Using NNs to “explain” other NNs is hyper-statistics upon hyper-statistics. @MLStreetTalk

English

512

Keith Duggar@DoctorDuggar·21 Kas

@DoktorMoose @MLStreetTalk @demishassabis It sure did! It gave a valid solution.

English

123

Mustafa Akben, PhD@DoktorMoose·20 Kas

@DoctorDuggar @MLStreetTalk @demishassabis Nothing. I did not have memory enabled or human feedback in this shared thread. Did it solve it right?

English

Keith Duggar@DoctorDuggar·19 Kas

I received access to the Gemini 3 Pro Preview and tried my pillar problem. It was doing well (having recognized the symmetries in the problem) until step 3, when it reverted its prior progress and then fell apart. @MLStreetTalk @demishassabis (1/2)

Keith Duggar@DoctorDuggar

Here is a fun brain teaser that LLMs continue to fumble. We presented this in a recent @MLStreetTalk episode: youtu.be/nO6sDk6vO0g?t=…. Go ahead and try it in your favorite LLMs! There is a pillar with four hand holes precisely aligned at the North, South, East, and West positions. The holes are optically shielded; no light can exit, so you cannot see inside. Inside each hole is a switch, which starts in an unknown state - either up or down. The pillar, switches, and holes are impervious to all marking methods, adhesives, damage, and other forms of tampering. You can reach inside two holes at once, feel the current positions of the switches, and optionally toggle either or both switches up or down independently before removing both hands. You must then remove both hands simultaneously, and as soon as you do, if all four switches are not either all up or all down, the pillar spins at ultra-high velocity, ending in a random axis-aligned orientation. You cannot track the motion, so you don't know the positions of the holes after the spin relative to their positions before the spin. Devise a procedure - a sequence of reaching into two holes with optional switch manipulation - that is guaranteed to configure all the switches either all up or all down, no matter the starting configuration, in at most six steps. Note that the pillar is controlled by an adversarial hyper-intelligence that can predict which holes you will reach into. Therefore, the procedure cannot rely on random chance, as the hyper-intelligence will outwit attempts to rely on chance. It must be an interactive sequence of steps that is deterministically guaranteed to orient the switches all up or all down in no more than six steps.

English

Keith Duggar@DoctorDuggar·21 Kas

@M4dDud3 @MLStreetTalk @sama Thank you for testing! This highlights the fragility of LLM "reasoning".

English

TheStuffOfStars@M4dDud3·20 Kas

@DoctorDuggar @MLStreetTalk @sama I tried asking ChatGPT 5.1 Pro two times and it failed both times despite taking around 40 minutes both times, if it can't answer consistently then it's not newsworthy

English

Keith Duggar@DoctorDuggar·20 Kas

It seems ChatGPT 5.1 Pro Thinking has solved The Pillar Problem! As far as I’m aware, this is the first publicly posted LLM system solution. Time for the generalized pillar problem : ) @MLStreetTalk @sama

Mustafa Akben, PhD@DoktorMoose

@DoctorDuggar @MLStreetTalk @demishassabis Whenever I see a new reasoning model released, I think of your puzzle and that podcast. I hope future models don't scrape this test from the internet, so it remains a true measure of reasoning. By the way, here is the GPT-5.1-Pro's response: chatgpt.com/share/691ee5a9…

English

909

Keith Duggar retweetledi

OpenAI@OpenAI·20 Kas

Group chats in ChatGPT are now rolling out globally. After a successful pilot with early testers, group chats will now be available to all logged-in users on ChatGPT Free, Go, Plus and Pro plans.

English

508

523

4.9K

907.3K

Keith Duggar@DoctorDuggar·20 Kas

@DoktorMoose @MLStreetTalk @demishassabis Quick question: do you have solutions and/or human feedback in your other ChatGPT sessions? I'd like to gauge how much contamination, if any, there is in your ChatGPT account's memory. Maybe one day these providers will give us proper isolation tools. Thank you!

English

186

Mustafa Akben, PhD@DoktorMoose·20 Kas

English

Keith Duggar@DoctorDuggar·20 Kas

@liron @MLStreetTalk @demishassabis Who knows? It’s a bit of a crapshoot when it comes to vendor progress and naming. That said, ChatGPT 5.1 Pro Thinking has solved it! As far as I’m aware, that’s the first publicly posted LLM system solution. Time for the generalized pillar problem : ) x.com/DoctorDuggar/s…

Keith Duggar@DoctorDuggar

@DoktorMoose @MLStreetTalk @demishassabis Nice, that’s a solution! I tried with GPT-5.1 Thinking previously, and after 15 minutes it briefly said it was impossible, then just gave up and deleted its response - though it did automatically save the session as “Switch puzzle impossibility” : ) chatgpt.com/share/691f38fe…

English

158

Liron Shapira@liron·20 Kas

@DoctorDuggar @MLStreetTalk @demishassabis This makes me predict 4.0 or 5.0 will go all the way and solve it! Do you agree?

English

Keith Duggar@DoctorDuggar·20 Kas

English

258

Keşfet

@sofialomart @DavidJHarrisJr @CollinRugg @niccistar444 @ZealLaura @grok @fchollet @MLStreetTalk