Luke Hutchison

695 posts

Luke Hutchison

Luke Hutchison

@LH

There is always a solution. PhD, MIT. Ex-Google AI research scientist.

Planet Earth Katılım Mayıs 2009
1.8K Takip Edilen2.9K Takipçiler
Luke Hutchison
You have to set #ClaudeCode's effort level to the new `max` level to get back any semblance of the old level of reasoning performance. Bascially the new `high` is the old `medium`, and the new `max` (which is not on by default) is the old `high`.
English
0
0
1
12
Luke Hutchison
If you have noticed that @claudeai (Claude Code Opus 4.6, specifically) suddenly got a lot worse at reasoning in the last few days, you need to know that every Claude Code user had their effort downgraded, without even a public post about it (@bcherny??) gist.github.com/kellan/ccf6125…
English
2
0
1
32
Luke Hutchison
@theo It only works well if you make very small incremental changes, and manually check and test everything, going through rounds of polishing before moving on to the next incremental change. It does NOT work well for big bang development.
English
0
0
0
55
Theo - t3.gg
Theo - t3.gg@theo·
Just let Opus go for over an hour on a new feature. When it was done, I asked how I can test it. 20 minutes later, it realized I can't test it because it did the whole thing entirely wrong. Idk how you guys use this model every day for real work 🙃
Theo - t3.gg tweet media
English
396
25
1.4K
258.1K
Luke Hutchison
@catalinmpit That's because Anthropic renamed the "high" effort level to "max", and renamed "medium" to "high", then bumped everyone down one level by default, in order to reduce server load. You have to set effort to max under /model to get it to work well again.
English
0
0
0
64
Catalin
Catalin@catalinmpit·
Lately, Claude makes some shocking mistakes. ⟶ Implements overly complex code ⟶ Ignores the codebase's code style ⟶ Removes working code for no reason ⟶ Replaces code that's out of scope from the task at hand It feels like it needs 100% supervision. At this point, you're better off writing everything yourself.
Catalin tweet media
English
244
32
566
58.8K
Luke Hutchison
@skdh Particles don't just act as waves, they *are* waves, propagating according to an underlying probability distribution, until some potential interaction that samples from that distribution "rolls a six", resulting in the collapse of all future possibilities to that single point.
English
0
0
1
61
Sabine Hossenfelder
The double slit experiment is the probably most misunderstood experiment ever. I have no idea who created the myth that if you 'look' at one of the slits, then the particles (photons/electrons) stop behaving as waves. It's wrong! They of course STILL behave as waves! Because particles are also waves, always. Photons and electrons make a self-interference EVEN ON A SINGLE slit. Don't believe it? Below an actual measurement from a laser diffracting on a single/double slit from Wikipedia. What happens if you measure which slit the particle goes through is that you get no interference between BOTH slits. And no, you don't need a conscious observer for this. Believe it or not, there have actually been experiments where they had people literally look at a double slit to see if that makes any difference and the answer is no, it does not. The entire mystery of the double slit is in the path of the particle TO the double slit. Because it seems that the particle must "know" whether it WILL be measured at one of the slits before it even gets there. It must "know" whether to go through both or just pick one. Seems like the future influences the past? Not really, it just means you have a consistency condition on the time evolution.
Sabine Hossenfelder tweet media
English
272
147
1.4K
110.6K
Luke Hutchison retweetledi
Lossfunk
Lossfunk@lossfunk·
🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵
English
137
259
2K
1M
Luke Hutchison retweetledi
Haytham Kaafarani
Haytham Kaafarani@hayfarani·
I am a US citizen & Surgeon who took care of the Boston Maraton Bombing victims in 2013. I paid for 7 years to own a small apartment in downtown #Beirut for my 3 kids to enjoy summers there. Today, #Israel reduced my dream home to rubble, with american weapons, paid by my taxes.
English
9K
46.4K
198.5K
9.1M
Gandalv
Gandalv@Microinteracti1·
The James Webb Space Telescope has a problem. A beautiful, maddening, keeps-you-up-at-night kind of problem. Scattered across nearly every deep image it captures are roughly 1,000 tiny red specks. They date from the universe’s first billion years. They are compact, they are bright, and after three years of serious scientific argument, nobody can agree what they actually are. Three camps have formed. The first says the dots are supermassive black holes wrapped in thick shrouds of dust, feeding voraciously in the infant cosmos. The second argues they are ancient stars in the final act of collapse. The third proposes something even stranger: direct-collapse black holes, objects that skipped the star stage entirely and simply fell straight into darkness. All three theories have problems. None fits the data cleanly. So the proposals keep coming. Dozens of them, queued up for Webb’s next observation cycle. Radio telescopes may eventually settle the argument, offering a signal that cuts through the dust and ambiguity alike. For now, the dots just sit there. Small, red, and completely unbothered by our confusion.​​​​​​​​​​​​​​​​ What do you think this means? Stay connected, Follow Gandalv @Microinteracti1
English
150
129
1.1K
195.4K
Luke Hutchison
@Math_files Even more influential in general was ALGOL, released the next year (almost all modern languages are ALGOL-like).
English
0
0
0
22
Math Files
Math Files@Math_files·
Before Python, Java, or even C++, there was Fortran—the first major programming language. In 1957, John Backus and his team at IBM developed Fortran, making coding much easier than low-level machine code. Before Fortran, programmers had to write instructions in binary or assembly—slow and complicated. Fortran changed everything. It allowed programmers to use simple, math-like commands, making it especially useful for scientists and engineers. NASA even used Fortran to help land humans on the Moon. Fortran set the stage for modern coding languages, and it’s still used today in scientific computing.
Math Files tweet media
English
208
293
2.1K
277.9K
Luke Hutchison
@FangYi11101 In practical terms, your example of summing 10 variables gives something normal-like, but it's not normal yet. The CLT depends on the number of variables summed, not the number of trials (the number of trials just affects noise levels, not the shape of the distribution).
English
0
0
1
104
Luke Hutchison
@FangYi11101 That's not how the central limit theorem works either. He *is* summing the RVs, but he is summing only *two* of them. The convolution of two rect kernels is a triangular kernel (what we see here). The CLT says this becomes normal only as you sum a large (infinite) number of RVs.
English
1
0
4
1.3K
Dudes Posting Their W’s
Dudes Posting Their W’s@DudespostingWs·
He rolled dice 10,000 times over 17+ hours and documented the results, an almost perfect normal distribution.
English
566
1.6K
27.6K
5.1M
Luke Hutchison
@HarryVincere @DudespostingWs In this case, it's not a normal distribution, it's a triangular distribution, because there are two dice with uniform probability on each number. The distribution would tend to normal only as you add more and more dice rolls together (the Central Limit Theorem).
English
0
0
4
212
Harry
Harry@HarryVincere·
I knew thats what would happen before I watched, before I read the caption. Normal distribution is everywhere, including nature, it's a law we are bound by. Also, this should inspire you to be away from the average and closer to the top. There's only few who do, but the climb, it's worth it.
English
2
2
27
13.5K
Luke Hutchison
@DudespostingWs Could have saved you 17 hours: the convolution of two rectangular kernels is a triangular kernel.
English
0
0
1
100
Luke Hutchison retweetledi
Zane Koch
Zane Koch@zanehkoch·
for a while i've had a slight fear that the bluetooth from my airpods could be frying my brain this weekend i pulled the raw data from a $30m government study of 1,679 mice blasted with cell phone radiation and reanalyzed it what i found was...not what I expected? 🧵
English
403
684
9.2K
4.3M
Luke Hutchison retweetledi
Govind
Govind@Govindtwtt·
Unpopular opinion: you actually need real coding knowledge to vibe-code properly.
Govind tweet media
English
289
134
3.5K
143.2K
Luke Hutchison retweetledi
Yam Peleg
Yam Peleg@Yampeleg·
claude code is down how’s everyone doing
English
51
5
105
10.9K