Matt Stuchlik

1.2K posts

Matt Stuchlik

@s7nfo

argmin(cycles) | x@stripe

Singapore · Joined January 2011
1.4K Following · 606 Followers
Pinned Tweet
Matt Stuchlik
Matt Stuchlik@s7nfo·
Cheat code for faster memory reads 👀
7
32
433
61.1K
@fclc (dreaming of The Good Computer)
i often forget that claude is a better performance engineer than a good 70+% of programmers. then i remember that for a significant number of programmers, enabling O3 is considered "performance engineering"
7
4
212
18.9K
Matt Stuchlik
Matt Stuchlik@s7nfo·
15% improvement on Parse Integers. Insane breakthrough.
0
0
9
285
Matt Stuchlik
Matt Stuchlik@s7nfo·
@DrPhiltill Yeah, makes sense! One thing I like about Anki that the article does not talk about is that it has an API and can therefore be used by Claude Code / Codex. You can imagine the exciting opportunities there... I suggest looking into DuChinese for listening, I find it very nice.
0
0
0
48
Phil Metzger
Phil Metzger@DrPhiltill·
This was great! Thanks for linking it. I tried using Anki but the startup was too steep for my patience and I found HanziHero which is the same idea but turnkey. My whole program is here:
HSK1
Spaced repetition work (1-2 hours per day)
HanziHero 20 new items/day
HanziHero review ~100/day
Listening & Speaking:
Hello Chinese App
YouTube
Writing: Hanzi in “HSK1 Chinese Characters Workbook for Beginners” and “HSK2…”
Grammar: Study 1 grammatical point every day on the ChineseGrammarWiki Website.
Etc. for HSK-2 & up.
1
0
1
117
Phil Metzger
Phil Metzger@DrPhiltill·
From the HelloChinese app during Mandarin practice today. 😂 It’s a great app, btw. Highly recommended.
2
0
11
1.8K
Tereza Tizkova
Tereza Tizkova@tereza_tizkova·
Looking to cowork in Marina/Pacific Heights again today and looking for new spots. What’s a good coworking cafe?
18
0
99
14K
Nitya Sridhar
Nitya Sridhar@nityasnotes·
Wong Kar-Wai supremacy. Every shot in In the Mood for Love is so distinct.
3
0
20
1.7K
Arav Kumar
Arav Kumar@AI_Arav·
solved the dwarkesh jane street puzzle would love to share notes with anyone else that's finished!
Dwarkesh Patel@dwarkesh_sp

.@collision and I interviewed @elonmusk.
0:00:00 - Orbital data centers
0:36:46 - Grok and alignment
0:59:56 - xAI’s business plan
1:17:21 - Optimus and humanoid manufacturing
1:30:22 - Does China win by default?
1:44:16 - Lessons from running SpaceX
2:20:08 - DOGE
2:38:28 - TeraFab

1
0
14
2.3K
Grant Slatton
Grant Slatton@GrantSlatton·
@sbensu Someone should make one that operates via lip-reading webcam instead of audio so I can use it in the office around coworkers
1
0
1
74
Grant Slatton
Grant Slatton@GrantSlatton·
i've reverted to my 2004 dialect of AIM shortenings with my CLI code agents, since the bottleneck is mostly my own response speed why -> y, you -> u, really -> rly, etc and all the initialisms are coming back, wdyt btw np idk idc lmk ngl i kinda love it, brings me back
6
0
46
1.9K
Phil Metzger
Phil Metzger@DrPhiltill·
One example: Deep insight into the psychology of every individual human on the planet and an utterly deep grasp of group dynamics enabling implementation of subtle, undetectable, yet fully pervasive and utterly persuasive propaganda (basically, effective mind control) to overthrow existing democratic regimes and make the world one-party rule. It will be like a strategy game of humans versus guinea pigs, where the ones with the lesser AI are the guinea pigs.
7
1
23
1.7K
Ramez Naam
Ramez Naam@ramez·
What do people believe China will do with more advanced AI chips that will harm the US?
21
4
25
7.3K
Ryan Leung 梁興盛, CFA
Ryan Leung 梁興盛, CFA@RYANHINGSHING·
Happy weekend! I went to Shenzhen HuaYang Jiu Lou (华洋酒楼) and had some truly exquisite Dim Sum. Highly recommend it if you're looking for great Dim Sum and great Chinese food. The best food in China is Guangdong’s traditional food if you ask me!
3
1
11
5.6K
Matt Stuchlik
Matt Stuchlik@s7nfo·
@davidma How useful did you find LLMs in solving the challenge (apart from implementation work, where they're obviously useful)?
1
0
1
97
djma
djma@davidma·
What worked for me:
- LastLayer is the only one with shape 48,1
- There are 48 inp slots and 48 out slots. That's 48*48 ways to construct a Block (and 48! ways to construct 48 blocks), and 48! ways to order the blocks once they are constructed.
- Computed cosine similarity between X and residual for the 48*48 ways to construct a Block. The idea is that if X is basically unrelated to inp/out, then the cosine similarity will -> zero as the dimensions -> inf. If X and inp/outp have a relationship, then there's a chance that the cosine similarity is not 0 and the chance doesn't -> 0 as dim -> inf.
- I looked at the top 48 pairs with outlier cosine similarities and to my surprise they are all disjointed which gave me a high confidence I've found the right pairings. If they weren't totally disjointed, I would have used that to reduce the search space.
- For the block ordering, which I "solved" before the Block pairing, compute how large the residual is for every block (residual norm) applied to X. The idea is that earlier blocks will contribute more to modifying X to match y_pred. But in practice it seems the reverse is true. From that permutation, brute force pairwise swaps until there's no more improvement.

What partially worked:
- Define an affinity metric between inp and out: Single-Block MSE (compute the MSE between y_pred and the y after one block and final layer).
- Hungarian algo to pair up the inp and out.

What didn't work:
- Bunch of profiling: looking at feature distribution, features correlation matrix, weights std/norm, zeros, residual norm / X norm, visualizing the layers
- Trying to incorporate the idea that the gradient should be small when y_pred ~= y_true
- Greedy construction with various orderings.
- Didn't try Sinkhorn as suggested by AI.
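The "brute force pairwise swaps until there's no more improvement" step under "What worked" can be sketched roughly as below. This is a minimal illustration, not the author's code: `refine_by_pairwise_swaps` and `toy_cost` are hypothetical names, and the toy cost stands in for whatever loss the puzzle solution actually used (the thread does not say which metric drove the swaps).

```python
from itertools import combinations
from typing import Callable, List, Sequence


def refine_by_pairwise_swaps(
    order: Sequence[int],
    cost: Callable[[List[int]], float],
) -> List[int]:
    """Keep swapping pairs of positions while some swap lowers `cost`.

    Stops at a local optimum: an ordering that no single swap improves.
    """
    best = list(order)  # copy so the caller's sequence is left untouched
    best_cost = cost(best)
    improved = True
    while improved:
        improved = False
        for i, j in combinations(range(len(best)), 2):
            best[i], best[j] = best[j], best[i]  # try the swap
            trial_cost = cost(best)
            if trial_cost < best_cost:
                best_cost = trial_cost  # keep the swap
                improved = True
            else:
                best[i], best[j] = best[j], best[i]  # undo the swap
    return best


def toy_cost(order: List[int]) -> float:
    """Hypothetical stand-in for the real loss: squared distance from sorted order."""
    return float(sum((value - position) ** 2 for position, value in enumerate(order)))


start = [3, 1, 0, 2, 4]
refined = refine_by_pairwise_swaps(start, toy_cost)
print(refined, toy_cost(refined))
```

In the puzzle setting, `cost` would presumably rebuild the network with the blocks in the candidate order and compare its output against `y_pred`. Each pass costs on the order of 48*48/2 evaluations, which is cheap next to the 48! possible orderings.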
djma@davidma

o shit, I solved Jane Street's droppedaneuralnet puzzle from the latest @dwarkesh_sp
sha1 of solution: b58600d51f2524415ddf919ea4b356a7bf4dfaec

5
2
15
2.1K
Eric S. Raymond
Eric S. Raymond@esrtweet·
@nocontextmemes The one true date stamp standard is RFC 3339 YYYY-MM-DD HH:MM:SS In software contexts I prefer the version that's one token with a T in the middle. Fixed length tokens that naturally sort in time order, yay!
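A small sketch of the sorting property mentioned above, using only Python's standard `datetime` module. The helper name `rfc3339_utc` is made up for illustration, and normalising to UTC with a trailing `Z` is an assumption of this sketch: the tweet shows only the date and time fields, whereas RFC 3339 itself also requires a time offset.

```python
from datetime import datetime, timedelta, timezone


def rfc3339_utc(moment: datetime) -> str:
    """Format an aware datetime as a fixed-length UTC stamp like 2024-01-02T03:04:05Z."""
    if moment.tzinfo is None:
        raise ValueError("expected a timezone-aware datetime")
    # Assumption of this sketch: convert everything to UTC so stamps are comparable.
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


base = datetime(2024, 2, 29, 23, 59, 58, tzinfo=timezone.utc)
moments = [base + timedelta(seconds=s) for s in (5, 0, 86400, 3)]
stamps = [rfc3339_utc(m) for m in moments]

# Plain string sorting gives the same order as sorting the datetimes themselves.
assert sorted(stamps) == [rfc3339_utc(m) for m in sorted(moments)]
# Every stamp is a single token of the same length.
assert len({len(s) for s in stamps}) == 1
```

The string sort only matches time order when all stamps share one offset, which is why the sketch converts to UTC before formatting.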
46
28
847
75.6K
@·
"Stadia service is no longer available. Thanks for playing." Jeff is not an expert. Google hired experts and burned a lot of money and couldn't achieve cloud gaming. Instead, gamers are buying 240Hz monitors and 1000Hz mice to reduce the input lag further.
Pirat_Nation 🔴@Pirat_Nation

Jeff Bezos claims that in the future, you will not buy a gaming PC. You will only rent computing power in the cloud to play games online. He called it inefficient and predicted it won't last, saying people will instead rent computing power from the cloud, much like buying electricity today.

232
1.2K
15.9K
450.2K
Surrealistship
Surrealistship@SurrealistShip·
What films are the best to have on in the background of a party on mute? Samsara? Ghost in the Shell? Playtime?
41
1
67
13.3K
Simon Willison
Simon Willison@simonw·
In case anyone's interested, that vibe-coded(ish) web browser project that Cursor released the other day does actually compile now, they fixed it up and added instructions to the README. I built it on my Mac and took these screenshots:
35
30
613
94.5K
Eric Alcaide
Eric Alcaide@eric_alcaide·
I try hard to not be a pessimist but this is so demoralizing. WTF is happening
1
1
5
511