David Harmeyer

364 posts

@SecondThread

Joined May 2013
77 Following · 1.1K Followers
David Harmeyer @SecondThread
Just spent so much time working with loss curves that when I opened up my 401k today, I panicked because I saw it going up and to the right
Replies: 0 · Reposts: 0 · Likes: 4 · Views: 85
David Harmeyer @SecondThread
Yes, looks like it's a combination of that, and not enough variation in samples that killed it. tanh() is a million times better
[image]
Replies: 0 · Reposts: 0 · Likes: 1 · Views: 96
David Harmeyer @SecondThread
World's ugliest loss curve. Is this just what you get for using sigmoid in hidden layers?
[image]
Replies: 3 · Reposts: 0 · Likes: 2 · Views: 179
David Harmeyer @SecondThread
The AI times have begun
[image]
Replies: 1 · Reposts: 0 · Likes: 2 · Views: 117
Peter Steinberger 🦞 @steipete
New @openclaw beta is up! This one again focuses on security and bug fixes, but we added a gem: TELEGRAM MESSAGE STREAMING 🚀 To update, ask your agent or run: openclaw update --channel beta
Replies: 416 · Reposts: 277 · Likes: 5.4K · Views: 516.2K
David Harmeyer @SecondThread
@tsoding At that point the only reason that'd be useful is as precompute-expensive, query-expensive dataset compression. But in the real world, data is cheap and compute is expensive, so nobody would do this unless there's some immensely tight bottleneck on transmittable data.
Replies: 0 · Reposts: 0 · Likes: 0 · Views: 39
Тsфdiиg @tsoding
Wait, this sounds incredibly useful! Can we just have a model with 0 entropy, 0 hallucinations, that just acts like a retrieval database over its training dataset? Also sounds like a great way to solve the traceability problem. Why don't the AI labs just make something like that?
[image]
Replies: 407 · Reposts: 250 · Likes: 8.8K · Views: 512.5K
David Harmeyer @SecondThread
Sleep is just "compacting conversation..." for humans
Replies: 0 · Reposts: 0 · Likes: 0 · Views: 158
David Harmeyer @SecondThread
@sciencegirl I guarantee if you give this to actual people, they will misjudge the number of steps on the way down and trip
Replies: 0 · Reposts: 0 · Likes: 2 · Views: 29
Science girl @sciencegirl
A simple invention making stair climbing easier on the body
Replies: 434 · Reposts: 615 · Likes: 13.4K · Views: 5.2M
Brie Wolfson @zebriez
I'm looking for a word for "get it-ness." Something to describe that sense you get of a person that "gets it." Usually immediately apparent. Contenders that are close but not quite it: awake, in the details/close to the metal, high-agency.
Replies: 1.3K · Reposts: 48 · Likes: 1.9K · Views: 280.8K
David Harmeyer @SecondThread
@hyhieu226 If your reasoning isn't autoregressive, it's probably rationalization, not reasoning. Maybe rationalization makes sense in some cases, but it's a lot more artistic than it is logical.
Replies: 0 · Reposts: 0 · Likes: 1 · Views: 36
Hieu Pham @hyhieu226
Naive question, so please roast me. Why don't we have diffusion reasoning models? The way humans think looks a lot more like diffusion than autoregressive.
Replies: 241 · Reposts: 71 · Likes: 2.1K · Views: 483.8K
David Harmeyer @SecondThread
github.com/Kotlin/kotlin-… is amazing, but the fact that it's not available in the extensions search is confusing. It strikes me as obviously the best way to use Kotlin, but it's so hard to find
Replies: 0 · Reposts: 0 · Likes: 2 · Views: 282
David Harmeyer @SecondThread
Many people fail because they go off to build a generic platform, and then find use cases afterwards. It's so much more compelling to find a problem, build a solution, and then after that's proven, make the solution generic.
Replies: 0 · Reposts: 0 · Likes: 3 · Views: 262
David Harmeyer @SecondThread
Looking forward to Hacker Cup Round 1, starting tomorrow morning!
Replies: 1 · Reposts: 1 · Likes: 11 · Views: 2.3K
David Harmeyer @SecondThread
Every time I remember that "m" comes before "n" in the alphabet I do a double-take
Replies: 0 · Reposts: 0 · Likes: 4 · Views: 378
David Harmeyer reposted
Charlie Kirk @charliekirk11
Good men must die, but death can't kill their names.
Replies: 1.4K · Reposts: 26.8K · Likes: 109.6K · Views: 0
David Harmeyer @SecondThread
@sama I get that rollouts are tough and it'll take a bit to iron out the bugs -- but could you guys at least provide the option to use an older model until things are smoothed out? There are 0 available GPUs, even for premium users...
[image]
Replies: 0 · Reposts: 0 · Likes: 1 · Views: 158
Sam Altman @sama
going to try live-tweeting the GPT-5 livestream. first, GPT-5 is an integrated model, meaning no more model switcher and it decides when it needs to think harder or not. it is very smart, intuitive, and fast. it is available to everyone, including the free tier, w/reasoning!
Replies: 1.1K · Reposts: 2K · Likes: 27.3K · Views: 3.7M
AmericanPapaBear™ @AmericaPapaBear
Doorbell camera catches man getting attacked by a squirrel. Have you ever seen a squirrel like this?
Replies: 9.1K · Reposts: 7K · Likes: 56.7K · Views: 12M
David Harmeyer @SecondThread
It's a hoot how many people have this perception that prompting an AI is somehow a high-variance skill that requires expertise and training. Reminds me of adults saying we "needed classes to learn Microsoft Word" when I was a kid.
Replies: 1 · Reposts: 2 · Likes: 16 · Views: 1.3K