David Kogan

172 posts

David Kogan

David Kogan

@davidkogan_

interpretability & alignment researcher, tryna make ai smarter, faster, but most importantly, safer. Building @saferintel

the matrices Katılım Şubat 2017
310 Takip Edilen47 Takipçiler
David Kogan
David Kogan@davidkogan_·
@bughuntergeek ofc! Let me know if you run into any troubles getting it set up, I'm happy to help. and thanks for the repost
English
0
0
1
17
David Kogan
David Kogan@davidkogan_·
@0xSero I’ve been working on this problem too. have a different approach to the prediction side building off some established work on expert prediction, that I think gets around the wall you’re describing. my DMs are open if you wanna chat about it, maybe work on it together
English
0
0
0
107
David Kogan
David Kogan@davidkogan_·
@thsottiaux I'm finding that my usage limits are being depleted much faster than before this reset. Anyone else experiencing this?
English
4
1
16
937
MMK
MMK@MwakaKelvin·
@thsottiaux @southphxceleb My codex just reversed the recent reset. Guys, am I the only one affected? I am on PLUS!
MMK tweet media
English
1
0
1
218
David Kogan
David Kogan@davidkogan_·
@thsottiaux my codex usage just suddenly reverted back to 0% (was at 75% less than 2 hours ago, with only one codex /goal session running on xhigh) along with my reset date now back to may 19th instead of may 23rd. is this happening to anyone else?
English
0
1
1
137
Tibo
Tibo@thsottiaux·
@emreertunc_ Yes. There were some cache issues so we cleared everything again
English
15
2
154
7.6K
Tibo
Tibo@thsottiaux·
Ship => Feedback => Ship => Feedback => Ship => Oops something not quite right => Reset => Yay => Ship => … The never ending cycle of Codex improvements
English
257
70
3.1K
102.7K
David Kogan
David Kogan@davidkogan_·
@russe66556 check the second part of this thread! built a tool that scans your computer constantly and blocks the scripts that activate the worm from running on installs. and yeah, scary as hell.
English
0
0
0
64
David Kogan
David Kogan@davidkogan_·
the shai-hulud npm worm hit 170+ packages since last september, including tanstack, mistralai, crowdstrike, & opensearch. it steals your npm tokens and github creds through postinstall scripts & then uses YOUR tokens to infect every other package you maintain. scary stuff. 1/3
English
2
0
0
426
David Kogan
David Kogan@davidkogan_·
I couldn't sleep knowing my machine might be vulnerable so I built a guard for it. one bash install, it aliases npm/pnpm/yarn to always run --ignore-scripts, scans your lockfiles against every known malicious version, and runs a background check every 5 min 2/3
English
1
0
1
119
David Kogan
David Kogan@davidkogan_·
@AntoineRSX Your site is completely down. Might want to have your agent fix that as soon as you see this.
David Kogan tweet media
English
0
0
0
169
Antoine Rousseaux
Antoine Rousseaux@AntoineRSX·
Get Hermes Agent running in less than 5 minutes. No API key, No bot token, No SSH. Just running with the skills and plugins adapted to your needs.
English
56
41
756
1.6M
David Kogan
David Kogan@davidkogan_·
@VictorTaelin @brainage19 4.5 was absurdly expensive, and if I recall had SOTA emotional intelligence and control over its speech. icr if they disclosed much about why it was so expensive, but I suppose it may have just been a very large model. maybe one day they'll oss and we'll know why ... prob not
English
0
0
1
61
Taelin
Taelin@VictorTaelin·
@brainage19 that's a good point, 4.5 was really eloquent for whatever reason why is spud not like it? what is making it incapable of communicating well
English
3
0
27
1.8K
David Kogan
David Kogan@davidkogan_·
@VictorTaelin the smarts of chatgpt 5.5 + the coding skills, personality, and writing/speaking capabilities of claude opus 4.6 + the pricing & efficiency of deepseek v4... literally the dream
English
0
0
5
544
Taelin
Taelin@VictorTaelin·
is there hope that GPT models will ever talk normally? it feels like a non-English speaking coworker. or more like I need an autistic-to-English dictionary to navigate it. it almost doesn't matter how smart the model is if the communication is that dysfunctional. really sad
English
97
9
586
44.8K
David Kogan
David Kogan@davidkogan_·
@BrianSatur24273 @chaumian I mean they're open problems for a reason. but think of it this way. for everything codex tries that DOESNT work, this reveals more and more to it about what the elusive proof must look like. what's the farthest you've gotten? down to a single case unproven? also feel free to dm
English
0
0
1
39
alon turing
alon turing@chaumian·
when i do this it just keeps attempting to prove some seemingly endless sequence of "next lemmas needed to close the gap", failing, proving a new lemma that it claims gets us closer, then concludes "i cant prove it and will not fabricate a proof. the next lemma needed is ..."
Timothy Gowers @wtgowers@wtgowers

All I did was say things like, "Yes, it would be great if you could explore that idea and see whether you can get it to work," or "Could you rewrite that argument as a LaTeX file in the style of a standard mathematical preprint?"

English
14
0
62
17.2K
David Kogan
David Kogan@davidkogan_·
@shikhr_ kimi k2.6 is brilliant. deepseek v4 pro is more economical and generally suffices for most tasks, and is occasionally somewhat faster but it varies heavily. deepseek v4 flash is great as a haiku/gpt-5.4-mini-equivalent subagent and very cheap.
English
0
0
4
1.7K
Shikhar
Shikhar@shikhr_·
Might as well use OpenCode Go since Codex limits are dead right now. Which models work best?
Shikhar tweet media
English
102
11
741
125.3K
David Kogan
David Kogan@davidkogan_·
@sama The equivalent on /goal in normal ChatGPT would be phenomenal! I've been using pro to try and solve some open conjectures but it keeps stopping short after hitting an obstruction and refining the target lemmas. This could massively accelerate math research.
English
0
0
0
434
Sam Altman
Sam Altman@sama·
what would you most like to see improve in our next model?
English
8.3K
304
9K
1.4M
Franklin
Franklin@OnlyFranklns·
@davidkogan_ @haider1 Hinton holds a distinction and doesn't believe they're exactly equivalent. There's some crossovers/similarities but different substrate and mechanisms nonetheless
English
1
0
0
22
Haider.
Haider.@haider1·
Geoffrey Hinton says AI reasoning can be real thought because language itself is a form of thinking Words let humans and AI model almost anything, but human thought goes beyond words — through images, space, and physical movement "the smarter system is the one that can use all of them"
English
44
47
250
24.8K