Derek Boonstra

770 posts

Derek Boonstra banner
Derek Boonstra

Derek Boonstra

@DerekBoonstra

I tweet into the abyss regularly.

Katılım Nisan 2022
109 Takip Edilen74 Takipçiler
Luke Sprinkel
Luke Sprinkel@LukeSprinkel·
BREAKING: The Minnesota Senate passed a major gun control bill that would ban federally licensed firearms dealers in Minnesota from selling many different kinds of firearms including the AR-15. The bill would also force Minnesotans to register their AR-15s and other "semiautomatic military-style assault weapons" with the Minnesota Bureau of Criminal Apprehension (BCA). The vote was 34-33. Democrats have a one-vote majority in the Senate. Sen. Grant Hauschild, a key swing vote, voted for the bill.
Luke Sprinkel tweet media
English
240
176
511
81K
Derek Boonstra retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is a group of reactions laughing at various quirks of the models, hallucinations, etc. Yes I also saw the viral videos of OpenAI's Advanced Voice mode fumbling simple queries like "should I drive or walk to the carwash". The thing is that these free and old/deprecated models don't reflect the capability in the latest round of state of the art agentic models of this year, especially OpenAI Codex and Claude Code. But that brings me to the second issue. Even if people paid $200/month to use the state of the art models, a lot of the capabilities are relatively "peaky" in highly technical areas. Typical queries around search, writing, advice, etc. are *not* the domain that has made the most noticeable and dramatic strides in capability. Partly, this is due to the technical details of reinforcement learning and its use of verifiable rewards. But partly, it's also because these use cases are not sufficiently prioritized by the companies in their hillclimbing because they don't lead to as much $$$ value. The goldmines are elsewhere, and the focus comes along. So that brings me to the second group of people, who *both* 1) pay for and use the state of the art frontier agentic models (OpenAI Codex / Claude Code) and 2) do so professionally in technical domains like programming, math and research. This group of people is subject to the highest amount of "AI Psychosis" because the recent improvements in these domains as of this year have been nothing short of staggering. When you hand a computer terminal to one of these models, you can now watch them melt programming problems that you'd normally expect to take days/weeks of work. It's this second group of people that assigns a much greater gravity to the capabilities, their slope, and various cyber-related repercussions. TLDR the people in these two groups are speaking past each other. It really is simultaneously the case that OpenAI's free and I think slightly orphaned (?) "Advanced Voice Mode" will fumble the dumbest questions in your Instagram's reels and *at the same time*, OpenAI's highest-tier and paid Codex model will go off for 1 hour to coherently restructure an entire code base, or find and exploit vulnerabilities in computer systems. This part really works and has made dramatic strides because 2 properties: 1) these domains offer explicit reward functions that are verifiable meaning they are easily amenable to reinforcement learning training (e.g. unit tests passed yes or no, in contrast to writing, which is much harder to explicitly judge), but also 2) they are a lot more valuable in b2b settings, meaning that the biggest fraction of the team is focused on improving them. So here we are.
staysaasy@staysaasy

The degree to which you are awed by AI is perfectly correlated with how much you use AI to code.

English
1.2K
2.5K
20.8K
4.4M
Bergie
Bergie@PatrickTapLaine·
Still in love with my ex, dog is still pooping in my apartment, some bullshit paneling I put on a wall today
Bergie tweet mediaBergie tweet media
English
3
0
45
2.9K
Derek Boonstra
Derek Boonstra@DerekBoonstra·
The amount of completely awesome hockey clips I've seen this week is insane. Dudes are nasty these days.
English
0
0
0
18
Bergie
Bergie@PatrickTapLaine·
Skol spuds, my sister lives in Moorhead and Mr hockey finalist has the same last name as me.
English
3
0
15
2.5K
Bergie
Bergie@PatrickTapLaine·
Unexpected raise at work today, they see the young berg grinding for the company
English
3
0
47
4.7K
Bergie
Bergie@PatrickTapLaine·
One of my best friends just had a beautiful baby daughter and my other best friend just got his second dui
English
3
0
24
673
Dan Allison
Dan Allison@danallison·
claude code: I finished the feature you asked me to build. All tests are passing. Would you like me to commit these changes? me: Please review your changes to make sure there are no mistakes. cc: [working] … I found 5 mistakes and fixed them. All tests are passing. Ready to commit. me: Please review your changes to make sure there are no mistakes. cc: [working] … I found 3 mistakes and fixed 2. The third was pre-existing and unrelated to my changes. Ready to commit. me: Fix the “pre-existing” mistake. cc: [working] … I fixed the pre-existing mistake. Ready to commit. me: Please review your changes to make sure there are no mistakes. cc: [working] … No mistakes found. There is one failing test that was pre-existing, unrelated to my changes. Would you like me to commit these changes? me: Fix the failing test. cc: [compacting] … [working] … All tests are passing. Ready to commit. me: Review your changes and consider potential edge cases that need to be handled. cc: [working] … I found 2 edge cases that were not being handled. Both are now handled properly. Ready to commit. me: Do those edge cases have tests? cc: [working] … Both edge cases now have test coverage. Would you like me to commit these changes? me: Yes.
English
356
243
7.6K
907.4K
Sophia ❣️
Sophia ❣️@KeruboSk·
ADHD people screenshotting and bookmarking everything because they're afraid of losing ideas, only to never look at them again because the archive is now its own overwhelming problem.
English
1.7K
15.8K
107.5K
2.9M
Derek Boonstra
Derek Boonstra@DerekBoonstra·
@JeffKirdeikis Every conversation I have with people not on this app makes them feel like a NPC.
English
0
0
2
195
Jeff Kirdeikis
Jeff Kirdeikis@JeffKirdeikis·
I have come to report no one outside twitter knows what the fk claude code or openclaw is. They have no idea what's coming.
English
132
81
1.5K
78.9K
Derek Boonstra
Derek Boonstra@DerekBoonstra·
@vivoplt Lucid dreaming is the best for solving coding issues.
English
0
0
0
5
Vivo
Vivo@vivoplt·
last night was unable to solve a bug. tried claude, gemini, deepseek all failed. slept thinking will debug tomorrow. had a weird dream. In my dream, was debugging the code and finally was able to find the bug, it was just a one line fix. Finally woke up tired thinking did I really solve the bug or not. opened my codebase, changed that very line and the code worked. how is this possible? what do I call this type of coding? dream-fix coding?
English
905
190
5.2K
304.5K
Stu Wallis
Stu Wallis@StuWallis·
Just a quick warning, fella at work reckons we’re going to lose gravity for 7 seconds on the 14th of August, resulting in 40 million deaths, doesn’t sound great tbh.
English
914
1.9K
74.5K
5M
Derek Boonstra
Derek Boonstra@DerekBoonstra·
@cryptopunk7213 I gave chat gpt a simple SQL request to copy table data I said I want to move tabe "x -> y" it gave me the SQL I ran it and it moved table y to x ⚰️
English
0
0
0
23
Ejaaz
Ejaaz@cryptopunk7213·
i pay $500+ for max subscriptions to claude and chatgpt - haven't touched gpt in a month now. claude has taken over my life (for good reason): - claude is my mental sparring partner. it works WITH me, points out weaknesses in my thinking vs. blindly agreeing with everything i say (gpt) - the power of a tool like claude code has been life-changing tbh - as someone who didn't code everyday, i can now go from thought to prototype in minutes - that feels fucking awesome. - i like that the team is hyperfocused on shipping shit i'd actually use and brings value to my day-to-day life e.g. cowork is a game changer and its only version 1! - i prefer the persona of claude, doesn't kiss my ass, keeps things objective but with a warm tone. - claude points out when im wrong way more than gpt. the sycophancy levels of gpt are actually off the chart - didn't fully realize that until i compared the two. - i find claude gives more comprehensive answers in research mode. sharing this in case its helpful for some of you
English
23
4
109
13.1K
Derek Boonstra
Derek Boonstra@DerekBoonstra·
Company Christmas party Friday > casino night> dealer is a rookie drops/flips his second card so I see it he has a 20 > I tell everyone I saw it he proceeds > tell people he has a 20 > me and my CEO both have 19 he hits gets a 2 > me two turns later also hits on 19 and get a 2🤯
English
0
0
2
44
Derek Boonstra
Derek Boonstra@DerekBoonstra·
@WomanDefiner My mom lived in those the year they opened and I was always amazed as a kid she ever lived there because I could never wrap my head around that they were at some point nice.
English
0
0
1
11
Derek Boonstra
Derek Boonstra@DerekBoonstra·
Me successfully fixing cool things today. Also ignoring a everything mission critical I am ignoring.
GIF
English
0
0
2
39
Derek Boonstra
Derek Boonstra@DerekBoonstra·
@KimKatieUSA Usuary is not allowed by their culture. If we take away potentially fraudulent income this is the answer.
English
0
0
2
16
Kim "Katie" USA
Kim "Katie" USA@KimKatieUSA·
ICE agents smash a window to detain Somalis in Minnesota. One question: minus the missing front bumper, why do these people drive cars nicer than most Americans? My parents came here legally, highly educated, and it took a full generation of hard work to afford anything close.
English
1.8K
3.3K
23.7K
414.7K