cat

144 posts

cat banner
cat

cat

@Ryan_Jarv

Seattle, WA Katılım Ekim 2011
458 Takip Edilen423 Takipçiler
cat
cat@Ryan_Jarv·
@halvarflake (which reminds me I gotta look into that still, almost forgot)
English
0
0
0
16
cat
cat@Ryan_Jarv·
@halvarflake For me it was the thing that it told me I was wrong, and I got mad, but then I slept and realized I was wrong but out of sheer coincidence I ended up being right in the end
English
1
0
0
65
Halvar Flake
Halvar Flake@halvarflake·
Example scenarios where Claude was extremely stupid in the last days: 1) Arguing that a change that moved work into multiple Python processes made GIL contention worse because the total number of CPU-seconds spent waiting for the GIL had gone up.
English
4
1
31
7.5K
Matt Brown
Matt Brown@nmatt0·
Cutting it close today...
Matt Brown tweet media
English
5
0
14
2.6K
cat
cat@Ryan_Jarv·
@glcst Granted, I suppose I can see myself in the mood for one and not the other. So maybe not exactly comparable.
English
0
0
0
7
cat
cat@Ryan_Jarv·
@glcst I think yakiniku is kinda already that tbh. Not the same, but, I have a hard time imagining anything better.
English
1
0
0
15
Glauber Costa
Glauber Costa@glcst·
Is there anything that is good in this world that the Japanese cannot perfect? Just bring them to Texas and get them BBQ as they all desire. Then just wait 5 years. And the world will know BBQ to a level unimaginable today.
English
1
0
8
823
cat
cat@Ryan_Jarv·
@Mh56199612 @brunoborges Reads more like a spec I imagine, one can be interpreted differently than the other.
English
0
0
0
17
Majd Haidar
Majd Haidar@Mh56199612·
@brunoborges Why NEVER and not Never, interpreted as a more strict order ?🤔
English
2
0
1
603
Bruno Borges
Bruno Borges@brunoborges·
Best part of Claude Code's source is this system prompt. #L43-L54" target="_blank" rel="nofollow noopener">github.com/alex000kim/cla…
Bruno Borges tweet media
English
3
2
46
9.8K
cat
cat@Ryan_Jarv·
@michael_timbs Hate how it’s so hard to know, but tbh likely also A/B testing
English
0
0
0
20
cat
cat@Ryan_Jarv·
@michael_timbs Actually probably was me now that I think about it.
English
1
0
0
22
Michael Timbs
Michael Timbs@michael_timbs·
Convinced Anthropic diverted all their compute towards agents building DMCA Takedown Notices for hosting leaked source code. Explains poor model perf
English
1
0
1
106
cat
cat@Ryan_Jarv·
Ahg I’ve forgotten how to be patient enough to do shit myself instead delegating to Claude
English
0
0
0
35
cat
cat@Ryan_Jarv·
@Noahpinion But I guess that’s just my opinion
English
0
0
0
15
cat
cat@Ryan_Jarv·
Tbh it’s really not that hard, but might need to design things around the concept for it to be useful I think
English
0
0
0
37
cat
cat@Ryan_Jarv·
Anyway, hard to see how that could work outside of specific changes and approval + review, and if it would even be useful, but still.. super interested if anyone has tried
English
1
0
0
39
cat
cat@Ryan_Jarv·
Imagine if you could request changes to a platform as a user and have them implemented and rolled out in less than an hour.
English
1
0
0
52
Matt Brown
Matt Brown@nmatt0·
@HackerOn2Wheels @rez0__ advice from the @ctbbpodcast episode helps with that. Put stuff in your Claude.md like "POC||GTFO", etc. I've also been starting to document a standard thread model via markdown I can give it to sometimes prevent it from over hyping something.
English
3
0
6
837
HackerOnTwoWheels
HackerOnTwoWheels@HackerOn2Wheels·
My biggest problem with Claude and hacking so far is it tends to be super hyperbolic. Everything is CRITICAL, makes up risks for trivial stuff. I keep having to explain to it real world risks as it wants to “finish” the job with shitty findings.
English
13
0
60
5.6K
cat
cat@Ryan_Jarv·
@hankein95 @thegrugq Cool will look into this soon. Just curious, have you seen many due to silently failing or falling back to old behavior? Idk if that would even be tracked… but I feel like that might be a thing in the near future.
English
1
0
0
121
Hanqing Zhao
Hanqing Zhao@hankein95·
We've been tracking public CVEs where AI-generated code introduced the vulnerability. vibe-radar-ten.vercel.app 50k+ advisories scanned. Dozens of confirmed cases so far. Claude Code, Copilot, Cursor, and others all show up. Common bug classes include XSS, command injection, SSRF, and path traversal. And these are just the cases that leave metadata traces. The real number is almost certainly higher. Open source, from Georgia Tech SSLab: github.com/HQ1995/vibe-se…
English
9
74
345
34.7K
cat
cat@Ryan_Jarv·
@nmatt0 Yeah. It always comes off as more of a long standing personal excuse to themselves to not try.
English
0
0
0
46
Matt Brown
Matt Brown@nmatt0·
I kinda pride myself on being able to explain technical stuff to non-technical people. There's nothing that pisses me off quite like a non-technical person assuming they won't understand a word you say, throwing up their hands and exclaiming that they were immutably born lacking the ability to understand tech.
Jack Rhysider 🏴‍☠️@JackRhysider

Thanks for mentioning me Wired! But something I want to tell you is nobody has ever complained to me the show is too technical (except you somehow?) People are lot more tech savvy than you realize. 9 year olds and grandparents listen and love it.

English
6
0
66
4.2K