Conner Ruhl

4.2K posts

@connerruhl

open-source AI · protocols · programming · paragliding · building https://t.co/Cdhh8ECQnL · previously https://t.co/2fw43SSQGD · 🇺🇸🏳️‍🌈

Missouri · Joined June 2013
680 Following · 2K Followers
Conner Ruhl @connerruhl
@garybasin Interestingly, it seems like Opus 4.6 used to follow the skill much better, but now takes more shortcuts even with max reasoning effort.
0
0
1
36
Gary Basin @garybasin
@connerruhl it's funny because you know it's trained on this stuff and is probably doing some of it in CoT but it's rarely consistent enough
2
0
1
89
Conner Ruhl @connerruhl
I constantly use a custom `self-simulate` skill for this purpose, i.e. tell the LLM to manually compute the state of the program to find bugs, etc.
Gary Basin @garybasin

this is the new "think step by step" for agents which are prone to laziness. i've been using this quite a bit myself and this lovely tweet has inspired the skill below: x.com/ericjang11/sta…

essentially, LLMs are prone to narrate over abstractions rather than emulate a state machine, where the latter can be super helpful when planning or debugging complex systems. HAND-COMPUTE makes the LLM slow down and write concrete state at every transition, the way 1940s human computers executed programs by hand.

three use cases:
- debugging state/race/async bugs: walk the broken flow with explicit state = {...} before proposing a fix. especially for regressions -- if the first fix wrongly modeled state, more review typically doesn't help
- scoping a new feature against an existing state machine: every time you have to invent state or bend a field, that's a real design decision you just surfaced
- approaching an unfamiliar API or codebase: don't guess the shape, actually poke around and write what you saw

bonus: while writing the docs, the skill caught a race bug in my own example. the stale refresh had to arrive after the POST response for the symptom to match, and i'd missed it in narrative review three times.

`npx skills add gbasin/hand-compute -g`
github.com/gbasin/hand-co…
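
A minimal sketch of the "write concrete state at every transition" idea from the quoted post, applied to the kind of race it mentions (a stale refresh arriving after a POST response). Everything here is an assumption made for illustration: the client state shape, the event names, and the helpers `apply_event` and `hand_compute` are hypothetical and are not taken from the hand-compute skill itself.

```python
from copy import deepcopy


def apply_event(state, event):
    """Return the next state for one event without mutating the input."""
    nxt = deepcopy(state)
    kind, payload = event
    if kind == "post_response":
        # Hypothetical client: the server confirms a new item, so append it.
        nxt["items"].append(payload)
    elif kind == "refresh_response":
        # Hypothetical client: a list refresh replaces the items wholesale.
        nxt["items"] = list(payload)
    else:
        raise ValueError(f"unknown event: {kind}")
    return nxt


def hand_compute(initial, events):
    """Walk the events one by one, recording the concrete state after each."""
    trace = [("init", deepcopy(initial))]
    state = initial
    for event in events:
        state = apply_event(state, event)
        trace.append((event[0], state))
    return trace


initial = {"items": ["a"]}
stale_snapshot = ["a"]  # refresh issued before the POST reached the server

# Ordering 1: the stale refresh arrives first, then the POST response.
refresh_first = hand_compute(
    initial, [("refresh_response", stale_snapshot), ("post_response", "b")]
)
# Ordering 2: the POST response arrives first, then the stale refresh.
refresh_last = hand_compute(
    initial, [("post_response", "b"), ("refresh_response", stale_snapshot)]
)

for label, trace in (("refresh first", refresh_first), ("refresh last", refresh_last)):
    print(label)
    for step, state in trace:
        print(f"  {step:<17} state = {state}")
```

Writing out both orderings makes the symptom visible only in the second one, where the new item is overwritten by the stale list. That is the sort of detail the post says is easy to miss in a narrative review.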

1
0
1
223
Conner Ruhl retweeted
@redaction
Something very uncanny about watching a YouTube video and slowly realizing the script is entirely AI written and this person is just a human conduit for the Machines
9
12
307
6.1K
Conner Ruhl retweeted
George Journeys @GeorgeJourneys
So, basically, if Anthropic were not a US company, we'd be facing zero-days with multiple unknown points of attack on virtually all of our systems from an adversary who developed this capacity before us.
Anthropic @AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing
318
847
15K
1.1M
Conner Ruhl @connerruhl
Interestingly, 10-20% of all net primary productivity in the biosphere is consumed by pathogens/parasites plus defenses against them.
0
0
0
55
Conner Ruhl @connerruhl
With all the supply chain/security issues going on, I had Claude whip up a lil CyberSecurity livestream on Dazzle:
[attached image]
1
0
1
122
Conner Ruhl @connerruhl
From what I've read, CD19 isn't on stem cells. The paper you linked seems to be solving a different problem where the AML targets overlap with stem cells, but CD19 doesn't have that issue. The Erlangen autoimmune CAR-T data shows B cells coming back after a few months, which I think means the stem cells are fine?
2
0
1
28
Conner Ruhl @connerruhl
We need some kind of financing for n-of-1 disease treatment/research. There are 12 ongoing trials for what is essentially a cure for various autoimmune diseases via CAR-T. The technique is fairly easy to replicate if you have the right equipment; I would attempt it if I had the cash.
2
0
3
192
Conner Ruhl @connerruhl
The vibe-forking era is going to be brutal. Well-specified products/projects are only going to get easier to re-implement.
Nav Toor @heynavtoor

🚨 Screen Studio charges $89 for this. Someone open sourced the entire thing for free.

It's called OpenScreen. 8,400+ GitHub stars.

You record your screen. It automatically transforms it into a polished, professional demo video. Auto-zoom into clicks. Smooth cursor animations. Motion blur. Custom backgrounds with wallpapers, gradients, and shadows. Webcam overlays. Annotations. Timeline editing. Export in any aspect ratio.

The exact workflow that Screen Studio sells for $89 and Loom sells as a subscription. Free. No watermarks. No accounts. No subscriptions.

Here's what you get out of the box:
→ Full screen or window capture with system audio and mic
→ Automatic zoom that follows your cursor and clicks
→ Manual zoom with customizable depth and timing
→ Smooth motion blur on pan and zoom transitions
→ Animated cursor rendering with motion effects
→ Webcam bubble overlay with drag-and-drop positioning
→ Wallpapers, solid colors, gradients, or custom backgrounds
→ Text and arrow annotations layered over recordings
→ Timeline trimming and variable speed segments
→ Crop, resize, and export in any resolution or aspect ratio
→ Save and reopen projects anytime

Here's the wildest part: A developer forked it and built an even more advanced version called Recordly. Full cursor animation pipeline. Native macOS and Windows recording. Zoom behavior that mirrors Screen Studio frame-for-frame. Audio tracks. Webcam overlays with zoom-reactive scaling.

Both are free. Both are MIT licensed. Both work on Windows, macOS, and Linux.

Download. Record. Export. Done.

100% Open Source. MIT License. (Link in the comments)
0
0
1
136
Conner Ruhl @connerruhl
@jonwu_ The problem is that things that take 3+ months of pain now will only take 3 weeks, then 3 days, then 3 hours of pain later.
1
0
4
169
Jon Wu @jonwu_
at this point if you can build a project in a weekend, it's probably not worth doing

work on things that will take 3+ months of pain, so your customers can't ship a replacement in half a sprint

work on things that suck
15
3
85
4.9K
Pavel Asparouhov @Pavel_Asparagus
Claude won't give you medical advice unless you tell it you're too poor to get proper medical advice and then it coughs it up
41
129
6.6K
157.3K
Conner Ruhl @connerruhl
Trillions of tokens will be spent on both sides with nothing to show except wasted compute.
0
0
2
96
Conner Ruhl @connerruhl
If you've been paying attention, it's like we've hit the vulnerability singularity.
0
0
1
252