Buck Shlegeris
1.1K posts

Buck Shlegeris
@bshlgrs
CEO@Redwood Research (@redwood_ai), working on technical research to reduce catastrophic risk from AI misalignment. [email protected]


New paper! Think Fast: Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models @METR_Evals showed that models' time horizons have doubled every few months. We ask: what length of tasks can models complete without any CoT?




Some coding scaffolds block and retry risky actions. In a new paper, we find this reveals information a malicious AI can use to bypass monitoring. Resampling without blocked actions in context is less exploitable, but techniques that help in one setting can hurt in another. 🧵






- fast like C - memory safe like Rust - fast compilation like Go - 1st class C++ support like Swift Who’s building this?







Everybody who thinks ai is conscious has to do a mandatory from scratch transformer implementation. There are only floats and multiplications.










