
SubatomicArticles
178 posts

SubatomicArticles
@OptiMiserJoe
Reliability Engineer and chronic storyteller, now working at MIRI. Opinions are my own.





IMO as a field we should: - Aim for 100% coverage - Use the most powerful models as monitors rather than dumb tiny ones - Preserve CoT monitorability - Currently expect almost 100% recall for models that use CoT effectively - Do CoT monitoring rather than just action monitoring


And the engineer, befuddled, was presented with an envelope and a handwritten calculation about the shortfall, with profuse apologies and the promise it would be correct going forward.


@ramez What are your main reasons for having a low p(doom)?







25/ What's really concerning is that OpenAI got to write its own rules, and still broke them. The models are only getting more powerful, and the competitive pressure more intense. If companies won't meet basic, self-imposed commitments now, why would we expect better later?







