CJ Wright ری ٹویٹ کیا

~50% of AI “safety” benchmarks highly correlate with compute across models.
We added “compute correlations” to our recent safetywashing paper, showing that compute is a driving force behind a lot of “safety” benchmark advances: arxiv.org/abs/2407.21792
English
























