Cam Tice
373 posts

Cam Tice
@cam_tice
Geodesic Research || Marshall Scholar || Alignment

Thanks to a generous philanthropic grant from @coeff_giving (pending final logistics), 𝘎𝘦𝘰𝘥𝘦𝘴𝘪𝘤 𝘪𝘴 𝘩𝘪𝘳𝘪𝘯𝘨 𝘧𝘰𝘶𝘳 𝘔𝘦𝘮𝘣𝘦𝘳𝘴 𝘰𝘧 𝘛𝘦𝘤𝘩𝘯𝘪𝘤𝘢𝘭 𝘚𝘵𝘢𝘧𝘧. Come build the base of alignment with us 🤖 We're a Cambridge-based AIS org. Our seminal work (alignmentpretraining.ai) showed you can bake alignment priors into base models. Applications now open: airtable.com/appuugUGFPJEy6…

🚨 New paper: AI evaluation is structurally unsuitable for continual learning (CL). To address this, evaluation should be centred on the "behavioural trajectories" that CL systems develop, with the goals of characterising possible behaviours and forecasting their evolution. 🧵


Geodesic is a new AI safety org i’m particularly excited about: they do awesome neglected work trying to figure out how to shape alignment priors for frontier AI. People should consider applying!

Geodesic is hiring Members of Technical Staff. Come align some AIs with us! We're a Cambridge-based AI safety org. Our seminal work showed you can bake alignment priors into base models. Now, we want to make base models robust to the adversarial effects of long-horizon capabilities RL. EOI (~5 mins): tally.so/r/vG4G6A

Geodesic is hiring Members of Technical Staff. Come align some AIs with us! We're a Cambridge-based AI safety org. Our seminal work showed you can bake alignment priors into base models. Now, we want to make base models robust to the adversarial effects of long-horizon capabilities RL. EOI (~5 mins): tally.so/r/vG4G6A

Geodesic is hiring Members of Technical Staff. Come align some AIs with us! We're a Cambridge-based AI safety org. Our seminal work showed you can bake alignment priors into base models. Now, we want to make base models robust to the adversarial effects of long-horizon capabilities RL. EOI (~5 mins): tally.so/r/vG4G6A

New Anthropic research: Teaching Claude why. Last year we reported that, under certain experimental conditions, Claude 4 would blackmail users. Since then, we’ve completely eliminated this behavior. How?











