Cam Tice

373 posts

Cam Tice banner
Cam Tice

Cam Tice

@cam_tice

Geodesic Research || Marshall Scholar || Alignment

Auburn, AL Katılım Nisan 2016
366 Takip Edilen268 Takipçiler
Sabitlenmiş Tweet
Cam Tice
Cam Tice@cam_tice·
Cam Tice tweet media
ZXX
0
3
14
835
Cam Tice
Cam Tice@cam_tice·
I'm extremely excited to bring people on the team who live for making the future of humanity brighter. If you know anyone who you think would be a good fit (or you think you would be), DM me!
Geodesic Research@GeodesResearch

Thanks to a generous philanthropic grant from @coeff_giving (pending final logistics), 𝘎𝘦𝘰𝘥𝘦𝘴𝘪𝘤 𝘪𝘴 𝘩𝘪𝘳𝘪𝘯𝘨 𝘧𝘰𝘶𝘳 𝘔𝘦𝘮𝘣𝘦𝘳𝘴 𝘰𝘧 𝘛𝘦𝘤𝘩𝘯𝘪𝘤𝘢𝘭 𝘚𝘵𝘢𝘧𝘧. Come build the base of alignment with us 🤖 We're a Cambridge-based AIS org. Our seminal work (alignmentpretraining.ai) showed you can bake alignment priors into base models. Applications now open: airtable.com/appuugUGFPJEy6…

English
0
1
9
1.1K
Cam Tice
Cam Tice@cam_tice·
"Calling for prudence, rigorous evaluation and even, at times, a slower pace in adopting AI does not mean opposing progress; instead, it is an exercise of responsible care for the human family." - @Pontifex I'm happy this morning to have a new teammate
English
0
0
3
75
Cam Tice
Cam Tice@cam_tice·
Great paper on continual learning by @LPacchiardi . We should be far more prepared for a world where agents learn extensively from their environments. This work seems like a solid starting point and was very helpful for conceptualizing different types of CL systems.
Lorenzo Pacchiardi@LPacchiardi

🚨 New paper: AI evaluation is structurally unsuitable for continual learning (CL). To address this, evaluation should be centred on the "behavioural trajectories" that CL systems develop, with the goals of characterising possible behaviours and forecasting their evolution. 🧵

English
0
1
12
2.2K
Cam Tice retweetledi
Elizabeth Barnes
Elizabeth Barnes@BethMayBarnes·
(3) METR (and other independent orgs, as well as safety/security teams at labs) feel woefully under-resourced compared to the scale and pace of AI development - we’re struggling to build benchmarks fast enough, keep ahead of latest capability developments, read and respond to all the safety-related claims that AI developers are making, run all the evaluations and assessments that companies + governments are asking us to, plus develop the science needed to assess risks from increasingly capable AIs.
English
3
19
366
28.3K
Cam Tice retweetledi
Caleb Parikh
Caleb Parikh@caleb_parikh·
I like Geodesic a lot. The founders are highly motivated by catastrophic risks, low ego, and particularly "for the mission". Spaces that "feel RSI" are rare, so when I moved back to Cambridge, I ran weekly "ASI futurism" dinners - half of the regulars now work at Geodesic.
Tomek Korbak@tomekkorbak

Geodesic is a new AI safety org i’m particularly excited about: they do awesome neglected work trying to figure out how to shape alignment priors for frontier AI. People should consider applying!

English
1
2
16
1.7K
Cam Tice retweetledi
Tomek Korbak
Tomek Korbak@tomekkorbak·
Geodesic is a new AI safety org i’m particularly excited about: they do awesome neglected work trying to figure out how to shape alignment priors for frontier AI. People should consider applying!
Geodesic Research@GeodesResearch

Geodesic is hiring Members of Technical Staff. Come align some AIs with us! We're a Cambridge-based AI safety org. Our seminal work showed you can bake alignment priors into base models. Now, we want to make base models robust to the adversarial effects of long-horizon capabilities RL. EOI (~5 mins): tally.so/r/vG4G6A

English
0
12
147
19.2K
Cam Tice retweetledi
Cam Tice retweetledi
Daniel Tan
Daniel Tan@DanielCHTan97·
Anthropic does a lot of alignment science. This is good, but I also wonder how much the conclusions they draw rely on baked-in assumptions about how they did pretraining, midtraining, etc. Better open-source alignment needed as a contrast point
English
10
5
87
5.2K
Cam Tice retweetledi
Geodesic Research
Geodesic Research@GeodesResearch·
Excited to see @AnthropicAI landing on improving pretraining priors as a central alignment intervention. In line with our findings in Alignment Pretraining, they find that a small amount of positive AI discourse in pretraining substantially reduces misalignment.
Anthropic@AnthropicAI

New Anthropic research: Teaching Claude why. Last year we reported that, under certain experimental conditions, Claude 4 would blackmail users. Since then, we’ve completely eliminated this behavior. How?

English
1
2
8
340
Cam Tice retweetledi
Geodesic Research
Geodesic Research@GeodesResearch·
Excited to see our work featured in Isambard-AI's case study series; kind of compute-intensive research Geodesic does wouldn't happen without infrastructure like this. Thanks @BeestonMedia and BriCS for the chance to talk about our research!
Geodesic Research tweet media
English
0
2
5
245
Cam Tice
Cam Tice@cam_tice·
I was disappointed in the rigor of this system card. It seems obvious that the safety team had extremely limited time and resources to put the report together. I hope this changes as we continue to push the frontier of intelligence and misalignment becomes harder to spot.
English
1
0
4
52
Cam Tice
Cam Tice@cam_tice·
@OpenAI has a beautiful sweet of agentic misalignment benchmarks developed with @apolloaievals. I'm saddened to see that these were not included in the GPT-5.5 system card. This is especially frustrating given a meaningful increase in misalignment compared to GPT-5.4.
Cam Tice tweet media
English
1
1
5
115
Cam Tice retweetledi
david rein
david rein@idavidrein·
“WTBU” is one of the most useful communication technologies I know of. It stands for “Watch Team Back-Up”, I believe it originated to reduce mistakes on nuclear submarines. You prepend it to a message to someone where you’re pointing out something that might be obvious, but you want to check/confirm that they’re on top of it. For example, you might say “WTBU: you checked that we’re cleared to share this info with XYZ person?” It takes the pressure/ego out of the message, by letting you communicate something to the effect of “I’m not saying this because I think you’re incompetent/dumb, so ignore this if it isn’t relevant/useful—I just really want to make sure we don’t mess up/make silly mistakes, and smart/competent people can make silly mistakes!”—except once you’ve coordinated on using the letters WTBU to communicate that, you can just say “WTBU:”. This lets you now check basic/obvious things with coworkers much more easily and with less ego/emotion, which makes it much easier to catch mistakes in advance. Worth considering adopting into your org as a standard communication practice!
English
13
58
789
77.5K
Cam Tice retweetledi
Nathan Lambert
Nathan Lambert@natolambert·
Opus 4.7 has a new tokenizer. This means it's also a new base model. Glory days of pretraining still very much going.
Nathan Lambert tweet media
English
66
126
2.5K
322.8K
Cam Tice
Cam Tice@cam_tice·
I'd recommend giving AI 2027 another pass if you're trying to get a better understanding of how the coming years play out. On the authors own guidance, reading it as AI 2029 is probably closer to the truth. But there is lots of danger in the tails :/ ai-2027.com
English
0
0
3
46
Cam Tice
Cam Tice@cam_tice·
4. AI 2027 is a year out, and the AI progress predicted is not far off. @eli_lifland and crew made concrete predictions that were easy to falsify. Predicted AI revenue is ahead of schedule (predicted 26 Billion by May 2026) and their 80% METR time-horizon is basically tracking.
English
1
0
3
72
Cam Tice
Cam Tice@cam_tice·
You should take some time to think about what short-timelines to superintelligence means for your career/research agenda/family. My main takeaways from the Anthropic release I presented at the Meridian office yesterday:
English
1
0
5
167