Peter McIntyre

154 posts

Peter McIntyre

Peter McIntyre

@pmcntyr

Founder, Trajectory: Safety evals and RL envs for frontier AI labs. Formerly @GovAI_ @learnnontrivial and @80000hours

London, UK Katılım Haziran 2016
617 Takip Edilen1.4K Takipçiler
Sabitlenmiş Tweet
Peter McIntyre
Peter McIntyre@pmcntyr·
I spent 7 years preparing to be a doctor before finally realising how much more I could help outside the hospital. So, I put together a team and built a course to help people avoid making the same mistake: non-trivial.org/courses/how-to… Some stuff I really wish I’d known... (1/20)
English
27
180
860
0
Peter McIntyre retweetledi
Peter Wildeford🇺🇸🚀
Peter Wildeford🇺🇸🚀@peterwildeford·
Opus 4.5 automated 3.75% of the tasks on the Remote Labor Index (eval composed of many diverse freelancer tasks). Performance is doubling every ~4 months so far! Could see this getting to somewhere between 8%-30% by December, which would be a big deal for the economy.
Peter Wildeford🇺🇸🚀 tweet media
Center for AI Safety@CAIS

AI agents are getting good at coding, but how close are they to automating all digital labor? New Remote Labor Index results: Opus 4.5 is able to automate 3.75% of remote labor projects, with GPT-5.2 in second place.

English
21
55
534
68.7K
Peter McIntyre retweetledi
Ethan Perez
Ethan Perez@EthanJPerez·
We’re hiring someone to run the Anthropic Fellows Program! Our research collaborations have led to some of our best safety research and hires. We’re looking for an exceptional ops generalist, TPM, or research/eng manager to help us significantly scale and improve our collabs 🧵
English
10
42
256
69.3K
Peter McIntyre retweetledi
Seth Bannon
Seth Bannon@sethbannon·
The world needs more founders building a safe and aligned AI future. We’ve added a 5050 AI track to help researchers start companies civilization needs. You’ll learn from mentors including Wojciech Zaremba (OpenAI co-founder), Ross Girshick (Vercept co-founder), Emmett Shear (Softmax co-founder), Dileep George (Vicarious co-founder), and Ronnie Chatterji (Chief Economist at OpenAI). As AI capabilities accelerate, safety struggles to keep up. Startups can help make safety the default at scale. If you’re working on safety evaluation and red-teaming, interpretability, alignment infrastructure, governance and compliance, human-AI collaboration, robustness tools, biosecurity, AI-powered cybersecurity, or something outside this list to build an aligned future -- or you want to be -- reach out. 5050, our founder foundry, is a free program to help great scientists, researchers, and engineers become great founders. We’ve helped launch 78 companies, many of which wouldn’t exist otherwise. 5050 will teach you everything you need to start a deep tech startup. You’ll also join a community of entrepreneurial scientists and engineers to navigate the founder journey together. 5050 alumni, many of whom are now founders, are eager to share their experiences and support your journey. Whether you’re validating an idea or ready to build, 5050 is for you. Applications for our US and UK cohorts are open!
Seth Bannon tweet media
English
15
17
86
44.8K
Peter McIntyre retweetledi
Lizka
Lizka@LizkaVaintrob·
Can we get AI to stabilize the world faster than it disrupts it? Even as AI poses risks, it’ll provide new tools for navigating those risks. But these tools won't develop themselves (yet!). In a new 📄, we explore what tools would help most & how to outpace broad AI progress.
Lizka tweet media
English
5
21
139
19.8K
Peter McIntyre retweetledi
William MacAskill
William MacAskill@willmacaskill·
Is AGI an “all or nothing” problem? Failure on alignment = AI takeover, and success = AI solves everything? In a new paper with @finmoorhouse we argue no. We describe the dizzying range of challenges AGI will pose, *even if* we succeed at alignment. forethought.org/research/prepa…
English
13
71
366
133.5K
Ethan Alley
Ethan Alley@EthanAlley·
any recs for a podcast search engine? my dream: search a person, get a feed of exactly all episodes they appear in across any pod.
English
1
0
0
282
Peter McIntyre retweetledi
Allan Dafoe
Allan Dafoe@AllanDafoe·
I'm proud of GoogleDeepMind/Google's v2 update to our Frontier Safety Framework. We were the first major tech company to produce an explicit risk management framework for extreme risks, and I'm glad we are continuing to push ahead on safety best practice. deepmind.google/discover/blog/…
English
3
17
116
8.8K
Peter McIntyre retweetledi
Jan Leike
Jan Leike@janleike·
Super exciting robustness result: We built a system that defends against universal jailbreaks! It has minimal increase in refusal rate and moderate inference cost.
English
87
76
1.3K
233.4K
Peter McIntyre retweetledi
Non-Trivial
Non-Trivial@learnnontrivial·
Non-Trivial is an online fellowship for high schoolers to start an impactful research or policy project. You'll join a world-class community and learn from speakers like AI pioneer Yoshua Bengio and philosopher Peter Singer. 📅 Early applications close 𝐌𝐚𝐫𝐜𝐡 𝟑𝟏𝐬𝐭!
Non-Trivial tweet media
English
1
1
6
782
Peter McIntyre retweetledi
Spencer Greenberg 🔍
Spencer Greenberg 🔍@SpencrGreenberg·
We published a piece in Scientific American today on how accurate (or inaccurate) different personality tests are! Here's a lovely chart that they made for the article based on our study results (attached). Link to the article is below:
Spencer Greenberg 🔍 tweet media
English
20
59
372
65.5K
Peter McIntyre
Peter McIntyre@pmcntyr·
"we investigated... team performance distributions by relying on 274 performance distributions including 200,825 teams (e.g., sports, politics, fire-fighters, information technology, customer service)" "only 11% of the distributions were normal" pubsonline.informs.org/doi/10.1287/or…
Ethan Mollick@emollick

Twitter talk is too focused on individual productivity. 55% of all work is done in teams And unlike individual skills, teams don’t follow a normal distribution with most teams a little worse or better. In 70% of cases it is a power law: the top 20% or so of teams are much better

English
2
0
8
392
Peter McIntyre retweetledi
Non-Trivial
Non-Trivial@learnnontrivial·
As a teenager, Isaac Newton invented calculus. Mary Shelley wrote Frankenstein at age 18. Many people miss out on opportunities during their youth because they lack confidence or knowledge. What is the best advice you would give your 16-year-old self today?
English
1
1
11
1.6K
Peter McIntyre retweetledi
Stephen Clare
Stephen Clare@stephenclare_·
Could a war get large enough to cause an existential catastrophe: human extinction or a global civilisational collapse? It seems *very* unlikely, but we can't rule out the possibility. To be more specific: I think the chance of such an event this century is between 0.05% and 2%
Stephen Clare tweet media
English
2
5
15
2.3K
Nathan 🔎
Nathan 🔎@NathanpmYoung·
What's the best speech to text app. I hear whisper is good but I don't know how to use it.
English
12
1
11
4.9K
Nathan 🔎
Nathan 🔎@NathanpmYoung·
Non-Trivial is a course for pre university students to work on improving the world. Peter is someone I'd trust and I wish I'd done this course as a teen. Good thread too.
Peter McIntyre@pmcntyr

It's a 7-week online program with a €500 scholarship for pre-university students to start an impactful research, policy, or entrepreneurial project. The deadline to apply is 𝐉𝐚𝐧𝐮𝐚𝐫𝐲 𝟐𝟗, 𝟐𝟎𝟐𝟑. Learn more and apply at: non-trivial.org

English
1
0
3
563
Peter McIntyre
Peter McIntyre@pmcntyr·
Do you have an idea you think could change the 🌍? Here’s a big mistake people often make, and 3 tips to avoid it.
English
1
5
13
3.8K