Rachel
267 posts

Rachel
@chelcott9
Studying moral minds @Harvard. Enthusiast.


Today we release our open-source tool designed to help physicists use AI for research-level discovery. Physical Superintelligence PBC is a company whose sole mission is to solve physics using AI. We are releasing Get Physics Done (GPD) as a tool to support practicing physicists in their research.



New paper out with @Scale_AI! Introducing MoReBench - the first-ever benchmark to evaluate procedural moral reasoning in LLMs. MoReBench focuses on how LLMs reason, not just what they decide. We reveal surprising gaps in frontier models' moral reasoning that scaling laws & existing benchmarks miss entirely, and encourage more research around CoT monitoring and robust capability building. This collaboration spanned @UW @nyuniversity @harvard @stanford @mit @cais & more 🧠⚖️



🚨 BREAKING: Fields medalist Terry Tao on how mathematics will change: “When these tools are perfected, we will change the way we do mathematics. If there's a drudgery or a big computation, we'll just hit it with all our technology and say: 'By Gauss, you can get from here to there,' and now we just keep going. So we can blast through all these obstacles that we avoid almost subconsciously. If you look at what we miss, it's the missed opportunities, and that percentage of the overall opportunities is huge.” Full conversation with Math, Inc.’s @jessemhan and @jdlichtman coming soon.





📣NEW PAPER! What's In My Human Feedback? (WIMHF) 🔦 Human feedback can induce unexpected/harmful changes to LLMs, like overconfidence or sycophancy. How can we forecast these behaviors ahead of time? Using SAEs, WIMHF automatically extracts these signals from preference data.

We often hear from reviewers: "what about demand effects?" So we developed a method to eliminate them. Something weird happened during testing: We couldn’t detect demand effects in the first place! (1/8)





New episode w @Lewis_Bollard - a deep dive on the surprising economics of the meat industry. 0:00:00 – The astonishing efficiency of factory farming 0:07:18 – It was a mistake making this about diet 0:09:54 – Tech that’s sparing 100s of millions of animals/year 0:16:16 – Brainless chickens and higher welfare breeds 0:28:21 – $1 can prevent 10 years of animal suffering 0:37:26 – The situation in China and the developing world 0:41:41 – How the meat lobby got a lock on Congress 0:53:23 – Business structure of the meat industry 0:57:42 – Corporate campaigns are underrated Available on YouTube, Apple Podcasts, Spotify, etc (look up Dwarkesh Podcast).









