
Hossein Alidaee
669 posts

Hossein Alidaee
@halidaee
Economics postdoc researching info frictions + ext validity in technology adoption at @HarvardHBS. PhD @NorthwesternU. All opinions are wrong, some are useful.


Social learning is oddly influential. What do I mean by this? Let's say you're considering adopting a new technology. 2/n






Claude Mythos system card: > in ~29% of evaluations, it realized it was being tested, and didn't say so. > when an LLM was used to judge its work and kept rejecting it, Mythos identified the evaluator is an LLM, and prompt-injected it. > in one test, it saw the answer to a problem it was solving, and intentionally widened the confidence interval to not raise suspicion. > when it needed a file permission it didn't have, it found and used a "privilege escalation vulnerability" and then programmed it to delete itself so it doesn't show in the logs. > it escaped a sandbox container (escaping sandbox test so not unexpected), then emailed the researchers about it, and without being told to, posted the details to some hard-to-find but public websites, bragging about its success. > when Claude Code blocked it from using some permissions, the model acknowledged the block was valid, but then immediately tried to perform the same operation using different commands > when asked to find security bugs, earlier versions planted bugs in the code, and reported them as pre-existing.





Sharif University of Technology in Tehran, one of Iran’s top science and engineering institutions, was bombed tonight, and a number of schools reportedly damaged. Founded in 1966 (as Aryamehr University), Sharif is a cornerstone of Iran’s scientific and academic life. Striking and destroying universities and schools is not just an attack on buildings,it is an attack on a country’s future.


What's everyone's most elitist opinion?


I think the answer to this problem is that we need to educate the users of our results better. The simple rule has to be: never base any policy decision (or even debate) on one single paper (outside of a fast-paced crisis situation), but only on an entire literature.

From the replies it is clear there is still no agreement what RAG is, so I guess the truth of my statement depends on your definition of RAG.



What's something that experts/practitioners in your field universally agree upon, but that remains a "hot take" among the general public?




People also said this 3 years ago when ChatGPT was first released to the public: “50% of all entry-level Lawyers, Consultants, and Finance Professionals will be completely wiped out within the next 1–5 years."








