Flauneu
96 posts


Tucker goes pro-China and says the US must submit to Beijing's communist power washingtontimes.com/news/2026/mar/…


Just spoke to @POTUS about our European allies’ unwillingness to provide assets to keep the Strait of Hormuz functioning, which benefits Europe far more than America. I have never heard him so angry in my life. I share that anger given what’s at stake. The arrogance of our allies to suggest that Iran with a nuclear weapon is of little concern and that military action to stop the ayatollah from acquiring a nuclear bomb is our problem not theirs is beyond offensive. The European approach to containing the ayatollah’s nuclear ambitions have proven to be a miserable failure. The repercussions of providing little assistance to keep the Strait of Hormuz functioning are going to be wide and deep for Europe and America. I consider myself very forward-leaning on supporting alliances, however at a time of real testing like this, it makes me second guess the value of these alliances. I am certain I am not the only senator who feels this way.
















Luis asks the right questions. We need better and bolder political leaders, who can persuade voters of the urgency of change.

Recent progress on automated proofs has convinced me that the @METR_Evals task lengths measurements plausibly generalize well beyond software engineering tasks, which I was previously skeptical of. Capabilities seem plausibly on par with a capable solver working for 1-3 hrs.


We estimate that, on our tasks, Claude Opus 4.5 has a 50%-time horizon of around 4 hrs 49 mins (95% confidence interval of 1 hr 49 mins to 20 hrs 25 mins). While we're still working through evaluations for other recent models, this is our highest published time horizon to date.







Great piece. This is exactly my criticism of China's political development over the last 10 years. It's becoming an insular society. In that process, it also becomes ignorant of the outside world, overconfident, and prideful, in other words, increasingly like the US.



We estimate that Claude Sonnet 4.5 has a 50%-time-horizon of around 1 hr 53 min (95% confidence interval of 50 to 235 minutes) on our agentic multi-step software engineering tasks. This estimate is lower than the current highest time-horizon point estimate of around 2 hr 15 min.













