
Bayesian
876 posts

@Bayesian0_0
#1 AI forecaster on Manifold Markets (and #2 across all categories) https://t.co/glexRhhFiK
I want everything to make sense


GOOGLE IS CLOSE TO STRIKING A DEAL TO FUND ANTHROPIC'S DATA CENTER, ACCORDING TO FT.


Anthropic’s new model, Capybara: “Compared to Claude Opus 4.6, Capybara achieves dramatically higher scores in software coding, academic reasoning, and cybersecurity.” According to Dario's previous interview, it might be a 10T-parameter model that cost $10 billion to train.


SCOOP: 2 new OpenAI models have popped up internally over the last couple of days... one new text model, one new image model. More soon 👀 Before that though, I wanna hear your guesses.


Under Harvard's proposed new grading policy, the 10th-best undergraduate out of 10 in a graduate-level elective would be capped at an A‑minus. For honors evaluation, they would be recorded as "0th percentile." That's not a typo.




To manage growing demand for Claude we're adjusting our 5 hour session limits for free/Pro/Max subs during peak hours. Your weekly limits remain unchanged. During weekdays between 5am–11am PT / 1pm–7pm GMT, you'll move through your 5-hour session limits faster than before.





AI companies don't disclose data about their safety teams. But this analysis suggests they're a tiny fraction of overall staff, says @parmy (via @opinion) bloomberg.com/opinion/articl…




GPT-5.4 is here. Native computer-use capabilities. Up to 1M tokens of context in Codex and the API. Best-in-class agentic coding for complex tasks. Scalable tool search across larger ecosystems. More efficient reasoning for long, tool-heavy workflows. openai.com/index/introduc…


New post: on Jan 14, I predicted that SWE time horizon by EOY would be ~24 hours. Now I think it'll be >100 hours, and maybe unbounded. For the first time, I don't see solid evidence against AI R&D automation *this year.* Link below.





