Ben McClure

206 posts

@Ben_mac2

ETLautomate. Always questioning and learning. Physics and Economics by training.

Joined November 2021

188 Following · 24 Followers

Ben McClure@Ben_mac2·2d

@DCinvestor I really want to vibe code a connection to the garmin api to feed my data into a project like this

DCinvestor@DCinvestor·2d

LLMs are a fantastic tool to use if you are starting your fitness journey. i'm using Claude, though i think most of the big models could work
create a "project" (or whatever your tool of choice calls it) and then give it some data on your starting parameters and progress over time based on your comfort level with sharing the data (can include basic info like weight, or body scans, bloodwork, etc.)
from there, it's given me helpful analysis on a semi-weekly basis i can use to tailor my program, activity levels, diet, etc. you can even have it do meal plans for you or adjust recipes so they are simpler and/or more healthy. it also serves as a useful accountability check across time
it's not a replacement for in-person personal training for me- rather, it's a supplement to keep an eye on the holistic picture and to keep pushing myself
overall, a super helpful motivational and analytic tool with a $20 per month subscription you are probably already paying

Ben McClure@Ben_mac2·22 May

@OptimusNin3 @SpaceX It’s the one that started the list

Ben McClure@Ben_mac2·22 May

@OptimusNin3 @SpaceX Dude it’s literally on my wall of horrible words, so bad

SpaceX@SpaceX·21 May

Watch Starship's twelfth flight test twitter.com/i/broadcasts/1…

Ben McClure@Ben_mac2·21 May

I directionally tend to agree. Not sure how I feel about it but that does not change the reality. Plan accordingly.

QC@QiaochuYuan

at this point it is completely untenable to believe anything along the lines of “AI can only spit out an average of the training data.” that was already only a very rough way of understanding older models pre-reasoning, it was already obsoleted by o1 which released in 2024, and now it should be obviously and conclusively dead even if you haven’t been paying close attention.
recursive self-improvement has barely even started and we are already here.
even with the recent erdos problem solves you could argue that those were cherrypicked out of a large database for being neglected by humans. that cope is no longer available
now generalize the lesson: all other arguments that there is some essential human activity forever beyond the reach of AI are also cope, these are technical problems and the will and money and talent exists and is being deployed to solve them. artificial superintelligence is not a fairy tale. assume it’s coming and plan accordingly

Ben McClure reposted

Timothy Gowers @wtgowers@wtgowers·20 May

AI has now solved a major open problem -- one of the best known Erdos problems called the unit distance problem, one of Erdos's favourite questions and one that many mathematicians had tried. openai.com/index/model-di…

Ben McClure@Ben_mac2·20 May

19/05/2026

DCinvestor@DCinvestor

i'll be blunt with you: there's a greater than 80% chance capitalist systems will not survive the rise of AI/robotics
it's not capitalism if the only way to obtain new capital is to already own existing capital. it simply won't work. we're not at that point yet, but we are getting there quickly
and there's a greater than 50% democracies won't survive it either. with more centralized control of capital comes the inevitable desire to capture and control political systems
more meaningful democracies could come back into vogue one day after a cataclysm (assuming humans still matter at all at that point) and a return to principles and more ancient wisdom, but we seem far from that point today. indeed, our systems are already substantially corrupted and this is increasingly obvious

Ben McClure@Ben_mac2·2 May

@dampedspring 4 guesses

Andy Constan@dampedspring·2 May

🌎 May 2, 2026 🌍 🔥 34 | Avg. Guesses: 8 ⬜🟥🟩 = 3 globle-game.com #globle lucky!

Ben McClure reposted

NASA's Kennedy Space Center@NASAKennedy·22 Apr

The planet can spell your name – literally. 🔤🌍 This Earth Day, see your name written in landscapes captured by Landsat: go.nasa.gov/4ak4Cdu

Ben McClure@Ben_mac2·17 Apr

@KelseyTuoc Do you have memories turned on or did it search the web?

Kelsey Piper@KelseyTuoc·17 Apr

I have a bunch of secret AI benchmarks I only reveal when they fall, and today one did. I give the AI 1000 words written by me and never published, and ask them who the author is. They generally give flattering wrong answers (see ChatGPT, below:)

Ben McClure@Ben_mac2·17 Apr

@pvncher I especially felt this testing it with making visuals in excel vs 4.6

eric provencher@pvncher·16 Apr

4.7 is performing so poorly for me that I think it has to be a bug, potentially with the tokenizer change. It just doesn’t see text the same way
the model is a lot less reliable as a result. I’m convinced it’s a regression on image comprehension too.

Ben McClure@Ben_mac2·16 Apr

I stand corrected

Ben McClure@Ben_mac2·16 Apr

“Particularly on later turns in agentic settings” ie they are disincentivizing users (making it more expensive) for the model to run for long periods autonomously. Human guardrails too it seems

Ben McClure@Ben_mac2·16 Apr

Please note that “Opus 4.7 uses an updated tokenizer… the same input can map to more tokens—roughly 1.0–1.35×... Second, Opus 4.7 thinks more at higher effort levels, particularly on later turns in agentic settings.” Token burner, not sig. different better benchmarks

Claude@claudeai

Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.
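
As a rough illustration of the quoted "roughly 1.0–1.35×" tokenizer figure, here is a minimal Python sketch of how a baseline token count would scale. The function name `token_range` and the 1,000-token baseline are hypothetical; only the multiplier range comes from the quote, and the extra thinking tokens it also mentions are not modeled.

```python
def token_range(baseline_tokens: int, low: float = 1.0, high: float = 1.35) -> tuple[int, int]:
    """Scale a baseline token count by the quoted multiplier range.

    The low/high defaults are the "roughly 1.0-1.35x" figure from the quote;
    everything else here is an illustrative assumption.
    """
    return round(baseline_tokens * low), round(baseline_tokens * high)


if __name__ == "__main__":
    lo, hi = token_range(1_000)  # hypothetical 1,000-token input
    print(f"Same input on the updated tokenizer: {lo} to {hi} tokens")
```

Under these assumptions, the same input costs anywhere from no extra tokens to about a third more, before any additional reasoning tokens are counted.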

Ben McClure@Ben_mac2·16 Apr

@claudeai “Particularly on later turns in agentic settings” ie they are disincentivizing users (making it more expensive) for the model to run for long periods autonomously. Human guardrails too it seems

Claude@claudeai·16 Apr

Ben McClure@Ben_mac2·16 Apr

@claudeai “Opus 4.7 uses an updated tokenizer… the same input can map to more tokens—roughly 1.0–1.35×... Second, Opus 4.7 thinks more at higher effort levels, particularly on later turns in agentic settings.” We’re burning more tokens for a ~marginally better model~

Ben McClure@Ben_mac2·14 Apr

@claudeai This would be great if the models didn’t get nerfed

Claude@claudeai·14 Apr

We've redesigned Claude Code on desktop. You can now run multiple Claude sessions side by side from one window, with a new sidebar to manage them all.

Ben McClure@Ben_mac2·9 Apr

@martin_casado I would love to hear a coherent rebuke of this, maybe I’ll try to steelman it later

martin_casado@martin_casado·9 Apr

It's only a matter of time before only the model creators have access to the most powerful models. The rest get access to smaller, distilled versions. Or access the models through first party apps and services that don't provide direct access to the token path. The investment needs for training are too high, and distillation too effective to warrant any other future.

Ben McClure@Ben_mac2·9 Apr

I would love to hear a coherent rebuke of this, maybe I’ll try to steel man it when I have time

martin_casado@martin_casado

Ben McClure reposted

Michiel Bakker@bakkermichiel·7 Apr

🚨📄 New preprint! We find the “boiling the frog” equivalent of AI use. In a series of RCTs, we show that after just 10 min of AI assistance people perform worse and give up more often than those who never used AI. w Grace Liu @brianchristian Mira Dumbalska and Rachit Dubey 🧵

Ben McClure reposted

NASA@NASA·7 Apr

Hello, Moon. It’s great to be back. Here’s a taste of what the Artemis II astronauts photographed during their flight around the Moon. Check out more photos from the mission: nasa.gov/artemis-ii-mul…
