Ben McClure

206 posts

@Ben_mac2

ETLautomate. Always questioning and learning. Physics and Economics by training.

Joined November 2021
188 Following · 24 Followers
Ben McClure (@Ben_mac2)
@DCinvestor I really want to vibe code a connection to the garmin api to feed my data into a project like this
Replies 0 · Reposts 0 · Likes 1 · Views 119
DCinvestor (@DCinvestor)
LLMs are a fantastic tool to use if you are starting your fitness journey. i'm using Claude, though i think most of the big models could work
create a "project" (or whatever your tool of choice calls it) and then give it some data on your starting parameters and progress over time based on your comfort level with sharing the data (can include basic info like weight, or body scans, bloodwork, etc.)
from there, it's given me helpful analysis on a semi-weekly basis i can use to tailor my program, activity levels, diet, etc. you can even have it do meal plans for you or adjust recipes so they are simpler and/or more healthy. it also serves as a useful accountability check across time
it's not a replacement for in-person personal training for me- rather, it's a supplement to keep an eye on the holistic picture and to keep pushing myself
overall, a super helpful motivational and analytic tool with a $20 per month subscription you are probably already paying
Replies 12 · Reposts 0 · Likes 38 · Views 7K
Ben McClure retweeted
Timothy Gowers (@wtgowers)
AI has now solved a major open problem -- one of the best known Erdos problems called the unit distance problem, one of Erdos's favourite questions and one that many mathematicians had tried. openai.com/index/model-di…
Replies 72 · Reposts 608 · Likes 3.6K · Views 1.5M
Ben McClure retweeted
NASA's Kennedy Space Center (@NASAKennedy)
The planet can spell your name – literally. 🔤🌍 This Earth Day, see your name written in landscapes captured by Landsat: go.nasa.gov/4ak4Cdu
[Image]
Replies 2.4K · Reposts 27.9K · Likes 184.2K · Views 56.1M
Ben McClure (@Ben_mac2)
@KelseyTuoc Do you have memories turned on or did it search the web?
Replies 1 · Reposts 0 · Likes 0 · Views 3.5K
Kelsey Piper (@KelseyTuoc)
I have a bunch of secret AI benchmarks I only reveal when they fall, and today one did. I give the AI 1000 words written by me and never published, and ask them who the author is. They generally give flattering wrong answers (see ChatGPT, below:)
[Image]
Replies 62 · Reposts 97 · Likes 2.2K · Views 445.1K
Ben McClure (@Ben_mac2)
@pvncher I especially felt this testing it with making visuals in excel vs 4.6
Replies 0 · Reposts 0 · Likes 0 · Views 1.2K
eric provencher (@pvncher)
4.7 is performing so poorly for me that I think it has to be a bug, potentially with the tokenizer change.
It just doesn’t see text the same way
the model is a lot less reliable as a result.
I’m convinced it’s a regression on image comprehension too.
Replies 35 · Reposts 9 · Likes 337 · Views 65.5K
Ben McClure (@Ben_mac2)
I stand corrected
[Image]
Replies 0 · Reposts 0 · Likes 0 · Views 6
Ben McClure (@Ben_mac2)
“Particularly on later turns in agentic settings” ie they are disincentivizing users (making it more expensive) for the model to run for long periods autonomously. Human guardrails too it seems
Replies 1 · Reposts 0 · Likes 0 · Views 13
Ben McClure (@Ben_mac2)
Please note that “Opus 4.7 uses an updated tokenizer… the same input can map to more tokens—roughly 1.0–1.35×... Second, Opus 4.7 thinks more at higher effort levels, particularly on later turns in agentic settings.” Token burner, not sig. different better benchmarks
Quoting Claude (@claudeai):
Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.
Replies 1 · Reposts 0 · Likes 0 · Views 60
Ben McClure (@Ben_mac2)
@claudeai “Particularly on later turns in agentic settings” ie they are disincentivizing users (making it more expensive) for the model to run for long periods autonomously. Human guardrails too it seems
Replies 0 · Reposts 0 · Likes 0 · Views 8
Claude (@claudeai)
Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.
[Image]
Replies 4.7K · Reposts 10.2K · Likes 81K · Views 13.9M
Ben McClure (@Ben_mac2)
@claudeai “Opus 4.7 uses an updated tokenizer… the same input can map to more tokens—roughly 1.0–1.35×... Second, Opus 4.7 thinks more at higher effort levels, particularly on later turns in agentic settings.” We’re burning more tokens for a ~marginally better model~
Replies 0 · Reposts 0 · Likes 0 · Views 34
Ben McClure (@Ben_mac2)
@claudeai This would be great if the models didn’t get nerfed
Replies 0 · Reposts 0 · Likes 0 · Views 7
Claude (@claudeai)
We've redesigned Claude Code on desktop. You can now run multiple Claude sessions side by side from one window, with a new sidebar to manage them all.
Replies 2.1K · Reposts 3.2K · Likes 42.7K · Views 6.1M
Ben McClure (@Ben_mac2)
@martin_casado I would love to hear a coherent rebuke of this, maybe I’ll try to steelman it later
Replies 1 · Reposts 0 · Likes 0 · Views 1.4K
martin_casado (@martin_casado)
It's only a matter of time before only the model creators have access to the most powerful models. The rest get access to smaller, distilled versions. Or access the models through first party apps and services that don't provide direct access to the token path. The investment needs for training are too high, and distillation too effective to warrant any other future.
Replies 108 · Reposts 68 · Likes 868 · Views 492.1K
Ben McClure retweeted
Michiel Bakker (@bakkermichiel)
🚨📄 New preprint! We find the “boiling the frog” equivalent of AI use. In a series of RCTs, we show that after just 10 min of AI assistance people perform worse and give up more often than those who never used AI. w Grace Liu @brianchristian Mira Dumbalska and Rachit Dubey 🧵
[Image]
Replies 27 · Reposts 242 · Likes 739 · Views 137.9K
Ben McClure retweeted
NASA (@NASA)
Hello, Moon. It’s great to be back. Here’s a taste of what the Artemis II astronauts photographed during their flight around the Moon. Check out more photos from the mission: nasa.gov/artemis-ii-mul…
[4 images]
Replies 9.9K · Reposts 173.9K · Likes 809.3K · Views 29.8M