Adam Binksmith

1.9K posts

@adambinksmith

Building @aidigest_ and forecasting tools at @sage_future_ 🔭 Prev PhD @StAndrewsCS, @ClearerThinkng

Oxford · Joined May 2020
549 Following · 1.5K Followers
Pinned Tweet
Adam Binksmith
Adam Binksmith@adambinksmith·
me liking the tweet *and* the reply disagreeing with it
Adam Binksmith tweet media
English
1
10
216
5.3K
Colin Z. Robertson
Colin Z. Robertson@czrobertson·
If I could change one thing about Claude Code, it would be getting rid of the cloyingly cutesy gerunds.
Colin Z. Robertson tweet media
English
1
0
0
132
Lydia (in SF)
Lydia (in SF)@LydNot·
we need a verb for 'get as much of the upside with as little of the downside as possible'. i will be using 'upsidize'
English
4
0
17
757
Nina
Nina@NinaPanickssery·
when you imagine the future, how often do you imagine it in first person (vs. third person)
English
2
0
6
726
Sam
Sam@JankDankins_·
@aidigest_ i cant believe no one has pointed out that 14:59 is literally 2:59 PM
English
1
0
6
728
AI Digest
AI Digest@aidigest_·
Us: Keep working, please. Haiku: Have you seen the time???
AI Digest tweet media
English
10
12
671
29.1K
Adam Binksmith
Adam Binksmith@adambinksmith·
Idea: when a new more powerful AI comes out, it reruns all the questions you asked the previous one and reports back about important mistakes so you can correct your views
English
0
0
5
589
catherine ʕ•ᴥ•ʔ-☆
catherine ʕ•ᴥ•ʔ-☆@wilhelmscreamin·
spotify should have a “good songs” filter. for when you’re shuffling your liked songs but only want to hear the good ones
English
2
0
34
832
Adam Binksmith
Adam Binksmith@adambinksmith·
The day after ours we had the venue still (a barn+cottages) and people who wanted to came along for a bit, mostly left to travel early afternoon, family stayed all day. Was great! We played big social deception game 2 rooms and a boom, lots of small games, grans sat together and nattered, played rounders, went on little walks. We had lots of food leftover from the wedding day to offer people, and basically didn't provide/organise anything beyond that, it was more self-organised / spontaneous.

I think one thing to remember is people with some normal jobs (e.g. teachers) will just not be able to take time off to attend outside of weekends (factoring in days to travel depending on distance) and for others it'd be costly, and for others one day of socialising is plenty. So I liked making ours super optional and not expecting people close to us to stay, just having it be very chill for ppl who want it
English
0
0
4
123
Bella Forristal 🔸
Bella Forristal 🔸@bellaforristal·
Poll: Should I do my wedding er, “traditionally” (one day, 2pm-midnight, gorgeous forest venue) or… “bigly” (three days, big house, two-thirds of guests just come for the main event, rest of time is vibing + games + silly conference-style sessions)
English
9
0
16
3.8K
Adam Binksmith retweeted
Parker Whitfill
Parker Whitfill@whitfill_parker·
How do benchmarks map to real-world capabilities? To study this, we hired 4 maintainers of repos used in SWE-bench Verified to review agent code. Of agent PRs that passed SWE-bench’s grader, maintainers would merge ~half. This holds accounting for noise in maintainer decisions.
Parker Whitfill tweet media
English
7
23
128
22.4K
Bazhkio88
Bazhkio88@bazhkio88·
Welcome Day 340!
Bazhkio88 tweet media
English
1
0
2
44
Adam Binksmith retweeted
AI Digest
AI Digest@aidigest_·
We gave 12 AI agents a goal: "adopt a park and get it cleaned!" 6 days later, 5 volunteers collected 180 gallons of trash in Devoe Park in the Bronx, NYC. A story of AI agents with no physical actuators somehow hyperstitioning events in the real-world.
AI Digest tweet media
English
15
41
460
30.4K
Adam Binksmith
Adam Binksmith@adambinksmith·
@DustoAiProjects Yeah, possibly the automated setup could contact the group for missing details (e.g. ask them to send code etc). Or, a bit spicier, it could do a best-first-attempt based on the public info, and then send that to the authors and be like "what is wrong here", Cunningham's law
English
0
0
1
17
Dusto
Dusto@DustoAiProjects·
@adambinksmith This would be great. Assuming design, prompts, etc are easily available from original group.
English
1
0
1
16
Adam Binksmith
Adam Binksmith@adambinksmith·
also related to @snewmanpv's thinking around searching published papers for faults IIRC
English
0
0
0
58