Tim Dingman

5.2K posts

Tim Dingman banner
Tim Dingman

Tim Dingman

@TimDingmanLive

Autoexigetical

New York, NY เข้าร่วม Mart 2010
25 กำลังติดตาม509 ผู้ติดตาม
ทวีตที่ปักหมุด
Tim Dingman
Tim Dingman@TimDingmanLive·
What the fuck is going on here
Tim Dingman tweet mediaTim Dingman tweet mediaTim Dingman tweet mediaTim Dingman tweet media
English
1
0
3
169
Tim Dingman
Tim Dingman@TimDingmanLive·
@shreyasnsharma @Muennighoff 100%. In fact it would be strange *not* to see pass@k improvements at these values of k, as Yue et. al. also show. You'd also need to show a graph of entropy imo if you want to disprove elicitation hypothesis
Tim Dingman tweet media
English
0
0
1
67
Tim Dingman
Tim Dingman@TimDingmanLive·
@kalomaze Although there is a difference between "can recall this accurately" and "is currently salient"
English
0
0
1
42
kalomaze
kalomaze@kalomaze·
i am proactively summarizing manually by hand extremely deep into sessions because of my almost superstitious instinctual fear of context rot, even though i'm pretty sure that the agent is not actually getting meaningfully worse at doing stuff
English
5
0
41
1.6K
kalomaze
kalomaze@kalomaze·
1m context opus has cleared my skin, my crops are flourishing, etc
English
7
2
221
5.9K
Tim Dingman
Tim Dingman@TimDingmanLive·
Just saw my first LLM-generated typo in years. GPT-5.4 xhigh (in Codex) called some modules "high-lelevance"
English
0
0
0
38
Tim Dingman
Tim Dingman@TimDingmanLive·
What are we even doing here Claude
Tim Dingman tweet media
English
0
0
0
17
Tim Dingman
Tim Dingman@TimDingmanLive·
@peterwildeford Once Nvidia releases Nemotron 3 Ultra I think it'll be worth tracking them. They're getting serious about commoditizing their complement
English
0
0
4
619
Peter Wildeford🇺🇸🚀
Peter Wildeford🇺🇸🚀@peterwildeford·
Based on the data I see, I think: - Anthropic🇺🇸/Google🇺🇸/OpenAI🇺🇸 all ~tied - Meta🇺🇸 / xAI🇺🇸 each ~7mo behind - Moonshot🇨🇳/- Deepseek🇨🇳 / zAI 🇨🇳 / Alibaba🇨🇳each ~9mo behind - Mistral🇫🇷 ~1.5 years behind - No other companies competitive
Ethan Mollick@emollick

Both xAI and Meta seem to be falling behind, based on the Grok 4.2 benchmarks and this reporting. Frontier AI models are really a three way race at this point.

English
268
191
3.2K
1.1M
toucan
toucan@distributionat·
I think we are looking at the mother of all supply shocks and the market is completely mispriced due to active manipulation. But the price of crude oil will spike soon.
English
4
0
31
2.8K
Tim Dingman
Tim Dingman@TimDingmanLive·
@weeklytreeman @kalomaze Hit a certain conversation length periodically and then summarize. Maybe if the conversation is confusing or you have to make notes for third parties. But not in natural conversation, even over many hours
English
2
0
1
42
kalomaze
kalomaze@kalomaze·
the funniest thing about claude compaction is that the agent can be constantly invoking a specific ssh alias for the entire transcript, your compaction instruction can mention "include knowledge of which ssh", and it will still... not include the ssh alias in the compaction...
English
17
1
128
8.4K
Luxun's alt
Luxun's alt@weeklytreeman·
@kalomaze summarization is openai coded. i can't quite put words to it, but it felt intuitive that anthropic does not care about claude doing it well
English
1
0
2
180
toucan
toucan@distributionat·
That's not the crazy part, the crazy part is when models get distilled small enough that cyber-agents can copy themselves to compromised systems. Survive and spread is a *feature* for cyber-weapons. They will be *designed* to go autonomous. Hope we solve alignment by then!
English
4
1
6
327
Rick Ross
Rick Ross@RickRossTN·
I’m focusing more on clusters of RTX 6000 Pro gpus for my app’s inference.
English
1
0
0
81
Rick Ross
Rick Ross@RickRossTN·
I'm considering selling my mint condition Mac Studio M3 Ultra with 512 GB unified ram, 2 TB storage, 10 Gbe and AppleCare+ until 1/11/2029. I think this is a backordered configuration. Anyone interested?
English
3
0
1
180
Tim Dingman
Tim Dingman@TimDingmanLive·
@distributionat If history has ended, why does Francis Fukuyama look so old now? Checkmate, liberal democrats
English
0
0
1
16
toucan
toucan@distributionat·
Please, can we bring back the end of history!
toucan tweet media
English
1
0
7
309
Tim Dingman
Tim Dingman@TimDingmanLive·
Cancelled my ChatGPT subscription and uninstalled the app. Always liked Claude better anyway, even when it sucked
English
0
0
1
63
Peter Wildeford🇺🇸🚀
Peter Wildeford🇺🇸🚀@peterwildeford·
Kalshi could be a force for good in the world but instead they have decided to lean into the most evil part of themselves As a regular Kalshi user this makes me very angry (pro tip: You are not going to make $280/week predicting weather)
Nigel Eccles@nigeleccles

It is clear to me that @kalshi is going down the same path as Juul, and if they don’t pull back it is going to have the same conclusion. For those that don’t remember, Juul was one of a number of the main vaping brands in the 2010s. It took a product that had a social good (helping smokers quit) but then aggressively pushed it into a new market, non smokers and particularly kids. (See the similarity?) The backlash took time to build but when it did it was devastating for the company. I’ve worked in the online gaming industry for over 25 years, all over the world. This type of marketing is actually extremely rare in real money gaming. Firstly and most importantly it is rare because operators view it as highly unethical. It might surprise you that a lot of people in the gaming industry do actually care about things like underage and problem gambling. Secondly it is also rare because it doesn’t work. Do you think the teenagers in these ads are going to keep playing when they lose all their rent money? The only other company I can think of that pushed this type of advertising was Skillz who aggressively pushed the “second income” line. Check out their share price if want to see how that worked out for them.

English
15
4
149
10.3K
Tim Dingman
Tim Dingman@TimDingmanLive·
Why have I never seen it noted that Ilya is Russian for Elijah, the biblical prophet?
English
1
0
0
43
Tim Dingman
Tim Dingman@TimDingmanLive·
Legibility is low status
English
0
0
0
28