Tom

153 posts

Tom banner
Tom

Tom

@middleagedc0der

Open source memory AI assistant, https://t.co/Rvxg8vmN5a

Katılım Şubat 2022
77 Takip Edilen11 Takipçiler
Luca
Luca@lucadilollo·
@WorldWideWob also kept Gobert off the floor without a chance to comfortably set defence
English
1
0
1
771
Rob Perez
Rob Perez@WorldWideWob·
LOVE that decision by Mitch Johnson to not call timeout. Almost every Wolves player was grabbing their shorts after those chaotic turnover sequences, they anticipated it coming, and it results in Champagnie getting one of the best looks of the game to win it. Just didn't fall.
English
96
216
7.3K
216K
Dystopian Barbie
Dystopian Barbie@DystopianBarbie·
@Timodc @piersmorgan Honestly, that's a great skill right there. He doesn't have to tell Russel, this is embarrassing. Everyone can see! And now we even have a time stamp of how long it took for this clown to find the text in his favourite book so important to him.
English
1
0
0
7.2K
Tom
Tom@middleagedc0der·
@TheSpeculator0 Also leaked all their source code lol
English
0
0
0
171
Speculator
Speculator@TheSpeculator0·
yeah bro we reached AGI, we shipped a fucking bug that nuked every session that went >1h between responses and no one caught it for a month but yeah our models are so strong we can't release them anymore.
Speculator tweet media
English
15
27
663
36.6K
Tom
Tom@middleagedc0der·
@noahsloss @TJHSSTParents but to imply they were decisive or even particularly important to the outcome is disingenuous . The recall won 70%-30%. The idea that e.g. Trump's crypto czar is a powerful player in a city where Trump got 17% of the vote is silly
English
1
0
0
24
Tom
Tom@middleagedc0der·
@noahsloss @TJHSSTParents Pro-recall donations were at least 5-10x those of anti-recall even if you remove contributions from Tan/Sacks/Arthur Rocks. There were 1,000 volunteers, endorsements from the mayor, former school board members, many others. Yes tech elite supported it -
English
1
0
0
19
Tom
Tom@middleagedc0der·
@TJHSSTParents @noahsloss That’s not to discount anything about how Thiel/Musk/crypto grifter ideology is described here, but to say the school board recall in SF was some kind of astroturfed tech elite thing is not remotely accurate.
English
1
0
0
27
Tom
Tom@middleagedc0der·
@TJHSSTParents @noahsloss This isn’t really an accurate reading of what drove the SF school board recall. COVID shutdown, school renaming during shutdown, Lowell high school admissions changes, and anti Asian racist remarks from a school board member pissed off a lot of people
English
1
0
0
26
Tom retweetledi
Pope Leo XIV
Pope Leo XIV@Pontifex·
When simulation becomes the norm, it weakens the human capacity for discernment. As a result, our social bonds close in upon themselves, forming self-referential circuits that no longer expose us to reality. We thus come to live within bubbles, impermeable to one another. Feeling threatened by anyone who is different, we grow unaccustomed to encounter and dialogue. In this way, polarization, conflict, fear and violence spread. What is at stake is not merely the risk of error, but a transformation in our very relationship with truth.
English
1.6K
14K
82.2K
8.7M
Tom
Tom@middleagedc0der·
@sevensixfive What devilry is the lamp up to this time
Tom tweet media
English
0
0
1
33
Tom
Tom@middleagedc0der·
@mylordcod @provisionalidea Less of a problem in a startup where there’s not legacy/undesirable scope that someone just has to take… more of a problem when 40% of existing work needs to find a new home
English
0
0
2
48
codicular
codicular@mylordcod·
@provisionalidea I think the thing they didn't discuss is how the network of DRI's will take shape. Ie: what constitutes owning a problem? How big? how do you split them when in conflict etc Given roelof, it sounds like they are trying to take early PayPal's system and deliver it at scale.
English
3
0
2
197
James Rosen-Birch ⚖️🕊️
James Rosen-Birch ⚖️🕊️@provisionalidea·
I have a lot of *thoughts* on the Dorsey piece, but tonight I’ll just reiterate — while I very much love that people are starting to care about org design again, 1) human context is not the same thing as LLM context, and environments of ubiquitous surveillance and documentation do not miraculously transmute one into the other 2) reinventing the flat org for the umpteenth time will not magically make its flaws go away just because you plug in AI. I know tech perennially fantasizes about flat orgs and ‘firing all the managers’ once every two to three years, but there are much cooler and more impactful ways to redesign your org around AI that actually optimize for what the AI’s good at as opposed to trying to force AI to resurrect an undead fantasy 3) strategy, planning, resource delegation, coordination, assignment, advocacy, conflict resolution, mentorship, accountability, and decision-making under uncertainty (collectively: management) makes up a distinct skillset and area of expertise that becomes *more* valuable in an agentic world, not less. when you give every engineer ten agents to assign work to, what you’ve done is turned those engineers into managers of digital workers. this then *increases* the administrative and coordination burden of the org geometrically despite headcount remaining stable. 4) the exciting thing about AI in this moment is that it can empower people to make faster, better-informed decisions, not that you can hand your decisions off to a machine to make them in your stead.
jack@jack

our lead independent director @roelofbotha and i wrote about the history of organizational structures, and our intent to rebuild block as a mini-AGI. x.com/jack/status/20…

English
11
15
189
33.7K
rat king 🐀
rat king 🐀@MikeIsaac·
narrative for a while was "apple is behind on AI" — which was true when siri was a bust after trying in house but now they're in a position where they're not spending the insane amounts of capex the other hyperscalers are AND using their models for siri dumb like a fox?
Mark Gurman@markgurman

BREAKING: Apple is planning to open up Siri to run any AI service via their App Store apps as part of iOS 27, dropping ChatGPT as the exclusive outside partner in Apple Intelligence and Siri. bloomberg.com/news/articles/…

English
30
34
707
107.7K
Tom
Tom@middleagedc0der·
@RealJimChanos I just flew on a plane and there wasn’t any AI and the pilots were bored as shit with no infotainment.. very bullish on the Nikola guy doing planes
English
0
0
0
24
Ben Grinspan
Ben Grinspan@BennyGrin·
@conner_omalley If Lorne Michaels wasn't 200 years old this is what would be on SNL and the show would be actually relevant again
English
9
9
2.2K
85.7K
Tom retweetledi
Here's What I Reckon:
Here's What I Reckon:@angryaboutbikes·
Conspiracy theory: this isn't the launch of a new feature, it's the soft launch of accurate pricing for Claude, and Anthropic are testing the waters to see when the time is right to drop the full bombshell.
Claude@claudeai

Code Review optimizes for depth and may be more expensive than other solutions, like our open source GitHub Action. Reviews generally average $15–25, billed on token usage, and they scale based on PR complexity.

English
36
107
3.5K
250.5K
Tom
Tom@middleagedc0der·
@CarioSZNN @NBA Not to mention the grab before the push off
English
0
0
0
115
CarioSZN
CarioSZN@CarioSZNN·
@NBA Sga uses his literal arm every push off and there is no call we gotta get some regulation over here
English
18
0
98
12.5K
NBA
NBA@NBA·
JOKIĆ AND SHAI DELIVER AN ENDING FOR THE AGES 🚨 Thunder top Nuggets in a GAME-OF-THE-YEAR contender 🍿
English
383
3.8K
29.1K
1.5M
Tom retweetledi
John Loeber 🎢
John Loeber 🎢@johnloeber·
it's strange to see the world of the past fade before my eyes from 2012 through 2024, I wrote code in long sessions of sitting in vim -- sometimes typing, mostly thinking, flipping between different terminals, making changes, looking at errors, googling, reading stackoverflow... I took pride in carrying in my head these towering abstractions. I knew every nook and cranny of my business logic, like a neighborhood you live in. I felt extra fast when tab-completing a single long variable name. Nice. I placed every parenthesis, every semicolon, myself. Hundreds of thousands of them. And like a great wave washing over your sandcastle on the beach, it is now all gone. Engineering will never again be as it once was. What's especially significant about it to me is that there's barely a record of the way it was: I've spent thousands of hours writing software, and I don't think there's a single video recording of me doing it. I remember how it was: the long breaks of meditative silence, the frustration of hunting a particularly tricky bug, the relief and joy in solving it, the expressions of taste and cleverness that come with any manual craft. But it's hard to communicate how it was to someone who has never experienced it. As with all histories, the narrative is lacking in depth: you really had to be there.
judah@joodalooped

some of you fail to understand why the coding by hand people are mad being a programmer writing code in your favourite text editor was a way to take a meditative holiday while at work now that time is being taken away, to the employer’s benefit and your loss

English
60
105
1.8K
162.6K
lowbie
lowbie@archivepilled·
Introducing: Number Research Inc. At Number Research Inc., we are attempting to find and document all* available numbers. This is a volunteer-lead research position, where anyone is able to contribute. Simply type a number in, and we'll check if we've got it. If we have, no worries, just try another. If it is a new number, then thank you for your hard work!
lowbie tweet media
English
307
255
4K
401.6K
Tom
Tom@middleagedc0der·
@scaling01 (Claude indeed does well at this)
English
0
0
0
10
Lisan al Gaib
Lisan al Gaib@scaling01·
He's back with an improved "BullshitBench V2" Anthropic models are still dominating everything
Lisan al Gaib tweet media
Peter Gostev@petergostev

BullshitBench v2 is out! It is one of the few benchmarks where models are generally not getting better (except Claude) and where reasoning isn't helping. What's new: 100 new questions, by domain (coding (40 Q's), medical (15), legal (15), finance (15), physics(15)), 70+ model variants tested. BullshitBench is already at 380 starts on GitHub - all questions, scripts, responses and judgements are there so check it out. TL;DR: - Results replicated - @AnthropicAI latest models are scoring exceptionally well - @Alibaba_Qwen is another very strong performer - OpenAI and Google models are not doing well and are not improving - Domains do not show much difference - rates of BS detection are about the same across all domains - Reasoning, if anything, has negative effect - Newer models don't do that much better than older ones (except Anthropic) Links: - Data explorer: petergpt.github.io/bullshit-bench… - GitHub: github.com/petergpt/bulls… Highly recommend the data explorer where you can study the data and the questions & sample answers.

English
38
60
1K
241.4K