notebook enthusiast
5.7K posts

notebook enthusiast
@enthusednotebk
prove the existence of discomfort
six feet under Присоединился Kasım 2022
580 Подписки295 Подписчики
notebook enthusiast ретвитнул
notebook enthusiast ретвитнул

Introducing a new, stupid website to find a piece of classical music whose duration most closely matches that of your next trip.
busundreu.com

English

met an absolutely cracked quant the other day
and what i’ve noticed talking to people like this (genuinely brilliant, deeply analytical, almost frighteningly intelligent) is that they’re usually incredibly lonely inside
because most people can’t be in a conversation with them
their thoughts are too advanced and too technical
a normal conversation just doesn’t have the bandwidth for what’s actually going on in their head so they’ve learned to compress it, hide it, or just stop sharing altogether
but when you actually give them the space (genuine interest, real willingness to follow them wherever the thought goes) something shifts
you can see the relief
almost like they’ve been carrying this entire world inside them that’s never had anywhere to go and suddenly there’s somewhere for it to go
these have honestly been some of the most profound conversations i’ve had with people
not because i’m the smartest person in the room (i’m not), but because i can hold the space for it and follow the thread - fill in the pieces when they’re struggling to articulate something they’ve never had to put into words before
the amount of stuff stuck inside people that never comes out simply because nobody around them has the capacity to receive it
that’s the part that is so exciting for someone like me, it’s like a new discovery that’s contagious
English
notebook enthusiast ретвитнул

Radar graphs are among the worst ideas in data visualization. The whole point of them is to show the area and you can usually reorder the labels freely in order to create a desired dramatic effect.
Two versions of the same graph:
- left one tells the story that AI is rapidly replacing whole industries
- right one shows the "jaggedness" and reinforces the idea that humans will always have something that AI won't be able to replicate


Andrew Curran@AndrewCurran_
Striking image from the new Anthropic labor market impact report.
English

Some personal news:
- Finished another trip around the sun today 🫡
- Decided to join @PrimeIntellect to work on evals!! There’s a lot to be build and do couldn’t imagine a better team to do just that 🙌
- I will be in SF the next two weeks :) Just to look around, of course 👀
English
notebook enthusiast ретвитнул

Just got this DM from a follower:
Hey dude, I need to vent this to someone who gets it. I've been at this Big Tech company (you know the one) for almost 6 years now—senior SWE, TC around $350k last year with RSUs still vesting. Thought I was bulletproof after surviving the 2023-2024 bloodbaths and then pivoting hard into the AI org. But fuck, the ground is shifting under my feet faster than I can keep up.
Last week in our all-hands, leadership was bragging about how the team's "AI leverage ratio" hit 4.2x—meaning each engineer is now shipping what used to take a team of four. They showed the metrics: feature velocity up 180% YoY while headcount's down another 22% since Q4 '25. The slide literally had a photo of Cursor + Claude Sonnet 4 workflows replacing entire squads. Everyone clapped like trained seals, but I saw three faces go pale—they're the mid-level folks who just finished documenting their entire codebase for the "knowledge distillation" project.
My direct report, this solid L5 who joined right after me, got put on a 30-day PIP after his productivity dashboard dipped below the new AI-augmented benchmark. The benchmark? It's literally what the offshore team in India hits using the exact prompts he used to write. He trained them on our internal style guide last quarter—now they're outperforming him at $28/hour all-in. He told me privately he's burning through savings and eyeing real estate licensing because "at least houses don't get refactored by agents overnight."
The internal job board is a ghost town. Entry-level SWE roles? Frozen since mid-'25. What few postings go up are tagged "AI-native preferred" and get 2,000+ apps in hours, mostly from people already on H-1Bs or contractors. Meanwhile, they're quietly converting more mid-tier positions to "AI orchestration" contractors—$90-110/hour remote from LATAM or Eastern Europe, no benefits, 6-month contracts. My manager admitted in 1:1 that if the next Grok/Claude/Anthropic release closes the last 10-15% quality gap, we'll probably cut another layer.
I'm hanging on because I'm one of the ones who owns the prompt libraries and fine-tuning pipelines now. They need humans to babysit the models until the self-improving loops actually work without constant human intervention. But I see the writing: every time we make the system more autonomous, we make our own roles more optional. The alumni Slack is full of 2024-2025 grads DMing for coffee chats because their referrals bounce—67% underemployed or gigging according to the last poll. One kid I mentored last year is back living with parents after burning through his signing bonus.
I used to tell people "just upskill in AI, you'll be fine." Now I feel like a fraud saying it. If I lost this tomorrow, I'd be competing with the same offshore talent I've been helping scale, plus a flood of recently "managed out" seniors. My emergency fund is decent, but the mortgage isn't. Thinking about side hustles in trades or something offline—plumbing, electrical, anything that can't be prompted away.
This feels like watching the industry eat itself from the inside while pretending it's evolution. You still feeling secure over there, or is it hitting your shop too? Need to hear I'm not going insane.




English

@whitfill_parker do you think algorithmic progress measured by nanogpt speedrun is representative of algorithmic progress as a whole?
English
notebook enthusiast ретвитнул

@holdmycovfefe12 this tweet was authored by an octopus
English
notebook enthusiast ретвитнул
notebook enthusiast ретвитнул

@scaling01 dunno if you've seen this, but might be useful to you - they're saying the ceiling is 95% at least
x.com/jyangballin/st…
John Yang@jyangballin
Across all mini-SWE-agent + <model> runs, SWE-bench Verified's current "ceiling"? - 87.4% (0.874 - 0.8) * 500 = another *37* instances that aren't solved consistently. If you recalculate this number across all official SWE-bench Verified submissions? - 95% from SWE-bench site
English

What's the upper bound on SWE-Bench-Verified?
I yoinked a bunch of EpochAI data and analyzed it.
However, I only got to download 21 files, now it seems im permanently blocked from downloading anything lol
44/500 = 8.8% of SWE-Bench-Verified questions are either unsolvable or still too hard for even the best models like Opus 4.6 Thinking
With that we can say that the upper-bound is at least 91.2% on SWE-Bench Verified.
The hardest subset of question seems to be pylint-dev and astropy.
There were 3 questions that were only solvable by Gemini 3.0 Pro, 2 questions only solvable by GPT-5.1 and 2 questions only solvable by Kimi-K2.5
You will see the GPT-5.1 rows and columns are very bright, meaning there is almost no overlap in failures between GPT-5.1 and the other models, which is why they reran it.
Its pass rate is still pretty high at 65.9% but I guess it failed a lot of the easy tasks which is why the rows and columns are bright. The two tasks that were only solvable by GPT-5.1 might be buggy.




English

@rayhanadev 😌😌 please wish us luck on devpost devpost.com/software/freak…
English

















