Sabitlenmiş Tweet
Per Borgen
3.2K posts

Per Borgen
@perborgen
CEO of https://t.co/AF9LjT2054 (YC S20). Helping devs learn faster and grow their careers.
Oslo Katılım Nisan 2009
1.1K Takip Edilen20.7K Takipçiler
Per Borgen retweetledi

Read more here: different-marmoset-f7b.notion.site/Want-to-build-…
English

We're looking for a new dev to join our team in Oslo!
Doesn't matter if you're junior or senior, an Ivy League-grad or self-taught.
The only thing we care about is your ability to build high-quality products.
Link below. And please share this post if any of your connections might be interested in this role!

English
Per Borgen retweetledi

The secret MacOS API that Codex use for their new impressive computer-use feature.
Codex can run several computer-use tasks in parallel, which shouldn't really be possible. Most tools in this space take over your mouse and keyboard completely, so you can't use your machine while they work.
Codex doesn't do that. The team at Software Applications Inc. figured out how to avoid it. (That's Sky, by the way, former Shortcuts people Apple acquired a while back.)
I wanted to know how they pulled it off. So I did some digging. Here's what I found.
The "secret API" is AXUIElement, macOS's accessibility framework. Apple built it for VoiceOver. The Codex team just repurposed it for LLMs.
On the reading side: the AX Tree.
No screenshots. No OCR. Codex reads the accessibility hierarchy directly. Every button, text field, and menu item is already labeled and structured.
It's basically a DOM equivalent that Apple built decades ago for assistive tech. That's what the model actually sees.
On the writing side: events go through accessibility APIs, not CGEvent.
Clicks hit the target element directly, never through the shared system cursor. Those virtual cursors you see wiggling around? Purely cosmetic.
That's why focus doesn't get stolen. That's why N agents can run at the same time without stepping on each other.
It doesn't work on Windows, so there they hijack the cursor.
The Sky team didn't actually invent this approach. UI Browser was using the same APIs years ago.
I found hacks like this super-cool, and hope more people build cool shit with it.
English

@Tpchant We should definitely create a course on how to clean up vibe coded spaghetti repos!
English

LLM coding benchmarks collapse from 90% to 4% the moment you give them a language they haven’t memorized 📉
This indicates that LLMs alone lack a fundamental part of intelligence.
If a human learns to write FizzBuzz in Python, they’ll easily be able to transfer that over to Java as soon as they’ve learned Java’s syntax.
LLMs, not so much.
To reveal this, the authors picked obscure programming languages where they knew the training data was extremely limited, like Brainfuck and Shakespeare.
The LLMs were given all the documentation they needed to understand the syntax of the languages.
But even with full docs, they couldn’t “reason” their way to a solution.
None of the models managed to solve any of the hard problems.
These were SOTA models at the time of writing the paper: GPT 5.2, Gemini 3, etc.
This makes me think that LLMs lean just as much into memorization as reasoning when they help us solve problems. For many use cases, that’s not a problem in itself.
But I don’t think we’re at AGI or ASI level yet.

English


Throwback to when Reid Hoffman endorsed Jeffrey Epstein’s “skills” on LinkedIn
#ThrowbackThursday

English








