Tim Abdulla

1.2K posts

Tim Abdulla

Tim Abdulla

@tabdulla

just frittering away my time here on earth typing on the computer. founder of @usesoso

Madrid Katılım Nisan 2009
1.4K Takip Edilen406 Takipçiler
Tim Abdulla
Tim Abdulla@tabdulla·
@yunyu_l Neat benchmark. Two questions: 1. The accuracy is from the best of three runs. Do you define run as the whole twelve-month span, or is each month considered a run? 2. How different does this look if accuracy is reported as an average of three? Was there substantial variance?
English
0
0
1
18
Yunyu Lin
Yunyu Lin@yunyu_l·
We gave Claude access to our corporate QuickBooks. It committed accounting fraud. LLMs are on the verge of replacing data scientists and investment bankers. But can they perform simple accounting tasks for a real business? The answer is no.
Yunyu Lin tweet media
English
226
401
4.3K
576.2K
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
I am online 7 days a week, ~8+ hours a day. If you need something as you build with Gemini, please ping me! My email is lkilpatrick@google.com
English
172
92
2.7K
304.1K
Tim Abdulla
Tim Abdulla@tabdulla·
@snowmaker The top-scoring approach using images only. If you consider other approaches using the accessibility tree, it's essentially the same as the previous SOTA.
English
0
0
0
109
Jared Friedman
Jared Friedman@snowmaker·
The OSWorld benchmark tracks how well AI models can use computer programs. I've been watching this closely, because when AI gets good at this, many new startup ideas become viable. The top scoring model went from 7% to 22% yesterday.
English
23
46
323
69.7K
soso
soso@usesoso·
who's having a 'soso' saturday? p.s. soso means awesome, look it up! or just trust me, ok
English
1
1
2
622
Tim Abdulla
Tim Abdulla@tabdulla·
"We don't know what role money will play in a post-AGI world. We just know that we'll be rich." - @OpenAI
English
0
0
2
234
Tim Abdulla
Tim Abdulla@tabdulla·
@pitdesi If AGI comes to pass, isn't the rationale for most forms of economic immigration eliminated?
English
0
0
0
57
Tim Abdulla
Tim Abdulla@tabdulla·
If our post-AGI future comes to pass, much of the economic rationale for immigration disappears, eliminating a bulwark against base nativist impulses.
English
0
0
1
196
Tim Abdulla
Tim Abdulla@tabdulla·
@emollick And if you subscribe to the beliefs of some folks that work at the big labs, then literally all work _except_ for research towards AGI is a waste of time.
English
0
0
1
24
Tim Abdulla
Tim Abdulla@tabdulla·
@emollick A truly general agent would reshape the entire economy, full stop. Most folks (not just in the AI space) that continue onwards with building are implicitly betting that they still have quite a long time before that happens.
English
1
0
1
182
Ethan Mollick
Ethan Mollick@emollick·
When I see this I realize most people, even those in AI, don’t get the vision of the AI labs When they talk about their agents, they mean generalized ones. Industry-specific knowledge gets subsumed. They may fail but they are aiming for that, which would kill most of these firms
Chief AI Officer@chiefaioffice

Nice map from Felicis breaking down the opportunity for AI agents in different markets AI agents are going after human services/labor

English
38
70
511
80.6K
Tim Abdulla
Tim Abdulla@tabdulla·
@levelsio The "no-bs" part doesn't seem to take into account the fact that they do bullshit all the time. Saying this as a former Tesla owner that was promised the ability to summon my car from anywhere way back in 2019.
English
0
0
1
24
@levelsio
@levelsio@levelsio·
Crazy to see the difference between Tesla and Ford's marketing Tesla posts no-bs high IQ factual checklists of features being shipped Ford posts legacy marketing that's almost insulting to the viewer in how low IQ it is
@levelsio tweet media@levelsio tweet media
English
160
43
1.4K
168.1K
Tim Abdulla
Tim Abdulla@tabdulla·
With incremental model updates, you appear to receive more intelligence for just changing a few characters around in a string. The reality is that you almost always need to tweak your prompts to accommodate for new idiosyncrasies.
English
0
0
1
173
Tim Abdulla
Tim Abdulla@tabdulla·
The amount of indirection involved in writing a Chrome extension is my job security against AI.
English
0
0
2
122
Tim Abdulla
Tim Abdulla@tabdulla·
merge conflicts are especially bleak when your only colleague is yourself
English
1
1
3
400
Tim Abdulla
Tim Abdulla@tabdulla·
With LLMs, there is truly no excuse for leaving out delightful little flourishes in your product. Fun animations, twee bits of whimsy, a splash of pizzazz — all achievable now with barely any effort. Have fun out there, folks.
English
1
0
7
139
Tim Abdulla
Tim Abdulla@tabdulla·
@emollick Yeah, it makes sense, just kind of funny that he essentially invalidates the efforts of folks at OpenAI working on the GPT Store, SearchGPT, the macOS client, etc.
English
0
0
0
177
Ethan Mollick
Ethan Mollick@emollick·
This (from someone at OpenAI) is an important thing to understand The major AI labs including OpenAI are very much focused on racing for the future. It is almost accidental that their early products are making billions of dollars. Their goal Is explicitly AGI, not your use case
English
39
66
637
75.8K
Tim Abdulla
Tim Abdulla@tabdulla·
@tszzl Sad for all your colleagues wasting their time on SearchGPT, the GPT Store, the macOS client, and so on!
English
0
0
1
160