J

306 posts

J

J

@JackHaller7

Katılım Temmuz 2018
171 Takip Edilen8 Takipçiler
J
J@JackHaller7·
@emollick Unlikely they could keep it a secret OpenAI and Anthropic people all go to the same parties.
English
0
0
2
235
Ethan Mollick
Ethan Mollick@emollick·
The imaginary optimal selfish scenario for OpenAI, in retrospect, was to keep Reasoners a secret, skip releasing o1 and o1-preview, and release o3 as GPT-5 There would have been no Deep Seek moment, other labs may not have discovered Reasoners. But they weren't selfish about it!
English
10
6
141
12.9K
Ethan Mollick
Ethan Mollick@emollick·
The second most important release of the LLM era (after GPT-3.5), featuring what was likely the most important chart. Still seems surprising to me that OpenAI told everyone about the biggest advance in AI technology since the LLM rather than keeping it to themselves until later.
Ethan Mollick tweet media
Adam.GPT@TheRealAdamG

openai.com/index/introduc… I think that big bet on reasoning and test-time compute is going to pay off

English
67
30
960
189K
J
J@JackHaller7·
@NateSilver538 @TheStalwart Imo compressing is worse then just starting over fresh again really need to keep your work small enough for 1 context windows
English
0
0
1
125
Nate Silver
Nate Silver@NateSilver538·
@TheStalwart Also wonder if this has something to do with very long context windows, seems to increase once you've gotten to the point where the chat is so long that it has to start to compress it. But wasn't noticing this a month ago when I also hit that point on a previous model.
English
13
0
51
13.7K
Nate Silver
Nate Silver@NateSilver538·
I wonder whether there's either emergent or intentional behavior where Claude is trying to minimize compute. Working on a fairly complex soccer model. Especially in the past few weeks, it's often like "that's good enough, can we move on?" as I'm trialing different model specs.
English
124
16
1.1K
232K
Alice
Alice@AliceDev98·
@pancakehime012 Code is different from art. Art can be subjective while code follows patterns repeated the same time over and over again. And it didnt steal jobs from programmers.. it just made it easier.
English
7
1
28
7.5K
eigenrobot
eigenrobot@eigenrobot·
we can confidently say that the only academic field making progress today is the one primarily concerned with documenting prehistoric race wars total @bronzeagemantis victory tbqh
Hutty B. Goode@pleasemilf

@eigenrobot Ancient genomics has made some serious strides in the past 5 years

English
5
9
246
7.3K
J
J@JackHaller7·
@JeremiahDJohns I asked an intern to translate "He was being silly and enjoying himself until a bunch of girls began to harass him, making him stressed. Is ignoring women and instead focusing on mogging foid worshipping men more useful than using your looks and sexual market value to get girls?"
English
0
0
5
283
Jeremiah Johnson 🌐
Jeremiah Johnson 🌐@JeremiahDJohns·
I can barely even tell this is English but I'm pretty confident it qualifies for the Worst Tweets Bracket
Jeremiah Johnson 🌐 tweet media
English
63
52
2.1K
55.7K
J
J@JackHaller7·
@alexolegimas @emollick Sorry but why does saying you can't distinguish functional forms with 15 data points warrant a paper this is just standard undergrad statistics knowledge?
English
0
0
1
59
Alex Imas
Alex Imas@alexolegimas·
@emollick Yup. This is the key part of the abstract:
Alex Imas tweet media
English
1
1
16
1.5K
Ethan Mollick
Ethan Mollick@emollick·
This paper (by a colleague) is being misunderstood - it is not a prediction about whether the METR exponential curve is ending, but an argument that it is plausible to model the exponential as a series of stacked s-curves, which would require further breakthroughs (at the scale of reasoning) to continue. I don't necessarily even think this contradicts what most insiders in AI, from Demis to Andrej, are saying!
Hamsa Bastani@hamsabastani

I wanted to make a few clarifications, which we believe were clear in our paper but not in my original post (re-analyzing @METR_Evals data). Our contribution is to posit progress as a multiplicative product of sigmoids around different innovations. Given the METR data, we split it into improvements in base capabilities (data/model size) and reasoning. We show that this product provides a similar *in-sample* fit to the small datasets we observe as exponential growth. However, the implications are very different! Under our model, we would need continued innovations (akin to reasoning) to see continued exponential progress. This isn’t to say we rule out exponential progress, or that our product of sigmoids is the right model. It is simply to say there are few points and multiple possible underlying models with very different implications. Our product sigmoid fit actually fits very well when holding out GPT 5.2 and/or Gemini 3 pro. We do look worse when additionally holding out Claude Opus 4.5, but still plausible. Our goal isn’t to quibble about OOS metrics on a handful of data points, but to point out that existing forecasts are fragile, and don’t model the succession of different innovations. (There are a couple of other fits floating around X, but they don’t seem to be using our proposed product sigmoid so I can’t say what’s going on there…) I apologize for my un-nuanced earlier post – we hope people will read the paper!

English
9
10
126
26.3K
J
J@JackHaller7·
@mattyglesias *Well nourished people earn more because they're smarter and stronger.
English
1
0
6
628
scoopdiddyoop
scoopdiddyoop@scoopdiddy1·
@adrusi i do not believe this accounts for the fighting words doctrine
English
1
0
4
911
Matt Darling 🌐🏗️
Matt Darling 🌐🏗️@besttrousers·
@tszzl From an external perspective, this sort of called the peak. Is it very different from an internal perspective?
English
4
0
14
6.6K
roon
roon@tszzl·
if you think AI chatter has reached an annoying level right now you're in for something else. it's going to be the only thing on anybody's mind starting shortly
English
131
328
3.9K
1.1M
J
J@JackHaller7·
@HGWellsFargo @Citrini7 Spend all your money on cocaine while you still have time
English
0
0
3
312
H. G. Wells Fargo
H. G. Wells Fargo@HGWellsFargo·
@Citrini7 what's the trade for "holy f*ck agents are going to consume everything"
English
3
1
23
8.3K
chopsticks
chopsticks@1chopsticks·
@dylanmatt transphobes be like "you're not a real woman" but transwomen proving they're women by also dodging accountability and lying for no reason
English
4
6
308
5.7K
Basil🧡
Basil🧡@LinkofSunshine·
@Ancapta What information do any derivative markets provide?
English
1
0
76
3K
Basil🧡
Basil🧡@LinkofSunshine·
I’m kinda surprised someone didn’t do this earlier? Making bets that crypto goes up then buying a bunch of crypto seems like the most obvious play ever lol
PredictTrader@polymarketbet

A Polymarket trader ran a Wolf of Wall Street–level play overnight - made $233K by drained liquidity from trading bots and it flew completely under the radar. The setup was brilliant and extremely simple. A trader known as @a4385 made $233K overnight exploiting 15-minute Polymarket markets. Saturday night. Liquidity is thin. Binance spot order books are shallow. On “XRP Up or Down — Jan 17, 12:45–1:00 PM ET” he aggressively bought UP at any price. His counterparties were trading bots. Polymarket market making is relatively straightforward, with low barriers to entry for solo devs and trading bots are now very popular. By the 10th minute of the market - XRP was down ~0.3% from the open yet he had pushed UP shares to 70¢. The bots saw an opportunity and walked straight into the trap selling him even more UP. ~77K UP accumulated at an average price of ~48¢. Two minutes before settlement, a wallet on Binance bought ~$1M USDT of XRP spot, pushing price ~0.5% higher. Seconds after settlement, the $1M spot buy was sold back. Cost of the manipulation: ~0.25% slippage each way + fees. With Binance VIP 4 level (0.06%) (quite easy to obtain) and 0.25% slippage on both sides the total cost was ~$6,200; It may be less. He ran the same play multiple times, cleaning out bot wallets by exploiting thin weekend liquidity. @a4385 - polymarket.com/profile/0x506b… Some bots were shut down in time. Others didn’t react fast enough and lost their entire balances - including @aleksandmoney" target="_blank" rel="nofollow noopener">polymarket.com/@aleksandmoney, which gave up a full year of profits.

English
13
8
824
94.6K
J
J@JackHaller7·
@pleometric lmao can you post the text
English
0
0
0
322
J
J@JackHaller7·
@0xMerridew It's called using Twitter what's so confusing
English
0
0
0
196
Joe Weisenthal
Joe Weisenthal@TheStalwart·
Corn futures are plunging
Joe Weisenthal tweet media
English
69
112
1.3K
1M
Drew Pavlou 🇦🇺🇺🇸🇺🇦🇹🇼
Palestinian wedding: “Harden your heart, oh Putin. Increase your attacks. We will marry Ukrainian women. Also we say to China, invade Taiwan.”
English
99
239
1.2K
178.8K
J
J@JackHaller7·
@besttrousers @AlanMCole ✊ Finally someone brave enough to attack 18 month olds those sniveling brats have had it to good for to long.
English
0
0
13
473
Matt Darling 🌐🏗️
Matt Darling 🌐🏗️@besttrousers·
@AlanMCole Isn't that part of the insidiousness, though? Like it's very funny to hear a 18 month sing "THAT PERFECT GIRL IS GOOONE!". What expectations were society holding you to, exactly?
English
6
1
251
13.3K
Alan Cole
Alan Cole@AlanMCole·
It's very funny that Frozen tries to do all this stuff to subvert Disney Princess-ism, and it's all completely lost on the 3-year-old girls who just love Elsa in the princess outfit.
English
99
245
11.4K
443.5K