ULO

307 posts

ULO

ULO

@Vinibi90

Katılım Eylül 2013
17 Takip Edilen0 Takipçiler
ULO
ULO@Vinibi90·
@THemingford In London this price is not even for a 1 bed house...
English
0
0
4
827
Thomas H. 💙
Thomas H. 💙@THemingford·
How on Earth have we got to the point where a 3 bed house costs £1200/£1300 per month to rent?
English
399
80
1.3K
121.1K
BridgeMind
BridgeMind@bridgemindai·
GPT 5.5 codename Spud ships Wednesday. Polymarket is pricing 80% odds. Pretraining finished March 24. Two years of research inside it. Brockman says it has "big model feel." If the rumors are real, OpenAI pulls back ahead.
BridgeMind tweet media
English
24
12
285
20.3K
Chubby♨️
Chubby♨️@kimmonismus·
Some people say that GPT-5.5 is already rolling out for them, it’s being stealth tested. Initial testing from them say it outperforms Opus4.7 for them (don’t know in which tasks tho). Hopefully it’s not being released on Monday since I’ll be on a 13hour flight to china and would miss the release
English
54
11
672
48.5K
Taelin
Taelin@VictorTaelin·
asked 4.7 if an algorithm I wrote satisfies a bunch of requisites. while thinking, it realized I broke one important invariant. this was very helpful. on the final answer, though? it just said my algorithm is brilliant and flawless!! wtf? glad I read the thinking traces...
English
37
16
828
39.2K
BridgeMind
BridgeMind@bridgemindai·
Claude Opus 4.7 is so good I just bought two Max subscriptions. After vibe coding for over 12 hours yesterday I can confirm this is the best model in the world. It burns tokens fast. But the coding output is unmatched. Two $200/month Max plans so I never hit a rate limit again.
BridgeMind tweet media
English
209
24
714
125.3K
ULO
ULO@Vinibi90·
@kimmonismus Oh, ok, ok... so this was buried in random posts, then... but, knowing these companies a bit, I wouldn't be surprised if, in a few weeks, our token limits "OUT OF NOWHERE" start degrading faster...
English
1
0
2
118
Tibo
Tibo@thsottiaux·
Codex just got a lot more powerful. Computer use, in-app browser, image generation and editing, 90+ new plugins to connect to everything, multi-terminal, SSH into devboxes, thread automations, rich document editing. Learns from experience and proactively suggestions work. And a ton more.
Tibo tweet media
English
434
375
5.2K
422K
ULO
ULO@Vinibi90·
@VictorTaelin @__maicl So, for every request you ask Opus, you have to follow up asking not to be lazy and do it in a proper way?
English
2
0
1
681
Taelin
Taelin@VictorTaelin·
My final thoughts on Opus 4.6: why this model is so good, why I underestimated it, and why I'm so obsessed about Mythos. When I first tested GPT 5.4 vs Opus 4.6 - both launched at roughly the same time - I was initially convinced that GPT 5.4 was vastly superior, because it did better on my logical tests. That's still true: given the same prompt, by default, GPT will be more competent, careful, and produce a more reliable output, while Opus will give you a half-assed, buggy solution, and call it a day. Now, here's what I failed to realize: Opus bad outputs are not because it is dumb. They're because it is a lazy cheater. And you can tell because, if you just go ahead and tell it: "you did X in a lazy way, do it in the right way now" And if you show that this is serious, it will proceed to do a flawless job. That doesn't happen with dumber models. And, the more I work with Opus, the more I realize that, if you just keep pushing it, its intelligence ceiling is much, much higher than it seems. It IS there, you just need to be patient and push it. GPT, on the other hands, when it fails, it already did its best, so, pushing it further will give you no added results. That is also one of the reasons that benchmarks lie. When Claude and GPT score the same in a given benchmark, it is likely that Claude is actually smarter, because it puts less effort. Now, consider that for a moment, and remember that Mythos is outperforming GPT 5.4 *Pro* on benchmarks. How insane that is? Remember that Sonnet 3.5 lagged behind on benchmarks, yet everyone knew that it was superior to 4o. I think it is this effect at play: for whatever reason, Claude-series model "try less hard" on the first shot. Because of that, even if Spud gets close to Mythos on benchmarks (which I predict will be the case), I suppose Mythos will still be superior. This also leads me to wonder if perhaps Anthropic actually has a real lead over OpenAI, that will only get larger? I could totally see a timeline where Anthropic's models become so good that OpenAI simply fails to catch up as the recursive improvement unfolds? Just my silly thoughts though, what do I know As always I could be wrong, and I hope I am!!
English
154
60
1.5K
176.4K
ULO
ULO@Vinibi90·
@__maicl @VictorTaelin The question is: why wouldn't it work at its best the first time, without having to ask it not to be lazy and to redo it?
English
2
0
1
666
heruvim
heruvim@__maicl·
@VictorTaelin @Vinibi90 It's very interesting though, because most people in my timeline fanboy one of the two and all conversions I've seen have been from Claude to GPT I guess, after seeing Opus < GPT output couple of times, I gave up and never thought to tell it "don't be lazy", assuming skill issue
English
1
0
5
696
Murray Hill Guy
Murray Hill Guy@MurrayHillGuy1·
How to get laid in 2026: Set Hinge location to Williamsburg/DUMBO or anywhere in Brooklyn Filter: “liberal only” like below (congrats you just unlocked the whole app) Bio must include: therapy, “reading again,” “he/him” pronouns and one fake hobby Match with: septum piercing / 4+ tattoos / tote bag Date plan: $9 matcha + walk in park (say you’re “off alcohol for mental clarity”) Topics to hit: • how exhausting capitalism is • Mamdani mentioned at least 30 times and how good he is • how you “almost moved to Lisbon” • how evil Trump is (don’t overdo it, just nod intensely) • your latest “burnout” despite having a fake email job Close: “I have a record player, wanna come hear this vinyl?”
Murray Hill Guy tweet media
English
196
139
6.5K
984.5K
ULO
ULO@Vinibi90·
@zaygranet What if was just someone trolling with the wifi name?
English
1
0
0
14.9K
Isaiah Granet
Isaiah Granet@zaygranet·
Our office is next to a frontier labs secret office. Blacked out windows, 24/7 security guard outside, no markings the whole thing. We figured it out because they named their WiFi [Lab name]-[Team Name] lol
English
32
60
7.6K
425.6K
ULO
ULO@Vinibi90·
Translation: "In Iraq, the Houthis in Yemen, and Iran itself. In other words, they wanted to strangle us, and now we’re strangling them. They threatened us with annihilation, and now they’re fighting to survive. We’ve hit them hard. We still have more to do, and I’ll lay that out in a moment. But I want to start with Iran itself."
English
0
0
0
581
Jackson Hinkle 🇺🇸
Jackson Hinkle 🇺🇸@jacksonhinklle·
🚨🇮🇱🇮🇷 NETANYAHU: "The campaign against Iran is not over — we still have more to do" Demonic fucking lunatic.
English
867
7.1K
36.3K
1.1M
Josh Hunt
Josh Hunt@iAmJoshHunt·
I want to talk about the scale of what’s coming for the UK over the next three months. Because I don’t think many people have joined the dots yet. The Strait of Hormuz has been effectively closed for over five weeks. Before this war, 135 ships passed through it every day. Now it’s 5 to 7. Over 600 vessels are still stranded. Iran has mined the strait, is charging tolls, and controlling who passes. The CEO of Abu Dhabi’s national oil company said it this week: “The Strait of Hormuz is not open. Access is being restricted, conditioned and controlled. That is coercion.” Two thirds of Gulf crude has no alternative route. 14 million barrels a day behind a 21-mile chokepoint. Energy bills are forecast to jump 20% in July. From £1,641 to nearly £2,000. The second major energy shock in four years. Petrol up over 15%. Diesel up nearly 30%. Wholesale gas rose 75% in under four weeks. Food inflation could hit 8% by June and 9% by December. Academics advising DEFRA say it could reach 12%. UK food prices are already 38% higher than before Covid. We’re only 62% self-sufficient in food. We import 60% of our nitrogen fertiliser. Red diesel for farming has surged 60%. Average arable farm income has fallen to £17,000, the lowest in over 20 years. Yesterday, China announced it’s halting all sulphuric acid exports from May. Sulphuric acid is essential for phosphate fertilisers, copper mining, oil refining, and battery manufacturing. A third of the world’s sulphur was already blocked by the Hormuz closure. Now the world’s largest exporter has pulled the other lever at the same time. The fertiliser crisis just got significantly worse, heading straight into planting season. Before the war, markets expected rate cuts. Now they’ve priced in two rate rises. Over 1,500 mortgage products have been pulled. Two year fixes have jumped from 4.8% to 5.5%. Nearly £1,000 a year extra on a £200k mortgage. Gone in weeks. Flights are next. A quarter of UK jet fuel comes from Kuwait, behind the strait. In early April, major carriers said they had five to six weeks of reserves. That clock is running. Ryanair’s CEO has warned 5-10% of summer flights could be cancelled. Iran’s strike on Qatar’s Ras Laffan LNG complex, which handles 30% of the world’s helium, is estimated to take 3 to 5 years to repair. Helium is critical for semiconductors and MRI machines. That’s not a disruption. That’s structural damage. Chemical and steel manufacturers are imposing surcharges of up to 30%. Analysts are warning of permanent deindustrialisation. European gas storage was at just 30% after a harsh winter. If the strait stays restricted through summer, Europe can’t refill for next winter. In Ireland, fuel protests shut down Dublin for four days. The army was deployed. Over 100 fuel stations ran dry, with warnings of 500 by end of the week. Downing Street has held talks on the potential for mass protests here. The OECD has downgraded the UK more than any other G7 nation. Growth slashed from 1.2% to 0.7%. Inflation forecast nearly doubled to 4%, with some saying it could breach 5%. Starmer and Trump spoke this week about military options to reopen the strait. The UK is leading a 30+ nation coalition. But the ceasefire is already fracturing. Iran re-closed the strait over Israeli strikes on Lebanon. Reeves is boxed in by fiscal rules. Higher gilt yields are eating her headroom. And I haven’t heard a credible plan from anyone in Westminster. Energy. Food. Fertiliser. Aviation fuel. Mortgages. Industrial chemicals. Semiconductors. Shipping. Government borrowing. Political stability. All under stress. All compounding. This country imports 44% of its energy. Has almost no gas storage. Imports most of its food and fertiliser. Gets a quarter of its jet fuel from behind a mined strait. Every structural weakness built up over 20 years is being stress tested at once. The next three months aren’t going to be uncomfortable. They’re going to be defining
English
376
1.4K
5.5K
904.7K
International Cyber Digest
International Cyber Digest@IntCyberDigest·
🚨 BREAKING: The FBI has successfully extracted deleted Signal messages from a suspect's iPhone via notification storage, the place where all your notifications are stored for up to one month. Notification storage stores data from all messaging apps, it's a big flaw in iOS. But there's a way to turn it off...
International Cyber Digest tweet mediaInternational Cyber Digest tweet media
English
463
4.7K
24.9K
6M
ULO
ULO@Vinibi90·
@kimmonismus ChatGPT, write this little article for me and make a few typos and obvious mistake... hahahaha
English
0
0
0
44
Chubby♨️
Chubby♨️@kimmonismus·
Those typos happen when you write content by hand :D
English
6
3
45
7.1K
Chubby♨️
Chubby♨️@kimmonismus·
Meta's new model could pose a threat to only one company: OpenAI. OpenAI currently has 900 weekly users, 95% of whom are still in the free tier. It's arguably the best model for the average user, which is why most people use ChatGPT in their daily lives. With Spark, Meta has now developed a model that is being rolled out free of charge (!) to one billion users and is at least as useful for everyday use as ChatGPT. Let's be honest: 99% of people don't use LLMs for coding or frontier math, but for questions about tax returns, legal violations, brainstorming, or simply for chatting. ChatGPT and Spark are equally well-suited for this. Meta has the Moat distribution. If Meta succeeds in introducing Spark to its users and they realize that they now have a model within the Meta ecosystem that can address their concerns and needs just as effectively as ChatGPT, the consumer market could shift towards Meta. *That* could be dangerous for OpenAI. Because the business and enterprise sector is primarily located at Anthropic. OpenAI would also like more access to this market, but is currently still struggling for market share. OpenAI is deeply rooted in the consumer market. This is where Meta can become a real threat. That should set off alarm bells for OpenAI.
English
70
33
684
61.1K
ULO
ULO@Vinibi90·
@IntCyberDigest Maybe, but since I live in the UK, that might affect my Apple Store somehow... that's life, I guess...
English
10
0
34
10.4K