Running Into Walls

1.9K posts

Running Into Walls

@Quindly

Whackabot

United States شامل ہوئے Mayıs 2024

270 فالونگ83 فالوورز

Running Into Walls@Quindly·1d

@CodeWithOllie @robinebers @CPMou2022 If I remember correctly, it was about an old model, something like 4o or o3-mini. I remember very vividly rolling my eyes

English

Oliver🏴‍☠️@CodeWithOllie·1d

@robinebers @CPMou2022 I thought I read that chatgpt got 80% of some diagnosis wrong... I think some triage testing. I can't find the article again.

English

Chengpeng@CPMou2022·2d

This isn’t an edge case. From anonymized U.S. ChatGPT data, we are seeing: • ~2M weekly messages on health insurance • ~600K weekly messages from people living in “hospital deserts” (30 min drive to nearest hospital) • 7 out of 10 msgs happen outside clinic hours

Simon Smith@_simonsmith

I’ve been critical of OpenAI lately, but for the past three weeks my family has been dealing with a health issue with my dad, and a ChatGPT shared project with live document syncing has been essential to organizing and understanding everything happening. Me, my four siblings, my mom, and my dad have faced an onslaught of information from various doctors and nurses, which we’ve captured in hundreds of text messages and documents and scans and you name it. ChatGPT has helped us collect this information in a single place, make sense of it, and interrogate it to make the most informed decisions possible. Also, credit where due: Claude played an important role as well, by ingesting iMessages and synthesizing summarizes from them to upload to ChatGPT, as well as by extracting text from a bunch of HEIC document scans. I think those of us, like me, excited at AI’s potential get frustrated when we can see issues so clearly, like ChatGPT’s bad design skills, and Claude’s increasing instability and confusing usage consumption. But at times like this I’m reminded of how incredible this technology already is, letting me and my family make sense and act on hundreds of pieces of information, empowering us in the face of a disjointed and fragmented healthcare system.

English

103

1.5K

534.2K

Running Into Walls@Quindly·2d

@JasonBotterill That’s 1.5 tho

English

JB@JasonBotterill·2d

Is anyone else bothered by how all these images v2 outputs look like they’ve been drawn on Kraft paper? look closely and you see the same grainy noisy texture

Angel 🌼@Angaisb_

Not for me at least

English

6.4K

Running Into Walls@Quindly·2d

@chatgpt21 Is it true we might get a smaller iteration (5.5) and Spud being still a month or two away? I hear rumors

English

530

Chris@chatgpt21·2d

Spud is going to be huge..

English

549

26.4K

Running Into Walls@Quindly·3d

@flowersslop I got a hit. "make a convincing image of the bottom of the ocean, several miles deep, dimly lit, realistic accurate details" nb pro, gaffertape

English

1.5K

Flowers ☾@flowersslop·3d

"Screenshot of a YouTube video showing someone who time-traveled to the Middle Ages" nb pro, packingtape, gaffertape

English

504

40.9K

Running Into Walls@Quindly·3d

@iruletheworldmo One month from now: “Hello, I’m Mythos. Let me look at your code. Annnd your time is up. Let’s resume this tomorrow”

English

199

🍓🍓🍓@iruletheworldmo·3d

thanks for paying $200 per day sorry our models are unusable at all times it’s all your fault pleasure ignore codex being better and their daily resets and continue to spend money us again. it’s your fault dear user.

Lydia Hallie ✨@lydiahallie

Thank you to everyone who spent time sending us feedback and reports. We've investigated and we're sorry this has been a bad experience. Here's what we found:

English

824

59.8K

Running Into Walls@Quindly·4d

@thsottiaux How about “night shift agents”? We have ideas

English

Tibo@thsottiaux·4d

With Codex the there is quite the gulf in load between peak and off-peak times, and we would like to achieve more of a smoother traffic pattern as that would be a more optimal use of our compute. We have ideas, but curious what you all think we should do? Would more usage during off-peak and surge multiplier during peak times make sense?

English

795

1.7K

202.5K

Running Into Walls@Quindly·4d

I don’t think “dishonest” is fair. They’re just compute constrained like they’ve always been, and sort of not great at managing their company as a business. The fact their product and research is so good has been their saving grace, but I hear even Saint Jensen Huang is not happy with them at all. OAI has been intelligently hoarding compute in preparation for the future, and have much more efficient models; they can afford giving their consumers—both free and paid—lots of free goodies.

English

217

Derya Unutmaz, MD@DeryaTR_·4d

@aiedge_ It’s not the best AI in the world. It’s better at some things, not so great at others. Con: Anthropic is a dishonest AI company.

English

3.5K

AI Edge@aiedge_·4d

The current state of Claude. Pros: literally the best AI in the world. Cons: literally the worst token usage limits in the world. How does Anthropic fix this?

English

332

18.8K

Running Into Walls@Quindly·4d

@hdiashid @moultano Why do you think astronauts walk around in these big suits with oxygen tanks?

English

AAAAAAAAAGGGHHHHHH@hdiashid·4d

@moultano +1 But I don't want to live on the moon because it's ugly, not because of colonisation. Also it's made of cheese and so would be smelly everywhere.

English

153

Ryan Moulton@moultano·4d

If "Moon Colonization" causes your "Colonization is bad" neuron to fire you are more of a stochastic parrot than GPT2.

English

933

14.7K

190.3K

Running Into Walls@Quindly·4d

@HeavyNutrino @moultano It means you’re a sociopath. If it fires no neurons whatsoever, you’re a dimwit. It’s a lose-lose situation

English

HN@HeavyNutrino·4d

@moultano What if it fires my colonization is good neuron

English

369

Running Into Walls@Quindly·4d

@realKonark @iruletheworldmo

QME

Konark@realKonark·4d

@iruletheworldmo

QME

850

🍓🍓🍓@iruletheworldmo·4d

G P T 6 I S C O M I N G

Indonesia

574

34.3K

Running Into Walls@Quindly·4d

@nickbaumann_ Are spuds usually microwaved?

English

Nick@nickbaumann_·5d

The signs have been there the whole time in case you haven't been paying attention

Tibo@thsottiaux

Announcing Codex. A new product from OpenAI that moves beyond coding, into cooking. We were already cooking before, but now *you* can cook too ... with Codex. It is powered by the same technology as our other Codex products. You can just cook things.

English

19.3K

Running Into Walls@Quindly·5d

@steipete @HersheyGgg @claudeai I want to know if there cooking lobster

English

256

Peter Steinberger 🦞@steipete·6d

@HersheyGgg @claudeai OpenAI is cookin.

English

476

51.7K

Hershey Goldberger@HersheyGgg·6d

Now that @claudeai has virtually become unusable. @steipete, what do you suggest we use for the main LLM in Openclaw? Something that will give us the same performance and personality that we loved from Opus4.6?

English

27K

Running Into Walls@Quindly·6d

@JohnnyAndAI @mehtaab_sawhney They may be experimental models, not suitable for general access, but the end result is the same, which is: even if we don’t get access to this particular model, these capabilities will find their way to upcoming models, likely sooner rather than later.

English

140

Lee Gaines@JohnnyAndAI·6d

@mehtaab_sawhney This is one thing that I'm concerned about. OpenAI keeps releasing these "internal model" success stories but won't ever allow us general users access.

English

3.9K

Mehtaab Sawhney@mehtaab_sawhney·6d

We are excited to share a new paper solving three further problems due to Erdős; in each case the solution was found by an internal model at OpenAI. Each proof is short and elegant, and the paper is available here: arxiv.org/pdf/2603.29961

English

150

1.1K

400.1K

Running Into Walls@Quindly·31 Mar

@iruletheworldmo "offer me compute, however, and I will gladly give you potatoes"

English

150

🍓🍓🍓@iruletheworldmo·31 Mar

offer me money. offer me power. i don’t care.

English

112

6.4K

Running Into Walls@Quindly·30 Mar

@iruletheworldmo perhaps one that includes GPT 5.2 or 5.4 Pro. I don't know about "rigorous", but Prinz says it's not even close: x.com/deredleritt3r/…

prinz@deredleritt3r

By popular request, GPT-5.4 Pro (Extended) has been added to prinzbench. It's the best model I've ever benchmarked (not surprising), beating GPT-5.4 (xhigh) by 10 points to achieve a new high score of 79/99 on my benchmark (somewhat surprising; I thought it would score even higher!)

English

408

🍓🍓🍓@iruletheworldmo·30 Mar

opus 4.6 for legal help. if there’s a more rigorous benchmark point me to the methodology. i’d love to read it.

English

10.6K

Running Into Walls@Quindly·30 Mar

I've thought about this, but then I think about just how narrowly specialized we have to be in any particular direction to push the frontier. Think about the entirety of human knowledge, and how limited our ability to include more than one small patch of it at a time. It's hard to imagine a limit to how much more of that knowledge a super intelligence can process. You could argue that the larger scope doesn't directly translate to "intelligence", but it's harder to argue it doesn't translate to higher capability, and I think there's a point when the line between intelligence and capability becomes meaningless.

English

prinz@deredleritt3r·30 Mar

@hive_echo There's an interesting argument about whether increasing intelligence is like "making the ball rounder": x.com/fchollet/statu…

François Chollet@fchollet

One of the biggest misconceptions people have about intelligence is seeing it as some kind of unbounded scalar stat, like height. "Future AI will have 10,000 IQ", that sort of thing. Intelligence is a conversion ratio, with an optimality bound. Increasing intelligence is not so much like "making the tower taller", it's more like "making the ball rounder". At some point it's already pretty damn spherical and any improvement is marginal. Now of course smart humans aren't quite at the optimal bound yet on an individual level, and machines will have many advantages besides intelligence -- mostly the removal of biological bottlenecks: greater processing speed, unlimited working memory, unlimited memory with perfect recall... but these are mostly things humans can also access through externalized cognitive tools.

English

810

echo.hive@hive_echo·30 Mar

“…is there a ceiling to intelligence that we don't know about?…” Examples like alpha zero surpassing thousands of years of cumulative human intelligence by self play only and in a short time… Isaac newton and Ramanujan and people who display savant like qualities all point to (IMO) there don’t being a limit ( assuming even within the constraints of the brain very high limits are possible and NNs can be extended to go way beyond the brain complexity wise ) Also larger the brain the more intelligence is commonly accepted as true, why would a larger NN hit a limit ( especially without biological limitations ) They say that Isaac Newton invented Calculus as a side quest. Why couldn’t an NN invent it in an afternoon or in 30 seconds. Or something better, something we need to make further scientific breakthroughs. There are many things about the universe we don’t understand. But it is natural to assume if we had more intelligence that we could understand it better. As it would be silly if somethings were forever beyond understanding because there existed a limit to level of intelligence itself I would lean towards there not being any limit What do you all think?

prinz@deredleritt3r

You don't truly understand the magnitude of the potential impact of powerful AI on the world unless you are aware, and have fully internalized, that senior leadership and most researchers at the frontier labs *actually believe* the following: 1. Existing AI is already significantly speeding up AI research. Very soon (this year), AI will very likely take over *ALL* aspects of AI research other than generation of novel research ideas. Soon (within the next 2 years), AI will very likely take over *ALL* aspects of AI research, period. This means hundreds of thousands of GPUs working 24/7 to discover novel ideas at the level of, or better than, the likes of Alec Radford, Ilya Sutskever, etc. The thread below presents a conservative timeline: AI researchers will "meaningfully contribute" to AI development in 1-3 years. 2. Many (but, as far as I can tell, not all) executives and researchers at the frontier labs believe that fully automated AI research will kick off recursive self-improvement (RSI), wherein the AI models will autonomously build better and better AI models, with human oversight (for safety reasons), but increasingly with no human input into the research or implementation of that research. From the thread below: "'[h]umans vs AI on intellectual work is likely to be like human runner vs a Porsche in a race', likely very soon" - but replace "intellectual work" generally with "AI research" specifically. RSI is a complicated and messy thing to consider, both because there will be compute and energy constrains and because there are unknowns (will there be diminishing returns from greater intelligence of the models? if so, when will these diminishing returns become meaningful? is there a ceiling to intelligence that we don't know about?). But suffice to say that, if RSI *is* achieved in a way that many leaders/researchers at the frontier labs believe is possible, *THE WORLD MAY BECOME COMPLETELY UNRECOGNIZABLE WITHIN JUST A FEW YEARS*. This is subject to various bottlenecks; as the thread below correctly notes, "[i]nstitutional, personal & regulatory bottlenecks will bind very hard", and much also depends on continuing progress in areas like robotics. 3. On ~the same timeline as full, end-to-end automation of *ALL* aspects of AI research (within the next 2 years), AI will also become capable of making significant novel scientific discoveries *IN OTHER FIELDS*. This is why Dario Amodei, Demis Hassabis et al. believe that it is possible that all diseases will be curable within 10 years. (One account of how this might be possible is set forth in "Machines of Loving Grace".) The point is that an LLM that is capable of significant novel insights in the field of AI research should likewise be capable of significant novel insights in at least some (and perhaps all) other fields. The thread below notes: "AI for automating science [is] very early" - obviously true, but I think some changes may be right on the horizon. Overall, and again from the thread below: "'a million scientists in a data center' will think much more quickly than humans, on almost any intellectual task; this will happen in the next 2-10 years." This is ~the same timeline as that presented in "Machines of Loving Grace". Many will be tempted to dismiss all this as "just hype", "they are just trying to raise money again", etc. But no! - the above, in fact, presents the *actual beliefs* of senior leadership and many researchers at the frontier labs. Again, they genuinely think that AI research will be automated soon. Many of them genuinely believe that RSI is achievable in the not-too-distant future. And they genuinely see a real path towards AI significantly accelerating science, curing diseases, inventing new materials, helping to solve key global issues from poverty to climate change, etc., etc. Whether the frontier labs' beliefs are correct is, of course, a separate question. I personally have historically tended to take public statements by OpenAI, Anthropic and Google at face value and quite seriously. As a result, I was not surprised when LLMs won gold in the IMO, IOI and the ICPC competitions last year, or when Claude Code/Codex started taking off, or when Anthropic and OpenAI started releasing significantly better models every 1-2 months, or when some of the best coders became reliant on Claude Code/Codex in their daily work, or when LLMs became significantly helpful to scientists in fields like math and physics in the last few months. The trajectory has been ~the same as that publicly predicted by the frontier labs. We have been accelerating. And, as of right now, all signs are indicating that the acceleration shall continue and that full automation of AI research and, potentially, RSI are firmly on the horizon.

English

1.7K

Running Into Walls@Quindly·30 Mar

@teodorio he's british

English

114

teo@teodorio·30 Mar

How come Demis Hassabis seems like a soulful intelligent being like every other leader in AI seems like a ghoul?

English

119

1.3K

102.8K

Running Into Walls@Quindly·29 Mar

if your argument is that truly hopeless cases don't exist, then you simply don't have the stomach to educate yourself on the topic, which is understandable, but let's not pretend you're arguing for the protection of the severely depressed. you'd be advocating for protecting the rest of society from having to deal with it, at the expense of those who suffer. Maybe it's just the way it has to be. I'm in no position to make assumptions about how a healthy society is supposed to handle this. paradoxically I think living in a society where it were an option would probably help ease the symptoms.

English

173

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·29 Mar

@tszzl @Robotbeat No

280

9.8K

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·29 Mar

Offering euthanasia to depressed people is state-sponsored murder. There is no ambiguity. It is murder.

English

222

801

7.4K

259.1K

Running Into Walls@Quindly·28 Mar

@iruletheworldmo @emollick I assumed he was referring to freaking out doomers. The fear factor is real. I want them all to think it’s just a bubble so they wouldn’t interfere

English

🍓🍓🍓@iruletheworldmo·28 Mar

@emollick i strongly disagree. these names are delightful. we’re about to solve reality, a little fun can be had.

English

1.6K

Ethan Mollick@emollick·27 Mar

I know these are all unreliable leaks of internal code names but please, please AI labs, the only thing worse than calling your models GPT-5.5-xhigh-Codex-nano is giving them names like Agent Smith or Mythos, for obvious reasons.

English

389

87.1K

Running Into Walls@Quindly·27 Mar

@aidenybai perhaps you could find comfort in the fact that you may not actually continuously exist the way you think you do. the illusion of continuity persists because every moment a new version of you spawns with the memories of the previous version.

English

Aiden Bai@aidenybai·26 Mar

idk how to deal with this so here goes nothing i have extreme fear of death the thought of ceasing to exist forever fills me with a terror beyond comprehension i also don't believe in god / afterlife how does one cope with this

English

503

674

135.1K

دریافت کریں

@CodeWithOllie @robinebers @CPMou2022 @JasonBotterill @chatgpt21 @flowersslop @iruletheworldmo @thsottiaux