infrecursion

4K posts

infrecursion

@infrecursion1

Katılım Nisan 2020

124 Takip Edilen40 Takipçiler

infrecursion@infrecursion1·4h

@AlexGodofsky More hilarious antics from the horse carriage industry lmao.

English

Alex Godofsky@AlexGodofsky·13h

This is not workable. It is simply not possible on any level to detect students who are using LLMs for "conceptualizing" assignments, especially if they do so with any skill. The further restriction on uploading course materials to an LLM is silly.

Chris Hoofnagle@hoofnagle

Berkeley law has introduced a new, much stricter AI policy law.berkeley.edu/wp-content/upl…

English

3.2K

infrecursion@infrecursion1·12h

@Jesse_Livermore Listen man, hate to break it to you but GPT-5.5 high is smarter than your entire bloodline has ever been or will be - at least for math and science.

English

431

Jesse Livermore@Jesse_Livermore·1d

I'm almost at the point where I'm going to stop asking AI questions and instead ask it to find sources that might answer them. Because I know what it's doing: pushing the linguistic buttons in my brain that lead me to believe it and stick with it, even as it gives me garbage.

English

417

172.4K

infrecursion@infrecursion1·16h

@burny_tech Absolutely retarded cope and coming from people who are completely incapable of producing either kind of proofs. And no, there isn't a single example of a proof or new concept that was plucked out of thin air.

English

Burny - Effective Curiosity@burny_tech·22h

"Every one of those ingredients was already in the literature. The Golod-Shafarevich towers, the cutting mechanism of Hajir-Maire-Ramakrishna, the pigeonhole, the Minkowski packing bound: all of it sitting there, in papers from the last sixty years. The construction recombines them, ingeniously, into something nobody had assembled before. What it does not do is reach for a concept that was not already available to reach for. The distinction I care about is not whether a new named object appears, which would be a silly test that most great mathematics would fail, but whether the shortest explanation requires extending the working vocabulary or only requires a new path through the vocabulary already present. This proof is the second kind. The hypothesis space was not grown. It was traversed, much further than any human had the stamina to traverse it. Sawin and Shankar both call the proof genuinely ingenious and very human in character, and Shankar says plainly that the model went beyond being a helper: it had an original idea and carried it through. He’s right, and I’d rather grant that fully than explain it away. Recombining known structures in a way nobody had thought to try is real creativity. It’s most of what mathematics actually is. Fermat’s Last Theorem, Szemerédi, much of additive combinatorics: landmark work that introduced no new primitive object and was no less creative for it. So the claim can’t be that recombination is the lesser thing. The claim is narrower. There’s a kind of advance that recombination, however deep, just doesn’t reach: the one where the existing vocabulary can’t express the answer, and has to be enlarged before you can even state it. The metric tensor wasn’t a clever arrangement of Newtonian quantities. It was a new thing the old language had no word for. That’s the move I mean, at whatever scale it happens, and the unit distance proof, ingenious as it is, isn’t an instance of it. It’s the top of the other ladder. And the people who verified it describe it in terms that fit that reading, even though confirming my framework was the last thing on their minds."

Vishal Misra@vishalmisra

@ShriramKMurthi @Hesamation Fair to question - so I read the actual proof and the nine verifiers’ notes. It recombines existing number theory brilliantly; it doesn’t reach for a concept that wasn’t already there. Gowers (who checked it) names the same boundary. Wrote it up: @vishalmisra/the-hardest-test-so-far-72358d1b232c" target="_blank" rel="nofollow noopener">medium.com/@vishalmisra/t…

English

1.3K

infrecursion@infrecursion1·23h

@AnthropicAI Shut up man and release opus 5. No one gives a shit about this marketing/pr bs stunt.

English

Anthropic@AnthropicAI·23h

Patching these vulnerabilities will make us safer. But the software industry will need to adapt to the volume of vulnerabilities that models like Claude Mythos Preview will be able to find. We discuss this in our initial update on Project Glasswing: anthropic.com/research/glass…

English

1.3K

282.4K

Anthropic@AnthropicAI·23h

Last month we launched Project Glasswing, our collaborative AI cybersecurity initiative. Since then, we and our partners have found more than ten thousand high- or critical-severity vulnerabilities in essential software.

English

391

520

7.2K

1.9M

infrecursion@infrecursion1·1d

@ptrschmdtnlsn I love that the original statement by Noah that the guy dunked on to which Scott replied to has about the same authenticity and rigor as this long ass post, but far less pretentious.

English

206

Peter Schmidt-Nielsen@ptrschmdtnlsn·1d

There are several results that morally point in the direction of "you should expect endless rich beauty", but my favorite is this: You might naively worry that at some point we prove everything that's easy to describe, and we have to make up extremely complicated questions. And at that point, is it really that interesting to say that some 50 page contrived problem required some complicated theory to prove? However, we know this won't happen! If you take all questions of n characters, take the shortest proof of each, then look at the growth rate of the length of the longest such shortest proof, you find that the length must grow uncomputably quickly in n. Otherwise theorem proving would be computable by proof search, which would let you decide provability, thus a fortiori decide the halting problem. So, we must have a vast supply of very simple questions whose answers are spectacularly complicated. In practice, this is what we see, and the complexity seems to correspond to real richness! For example, as near as we know, the answer to "when does x^n+y^n=z^n have solutions over Z?" is just massively incompressibly deep, requiring the development of extremely sophisticated tools. Likewise, "what can be said of a group that's finite and simple?" seems to just be a massively deep question. That simple questions can require thousands of pages of deep theory should be unsurprising in light of this "proof lengths must grow uncomputably quickly" result! And "grows uncomputably quickly" is an absolutely staggering growth rate. There is likely some short couple-paragraph question where resolving it would require you to develop one million pages of rich theory, beyond the intellect of any human.

Scott Alexander@slatestarcodex

@souljagoyteller Can you explain his mistake in more detail? Is it that we can never run out of interesting math to do? Do we know this for sure (a theorem?) or as a common-sensical extension of the idea that we can study whatever mathematical structures we want?

English

194

17.9K

infrecursion@infrecursion1·2d

@Lux_Stella_ This is complete bullshit. Can you give one example of those supposed magical proofs that were generated ever in human history, which came out of thin air, purely through the genius of the particular individual? These people are clueless. All proofs are like this "specific kind".

English

660

Lux_Stella@Lux_Stella_·2d

there's some really interesting commentary from one of the mathematicians in the companion paper not to pour cold water on the results necessarily - but it suggests this is a specific "kind" of proof these models might be particularly good at

OpenAI@OpenAI

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

English

986

91.9K

infrecursion@infrecursion1·4d

@OfficialLoganK What is the price on API? Same as 3 flash?

English

380

Logan Kilpatrick@OfficialLoganK·4d

Try it in the Gemini API, Google AI Studio, Antigravity, AI Mode, Gemini App, and wherever else you use Gemini!

English

328

41.2K

Logan Kilpatrick@OfficialLoganK·4d

Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!

English

466

748

7.4K

651.9K

infrecursion@infrecursion1·4d

@GoogleDeepMind @Google Where is the link? Also no benchmarks or pricing info, doesn't look very promising lol

English

2.8K

Google DeepMind@GoogleDeepMind·4d

Introducing Gemini 3.5: our newest family of models combining frontier intelligence with real-world action. The first release is 3.5 Flash, our strongest model yet for agents and coding 🧵

English

121

397

3.8K

778.4K

infrecursion@infrecursion1·4d

@RolandDunbrack It's true but that has been so limited and has required so much compute to even predict something useful for real proteins for mere nanoseconds that compared to AI techniques now they may not even have existed. When AI works here it's just so much superior.

English

203

Roland Dunbrack 🏳️‍🌈 @rolanddunbrack.bsky.social@RolandDunbrack·5d

MD simulations of proteins have been around for almost 50 years (1977). What are you talking about?

Yang Fan 范阳@Yang_Supertramp

For the past 70 years, modern biology has been built on a static objects worldview: DNA = information, proteins = structures, cells = nanomachines, disease = broken parts, drugs = part repair/update. I'm honestly excited for a “new worldview” and “Dynamic Biology” to emerge.

English

243

39.6K

infrecursion@infrecursion1·5d

@Burkino_1 @lthlnkso It's not telling anything other than you're an ignorant fuck. Any of these AIs can mop the floor with your ass on any math or coding tasks. They are good at somethings, not good at others (unlike you with absolutely no talent at anything whatsoever).

English

350

Burkino 🥇‏@Burkino_1·5d

@lthlnkso I think it's pretty telling of AI that you can't just ask it "provide a deterministic answer to this question" but instead have to go "create a tool that has a deterministic answer"

English

11.6K

Quick Thoughts@lthlnkso·5d

1 prompt to Claude Code by the way.

Broskі@broskiFGC

its cool how half the global economy is contingent on this shit

English

921

238.8K

infrecursion@infrecursion1·6d

@_sholtodouglas Dude, does this post mean no new model this week?

English

Sholto Douglas@_sholtodouglas·6d

When do you reach for other models instead of Claude? What can we do better? Hit me with all of your frustrations. dms open. If you can give me detail (e.g. specifics/transcipts) - it'll help a lot in finding out exactly what we need to do to improve the next model

English

1.2K

1.4K

388.6K

infrecursion@infrecursion1·6d

@zx_sci_john Cute you think GPT-5.5 is only used for editing mistake. It can wipe the floor with your ass in any math exam, go fuck yourself.

English

313

Science Dogtor@zx_sci_john·6d

Scientific fraud isn't an editing mistake

English

18.5K

infrecursion@infrecursion1·6d

@RuxandraTeslo @RichardHanania This assumes that somehow AI will always be inferior to humans. Typical arrogance. Like any of these people have any chance of holding a candle to a super-intelligent AI. Even now the frontier models are like better than 99% of the mass.

English

105

Ruxandra Teslo 🧬@RuxandraTeslo·6d

Cool post from @RichardHanania where he shows the results of a "Turing test" on his writing. I personally found it very easy to identify what was AI written, but around half of respondents did not. This has really interesting implications regarding how public opinion will be shaped in the AI era, and who our "storytellers" will be, as Richard himself notes. It seems that such results point towards a trend predicted by @danwilliamsphil and others, where AI will lead to clustering around "expert consensus" and a de-democratization of opinions -- the opposite of what we saw with the internet. The interesting thing is that AI is arriving on a vacuum of intellectual authority, as trust in academia has been eroded. Obviously, academic publications still form a large part of its training corpus, but I think academic opinion will be less relevant relative to its volume at least. Especially in non technical fields. So who will be the shapers of public opinion in the AI era, then? We already saw that those who have written a lot on the internet, for example rationalists and effective altruists, seem to have their opinions overrepresented. This might not just about volume, but also the style of their argumentation (very indexed on logical coherence), which might be favored by LLMs. But how will this look like in the future? Important question imo. richardhanania.com/p/can-ai-repla…

English

12.6K

infrecursion@infrecursion1·6d

@TinaGurnaney You're high. It's the exact opposite. Maybe you're using free GPT and not GPT-5.5 high.

English

Tina Gurnaney@TinaGurnaney·6d

I can't explain this but Claude gives STEM graduate and ChatGPT gives degree in Arts & Philosophy

English

1.8K

infrecursion@infrecursion1·6d

@scottnarmstrong Amen, apply that to every subject and do it post hoc as well. Either of those models can find both human and AI slop. Ban all (it will probably make arxiv a read-only repo for some time, but worth it).

English

584

Scott Armstrong@scottnarmstrong·6d

A modest proposal: authors who submit math papers to arxiv containing obvious human-written errors that Opus 4.6 or GPT 5.5 would flag should receive a 1-year arxiv ban. Why should I read your paper if you haven't even taken basic steps to ensure its accuracy?

English

420

100.6K

infrecursion@infrecursion1·16 May

@Willrandship @max_paperclips Lmao you're just pulling numbers out of your ass.

English

William Shipley@Willrandship·15 May

@max_paperclips if it requires you to learn screen/tmux syntax it's inaccessible to 95% of developers and 99.9% of the population. what you really need is a vnc session

English

2.5K

Shannon Sands@max_paperclips·15 May

oh wow, and then you could have something we're calling a "secure" "socket" "shell" into a "virtual private server" Could you all stop reinventing stuff that's existed for decades from first principles and pretending you're innovating

Ishaan Sehgal@ishaansehgal

Your agents die when you close your laptop. We fixed that. Omnara Cloud Sandboxing is live. Close your laptop, the session keeps running in the cloud. Open it back up, you're right where you left off. Close the lid. Keep building.

English

3.7K

infrecursion@infrecursion1·16 May

@lefthanddraft Yeah they shipped this model pre-nerfed. But occasionally when they decide to turn the thinking on, it's glorious.

English

177

Wyatt Walls@lefthanddraft·16 May

Incredible alpha in distrusting what Opus 4.7 says.

English

14.8K

infrecursion@infrecursion1·16 May

@0xglitchbyte The only people I see complaining are those who haven't shipped anything worth anything in their life and never will.

English

2.3K

Glitchbyte@0xglitchbyte·16 May

When are we going to stop pretending oneshotting bun’s rewrite was a good idea?

English

1.4K

144.3K

infrecursion@infrecursion1·16 May

@LimYenKheng Yes you're supposed to do it but not for a fucking preprint you moron.

English

4.6K

Lim Yen Kheng@LimYenKheng·16 May

what the hell are you talking about. YES YOU'RE SUPPOSED TO DO THIS

Ted Pavlic (he/him/his)@TedPavlic

@WKCosmo Co-authors may read an entire paper and still not take the time to check that every reference exists and is relevant to how it's being used. Doing so will usually mean *each/every co-author* will have to go to Google Scholar, do a search for each citation, and follow the result.

English

323

17.1K

infrecursion@infrecursion1·16 May

@Kaju_Nut Why do you people always have the most short-sighted takes? Do you believe AI progress will just stop for some reason, and we won't have AI that soon understands science better than 99.9% of all humans (including you) can ever hope to? What would be the point of your postdoc?

English

210

Nirmalya Kajuri@Kaju_Nut·15 May

Today I was discussing with a colleague about the challenge of postdoc selection in the coming years, now that some PhD students (and maybe even PIs) are publishing heavily ai-assisted papers whose content they don't fully understand. Looks like interviews are the only way, and they might have to get quite technical.

English

12.7K

Keşfet

@AlexGodofsky @Jesse_Livermore @burny_tech @AnthropicAI @ptrschmdtnlsn @Lux_Stella_ @OfficialLoganK @GoogleDeepMind