infrecursion

3.8K posts

infrecursion

@infrecursion1

가입일 Nisan 2020

126 팔로잉39 팔로워

infrecursion@infrecursion1·12h

@inababi @HedgeyeComm Lmao claude sheep have the best cope.

English

Salina Mendoza@inababi·17h

@HedgeyeComm If they fired Sam Altman and took him off the board, they would actually have a chance. No bullshit.

English

556

Andrew Freedman, CFA 🦅@HedgeyeComm·19h

Hard pivot to become Anthropic

First Squawk@FirstSquawk

OPENAI TO MERGE CHATGPT, CODEX APP & BROWSER INTO DESKTOP SUPERAPP TO STREAMLINE RESOURCES & USER EXPERIENCE - WSJ

English

786

69.8K

infrecursion@infrecursion1·12h

@HedgeyeComm Is this somekind of a joke or just average hallucination of a claude sheep? Anthropic does not even have a model that does audio or image. It's pretty dogshit experience almost anywhere.

English

infrecursion@infrecursion1·12h

@SynBio1 Yes, but not because of what you think is the cause. It will be because humans like you become completely irrelevant.

English

Jake Wintermute 🧬/acc@SynBio1·1d

This is the core problem of the next 5 years in the AI scientist era: infinite hypotheses about which nobody cares

Sayan@thesayannayak

At this rate everyone’s gonna have their own app and zero users.

English

660

21.1K

infrecursion@infrecursion1·18h

@hypertectonic @btibor91 It could be a completely different model.

English

tecto@hypertectonic·18h

@btibor91 Everyone treating this like a feature announcement when it's actually OpenAI admitting Sora as a standalone product didn't work.

English

991

Tibor Blaho@btibor91·19h

ChatGPT/1.2026.076 (Android) adds an announcement that "Video in ChatGPT is here" - "Transform text and image into video with dialogue, soundtrack, and style."

English

279

24.2K

infrecursion@infrecursion1·1d

@nayshins Start by deleting your own codebase, every line. That will get rid of so much slop.

English

187

Jake@nayshins·1d

Has anyone documented all the code slop patterns yet? I want to lint for them and banish them to hades.

English

201

22K

infrecursion@infrecursion1·1d

@petergostev The post below is a valid criticism and why I can't take this benchmark seriously. It's just choosing my preferred model with extra steps. x.com/FakePsyho/stat…

Psyho@FakePsyho

I have eyeballed some of the outputs and imho the results are not as clear-cut as being presented (and that's an understatement). For a lot of outputs, it's very debatable what's the proper way of categorizing them. It seems to me that each model has it's own default style when answering a bullshit question and this style is surprisingly consistent between different questions. What you're mainly measuring is the preference that each judge has for each particular writing style. My other complain is that there's no diversity in questions: you're just embedding a random unrelated or made up terminology in a somewhat valid short question.

English

Peter Gostev@petergostev·2d

BullshitBench update: The new GPT-5.4 mini and nano models score quite low. This screenshot shows OpenAI models only, on the full list would put GPT-5.4-mini around 40th place and Nano is around 70th place. Again thinking didn't help much at all.

Peter Gostev@petergostev

BullshitBench v2 is out! It is one of the few benchmarks where models are generally not getting better (except Claude) and where reasoning isn't helping. What's new: 100 new questions, by domain (coding (40 Q's), medical (15), legal (15), finance (15), physics(15)), 70+ model variants tested. BullshitBench is already at 380 starts on GitHub - all questions, scripts, responses and judgements are there so check it out. TL;DR: - Results replicated - @AnthropicAI latest models are scoring exceptionally well - @Alibaba_Qwen is another very strong performer - OpenAI and Google models are not doing well and are not improving - Domains do not show much difference - rates of BS detection are about the same across all domains - Reasoning, if anything, has negative effect - Newer models don't do that much better than older ones (except Anthropic) Links: - Data explorer: petergpt.github.io/bullshit-bench… - GitHub: github.com/petergpt/bulls… Highly recommend the data explorer where you can study the data and the questions & sample answers.

English

6.8K

infrecursion@infrecursion1·2d

@gabrielchua Please put GPT-5.4 mini on ChatGPT. I want to use a reasoning model for many small to medium capability tasks like web search that doesn't require full GPT-5.4. I don't want to waste my limits using the full model. 5.4 mini should be available for selection, not as a fallback.

English

189

Gabriel Chua@gabrielchua·2d

Now with `gpt-5.4-mini` and `nano` out, I put together a simple cheat sheet of the latest OpenAI models by use case. Noticed at a few recent hackathons & meetups: some folks still default to `gpt-4o-mini` for LLMs and `whisper-1` for transcription. Newer options tend to fit better now with much better performance. If you’re running into issues switching, lmk!

English

285

29.1K

infrecursion@infrecursion1·2d

@ryanwinchester Yes, there are still people offering rides and riding on horse carriages. What's your point?

English

127

Ryan Winchester@ryanwinchester·2d

"wrote" ? a lot are still doing it

Sam Altman@sama

I have so much gratitude to people who wrote extremely complex software character-by-character. It already feels difficult to remember how much effort it really took. Thank you for getting us to this point.

English

141

8.3K

infrecursion@infrecursion1·2d

@krzyzanowskim No, sorry the worst codebase to work with are those by you, no exceptions. You're a shit coder.

English

Marcin Krzyzanowski@krzyzanowskim·3d

the worst. literally the worst. no exceptions. the worst codebase to work with coding agents is the one generated by the agents itself. usually, the best codebase to work with agents is the one that predates coding agents.

English

288

16.4K

infrecursion@infrecursion1·2d

@MyNameIsOnni @LeahLundqvist It's likely satire. They're quoting from the marketing post. Dall-E 2 is better.

English

1.9K

Onni@MyNameIsOnni·2d

@LeahLundqvist How is that in any way an improvement?

English

357

18.5K

leah lundqvist@LeahLundqvist·3d

"Even with the same prompt, DALL-E 3 significantly improves upon DALL-E 2"

English

435

1.2M

infrecursion@infrecursion1·2d

@jxnlco Why isn't mini offered as an option in ChatGPT? I want to be able to use a mini reasoning model for medium complexity tasks and search without having to spend 5.4 limits.

English

jason liu@jxnlco·3d

Super exciting for subagents

OpenAI@OpenAI

GPT-5.4 mini is available today in ChatGPT, Codex, and the API. Optimized for coding, computer use, multimodal understanding, and subagents. And it’s 2x faster than GPT-5 mini. openai.com/index/introduc…

English

5.4K

infrecursion@infrecursion1·2d

@alekzan @dejavucoder Opus is not even close to GPT pro. GPT pro is a different level altogether.

English

Alejandro Capellán | IA y Negocios Digitales@alekzan·2d

@dejavucoder I don’t think so. Opus could be GPT pro and sonnet vanilla GPT. Mini and specially nano have never been good models. They’re always behind of even identical open weight models.

English

274

sankalp@dejavucoder·3d

is what opus 4.6 to sonnet 4.6 the same as gpt 5.4 to gpt 5.4-mini? we will find out. so far this hasnt beem true

English

5.6K

infrecursion@infrecursion1·2d

@nickcammarata Fr, that's so annoying.

English

Nick@nickcammarata·2d

alternatively openai make a slightly better model that just answers the research question I asked rather than writing several moderately helpful long lists

English

3.4K

Nick@nickcammarata·2d

anthropic please fix your awful ios transcription my workflow rn while walking is is talking to chatgpt for its whisper model and copying and pasting to claude and i want to be freed from this

English

421

25.3K

infrecursion@infrecursion1·3d

@LokiJulianus "Yeah imma take everything happening out there and try to fit it inside my worldview, because as everyone knows I am the center of the universe" - You.

English

542

Just Loki@LokiJulianus·3d

Yeah, this does not sound like imminent recursive self-improvement to me.

English

1.4K

347.8K

infrecursion@infrecursion1·3d

@lefthanddraft It seems humans like you will get replaced before you keep repeating the same dumb questions with no undestanding of how these things work.

English

Wyatt Walls@lefthanddraft·4d

This thing is going to find a cure for cancer before it stops falling for dumb tricks.

English

159

102

8.4K

254.5K

infrecursion@infrecursion1·3d

@mweinbach This is completely irrelevant post AGI/RSI.

English

Max Weinbach@mweinbach·3d

Consumer AI will be left to Google and Apple Funny enough, both using core Google model technology and (likely for now) infra I wonder if this changes the OpenAI hardware strategy

The Wall Street Journal@WSJ

Exclusive: OpenAI’s top executives are finalizing plans for a major strategy shift to refocus the company around coding and business users on.wsj.com/3N6CFyr

English

909

113.1K

infrecursion@infrecursion1·4d

@OfficialLoganK That was how many Claude credits Logan?

English

Logan Kilpatrick@OfficialLoganK·4d

Spent all day vibe coding, fixing bugs in AI Studio, and polishing the experience :) So much fun!

English

185

1.4K

86.6K

infrecursion@infrecursion1·4d

@Sauers_ Dude what tf do you have in your system prompts that you get these weird results all the time?

English

1.1K

Sauers@Sauers_·4d

My Codex has been grinding lean proofs for 12h straight now; I checked in and apparently it created and then deleted 1k+ lines of "Simulation Theory"

English

233

18.9K

infrecursion@infrecursion1·4d

@ToFollowBrights @Zai_org Lmao how insane is your entitlement? This lab has single handedly released one after another SOTA quality open source models and the moment they try to raise some money (from their products not vc), they become OpenAI wannabe. Go fuck yourself.

English

ᅟTFB@ToFollowBrights·4d

@Zai_org not renewing my annual subscription then... only supporting organizations that share, not more Open AI wannabes

English

2.8K

Z.ai@Zai_org·5d

Introducing GLM-5-Turbo: A high-speed variant of GLM-5, excellent in agent-driven environments such as OpenClaw. Coding Plan Max: z.ai/subscribe OpenRouter: openrouter.ai/z-ai/glm-5-tur… API: docs.z.ai/guides/llm/glm…

English

178

292

2.6K

infrecursion@infrecursion1·4d

@nlarusstone Lmao, so you're basically so far up in your arse that you think the dog getting better was some kind of scam someone pulled off to confirm your worldview?

English

Nicholas Larus-Stone@nlarusstone·4d

The funny thing is AI already is accelerating drug discovery AND consumer biotech is going to be a big thing. This just ain’t it

English

1.9K

탐색

@inababi @HedgeyeComm @SynBio1 @hypertectonic @btibor91 @nayshins @petergostev @gabrielchua