Mockapapella

807 posts

@Mockapapella

10x Vibe Programmer 0.1x Vibe Debugger Please predict the next token responsibly

Joined September 2014
122 Following · 50 Followers
Mockapapella
Mockapapella@Mockapapella·
Wait. This wouldn’t happen to be a full GPT-2 XL pretrain, would it? I remember playing with one a couple of months before OpenAI released the official weights. I think it was from some students in Texas and cost something like $40K in equivalent compute? Because if that was you, I owe you a huge thank you: it was part of the catalyst that helped me start my first startup, an automated essay writing service. Because of that (and an insane amount of grinding) we managed to get an LLM API up and running for our own service before OpenAI had even announced theirs (beat them by a week or two, I think).
English
1
0
0
112
Mockapapella
Mockapapella@Mockapapella·
@rezoundous I do. Any time I downgrade I regret it due to the unreliability.
English
0
0
0
15
Tyler
Tyler@rezoundous·
Am I the only one using GPT-5.5 on xhigh all the time?
English
303
9
689
77.3K
Mockapapella
Mockapapella@Mockapapella·
At what point does AI saturate the field of Software Engineering and move on to the next field?
English
1
0
1
18
Mockapapella
Mockapapella@Mockapapella·
Fun little anecdote: a friend of mine works in manufacturing. Discovered Claude Code last fall. Went to a company engineering conference where the presenter heavily advocated for Copilot, which is accordingly the only officially approved agent harness they allow. Every single engineer there who uses agents still uses Claude Code. I've tried pushing him to use Codex, and to his credit he's given it a shot, but there's something about Claude that's just sticky. I think it's a combo of:
1. Claude has the best personality of any model
2. Claude is the only model officially served natively from all 3 big clouds, so companies can be sure their data stays within their cloud of choice
I'm inclined to believe that's directionally correct.
English
1
0
1
27
Mockapapella
Mockapapella@Mockapapella·
@sama If the model is slow I just run more of them in parallel
English
0
0
0
19
Sam Altman
Sam Altman@sama·
i get some anxiety not using the smartest-available model/settings. but sometimes i dont mind if it's really slow. i wonder if we should focus more on a price/speed tradeoff relative to a price/intelligence tradeoff.
English
2.1K
175
6.2K
609.3K
Mockapapella
Mockapapella@Mockapapella·
You vs the guy she tells you not to worry about
Mockapapella tweet media
English
0
0
1
12
Sachin
Sachin@sachpatro97·
The Claude Code x Codex split terminal inside Cursor setup has me looking for an updated monkey with AK-47 meme
English
1
0
2
55
Mockapapella
Mockapapella@Mockapapella·
New experiment: told Codex "/goal make me $5 and do something you’re really good at". Its first inclination was to make a landing page offering a code review for $5. I do not have high hopes lol
English
0
0
0
40
Mockapapella
Mockapapella@Mockapapella·
@alex_whedon Very interesting, though I've heard similar claims before. Signed up for the preview, would love to try it out for some development work!
English
0
0
1
205
Alexander Whedon
Alexander Whedon@alex_whedon·
Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), and the first frontier model with a 12 million token context window, which is:
- 52x faster than FlashAttention at 1MM tokens
- Less than 5% the cost of Opus
Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.
English
1.5K
2.9K
23.1K
12.6M
Mockapapella
Mockapapella@Mockapapella·
@koltregaskes Not surprised. 5.2 is still my go-to for engineering. It thinks wide. No other GPT model thinks like that.
English
0
0
1
18
Kol Tregaskes
Kol Tregaskes@koltregaskes·
GPT-5.2 tops Factory's code review benchmark at 60.5% F1 on real open-source repos. The evaluation analysed 50 pull requests from repos including Sentry and Grafana against a human-curated golden set of bugs, using consistent prompts and an LLM judge for F1 scores that balance precision and recall. GPT-5.2 even beat newer models such as GPT-5.5 (47.9% F1), while budget models like MiniMax M2.7 delivered strong results at ten times lower cost. The full dataset is open-sourced on GitHub. factory.ai/news/code-revi…
Kol Tregaskes tweet media
English
5
1
15
867
Jonathan Grahl
Jonathan Grahl@jonathangrahl·
How do I stop Opus 4.7 from writing shit like this.
Jonathan Grahl tweet media
English
77
4
249
59.3K
Mockapapella
Mockapapella@Mockapapella·
Scooby-Doo, but Shaggy unlocks goblin-mode GPT-5.5 and hard-carries every mystery while the gang just stares.
English
0
0
1
76
Mockapapella
Mockapapella@Mockapapella·
@TheRealAdamG It does not think wide like GPT-5.2 does, and despite all of the version bumps, I find myself going back to it for engineering for the reliability it offers.
English
0
0
0
535
Adam.GPT
Adam.GPT@TheRealAdamG·
I've started to notice a repeating pattern with GPT-5.5 -- it's consistently pretty awesome at almost everything. Concerning.
English
59
33
1.1K
44.5K
maria
maria@maria_rcks·
A lot has gone wrong with Claude over the years. Here's every instance I could find Introducing clawd[.]rip
English
82
139
1.9K
272.7K
Mockapapella
Mockapapella@Mockapapella·
LLMs can read text, see images, and hear audio. But how would you give them a sense of smell?
English
0
0
1
13
Mockapapella
Mockapapella@Mockapapella·
For the second time in earnest, I have seen phenomenal writing come out of an LLM. Both from GPT-5.4 Pro. It seems to be relegated entirely to research reports after a detailed back-and-forth conversation.
English
0
0
1
25