Michal Pstrag

16 posts

Michal Pstrag

@micpsst

applied ai / engineering

Katılım Haziran 2025

35 Takip Edilen3 Takipçiler

Michal Pstrag@micpsst·25 Şub

@nikunj @trq212 @womeninaisf @AnthropicAI SF feminists must love it

English

Nikunj Kothari@nikunj·25 Şub

Excited to help the @womeninaisf club host a Claude Code event w/ the team at @AnthropicAI next week. We're looking for volunteers to DEMO their projects - if this is one of you, please sign up at the link in the comments. Link to register to attend - also in the next tweet!

English

177

22.8K

Michal Pstrag@micpsst·20 Şub

@HarryStebbings If everyone did that, the internet would be a much better place

English

Harry Stebbings@HarryStebbings·20 Şub

The craft of your product is the respect for your customers. We do not release 30% of shows we record. Every single show, 10 years in, with large media teams, I still listen to every single one pre-release. We remove 35-45% of shows to optimise word to value ratio. Details are not details, they are the product.

English

8.2K

Michal Pstrag@micpsst·19 Şub

@_djdumpling No qwen?

Filipino

Alex Wa@_djdumpling·18 Şub

new blog! What methodologies do labs use to train frontier models? The blog distills 7 open-weight model reports from frontier labs, covering architecture, stability, optimizers, data curation, pre/mid/post-training + RL, and behaviors/safety djdumpling.github.io/2026/01/31/fro…

English

285

284.6K

Michal Pstrag@micpsst·28 Oca

@AviSchiffmann Great case for modern psychiatry

English

230

Michal Pstrag@micpsst·28 Oca

@dianalokada What if she’s not tech-savvy?

English

diana@dianalokada·28 Oca

if your girl is texting you back fast and being extra nice, im sorry to tell you but it’s not your girl, it’s her Clawdbot focus on yourself king

English

164

13K

Michal Pstrag@micpsst·13 Oca

@pmddomingos They still grew revenue in 2025

English

1.2K

Pedro Domingos@pmddomingos·13 Oca

RIP Stack Overflow.

English

788

1.9K

19.8K

4.7M

Michal Pstrag@micpsst·12 Oca

@battleangelviv @Starlink As much as I'd like it to be true, it's not

English

Viv 🪩@battleangelviv·12 Oca

.@starlink has forever changed the world because it means that no tyrant or authoritarian government is able cut off their people from the world

English

319

Michal Pstrag@micpsst·7 Oca

@TicTocTick Gym isn’t always about burning calories

English

Emini tic@TicTocTick·7 Oca

A fool wastes an hour in the gym burning 400 calories. Wise man decides in 3 seconds not to eat a 500 calorie cheese cake 🍰

English

190

10.1K

Michal Pstrag@micpsst·29 Ara

@Teknium You can prefill theorems/definitions. It's basically what reasoning tries to do - chiming in the missing pieces of relevant context

English

Teknium 🪽@Teknium·29 Ara

@micpsst the reasoning cots are not really just about information retreival, so I'm not sure I agree. What are you doing with RAG that solves olympiad math problems aside from rag'ing the answers?

English

365

Teknium 🪽@Teknium·28 Ara

Reasoning in LLMs has actually broken at least one intuition about data that I thought I was confident in. Prior to reasoning models, there was a lot I could predict based on the data that went in, such as things like average output lengths and limits on how many tokens they'd generate. It used to be if you trained on outputs of 4k max, you'd have a near-0% chance to generate 10k+ tokens. But with reasoning models, they actually learn a function through this data that can make it generate way way way beyond what length of output tokens you trained on. I think this justifies calling it "reasoning", because it actually learned a function similar to reasoning, by generating tokens that look like thinking to improve accuracy until it is confident in it having found the correct answer, and even if you train on 10k cot tokens max, models will still think, potentially through the entire 128k+ context length it has. Something else interesting about "reasoning" is that we observed when scaling Hermes 4 from 14b, to 70b, to 405B, that the thinking lengths went down and down for the same set of problems as the model got bigger. This also implies that the reasoning process is very much tied to innate intelligence, because the problem is, relative to each model, a different difficulty, and it literally *thinks longer* if it is less intelligent! Just some fun facts for you on this Sunday :)

English

657

44.7K

Michal Pstrag@micpsst·29 Ara

True, but doable with GraphRAG-ish solutions. It also feels more stable than relying on reasoning at inference. Most of the time, when reasoning activates, the model just repeats or paraphrases what's already in the prefilled context, which is generally super low info. You can just duplicate the context and often get the same outcome. There's a recent Google paper on this, though they didn't benchmark against reasoning models arxiv.org/pdf/2512.14982

English

Teknium 🪽@Teknium·29 Ara

@micpsst Sure but that’s not really easy lol

English

446

Michal Pstrag@micpsst·27 Ara

They should do the same for trading apps

Reuters@Reuters

New York to require social media platforms to display mental health warnings reut.rs/49cy1FS reut.rs/49cy1FS

English

510

Michal Pstrag@micpsst·18 Ara

@ekzhang1 Well done, Eric!

English

198

Eric Zhang@ekzhang1·18 Ara

I'm open-sourcing jax-js — a machine learning library for the web, in pure JavaScript jax-js is the first ML compiler that runs in the browser, generating fast WebGPU kernels. Built from scratch over the past year as a personal side project Details: ekzhang.substack.com/p/jax-js-an-ml…

English

294

2.9K

196.7K

Michal Pstrag@micpsst·18 Ara

@dianalokada Said no girl ever

English

340

diana@dianalokada·18 Ara

girls are like ugh he is so weird and autistic i wish he was my boyfriend

English

585

26.5K

Michal Pstrag@micpsst·8 Ara

@pmddomingos You roasting NeurIPS papers on YT would be a banger

English

1.3K

Pedro Domingos@pmddomingos·8 Ara

Report from NeurIPS: the state of the art in LLMs is doing fake reasoning with fake RL.

English

251

32.7K

Michal Pstrag@micpsst·26 Eyl

@poetengineer__ Subtle incentive to keep the context clean

English

Kat ⊷ the Poet Engineer@poetengineer__·26 Eyl

not sure if it's just me, but chatgpt used to update the conversation's title as its content evolves, and now the title's just fixed on whatever you ask first.

English

3.2K

Keşfet

@nikunj @trq212 @womeninaisf @AnthropicAI @HarryStebbings @_djdumpling @AviSchiffmann @dianalokada