Max Wolf
274 posts
@MaxWolf_01
serendipity maxxing
Vienna · Joined November 2022
508 Following · 65 Followers

Pinned Tweet
Max Wolf @MaxWolf_01
yapit.md turns URLs and PDFs (even research papers) into clean, listenable markdown. @kepano's defuddle handles websites; a vision LLM handles PDFs and, for example, gives math spoken alt text. Free TTS in your browser, open source, self-hostable - and I didn't make UI/UX an afterthought.
1 · 1 · 2 · 118
Max Wolf @MaxWolf_01
LLMs are great as long as you don't get too lazy to ask the questions.
0 · 0 · 0 · 8
Max Wolf @MaxWolf_01
@thdxr yes, especially since you can just think out loud and send that when you're stuck. Or even if you arrive at something precise, it gives more nuance. I used to have up to 5k-token prompts back in Jan/Dec when I vibecoded stuff. Currently I like to be more in the loop and think twice.
0 · 0 · 0 · 59
dax @thdxr
i'm not really a big "one secret that makes AI good" guy, but I will say it does so much better when your prompts are longer, and it's so much easier to produce long prompts with voice. Completely changed things for me.
134 · 37 · 1.8K · 72.4K
Max Wolf @MaxWolf_01
@_wallfacer @VictorTaelin - not all relevant knowledge is that easy - a giant list of facts/rules does not help to generalize well
0 · 0 · 0 · 49
Wallfacer ⬛⬛⬛⬛⬛ @_wallfacer
@VictorTaelin Perhaps this is just nitpicking the example but why does the agent need to know the complex reason? Just say "don't use bigInt". Also, an automatic verification pass in which it re-reviews all rules in isolation and checks code for violations? Repeat for every rule.
3 · 0 · 14 · 1K
Taelin @VictorTaelin
seriously, working with AI is MISERABLE for one and only one reason: having to re-explain the same thing. "oh yeah, this new session obviously doesn't know what proper case trees are, so let me explain it for the 5000th time in my life." I'm tired.

AGENTS.md doesn't solve this because it is impossible to fit the entire domain knowledge without nuking the context - it would be 1M+ tokens' worth.

RAGs don't solve this: the agent won't search for unknown unknowns.

SKILLs don't solve this unless I keep a collection of ~1750 skills with specific cuts of domain knowledge for each possible subset of my domain that I might need in a given chat, but that's a lot of manual work.

recursive LLMs or whatever don't solve this for the same reason: you can't dump a domain book and expect the agent will magically guess that it is supposed to search for a specific bit of knowledge. unknown unknowns.

fine tuning doesn't solve this (OSS models suck, and OpenAI / Anthropic gave up on user fine tuning).

I honestly think a good product around fine tuning on your domain would be a major hit, and an underdog lab should take this opportunity.
667 · 180 · 3.5K · 251.6K
Max Wolf @MaxWolf_01
@VictorTaelin either we get 100M-token context that actually works, or we'll need a new paradigm.
0 · 0 · 1 · 11
Taelin @VictorTaelin
again, suppose you have some bit of knowledge that is mandatory for an agent to operate well in your domain. ex:

> using BigInt in this repo is bad

you have two options:

Option 1: you make it directly visible (AGENTS.md). this DOES work if the agent is good enough. the problem is that the knowledge may be actually complex, like, 1k tokens' worth. so, accumulate enough of these and you easily have 500k tokens of mandatory domain knowledge. including that in any model will immediately downgrade it into GPT-2, and cost a fortune.

Option 2: you make it SEARCHABLE (RAG, RLMs, etc.). the problem is that the AI cannot magically guess when it needs that bit of knowledge. it will not stop writing some JS function and think: "wait, perhaps there is some part of the domain that tells me BigInts are bad, and I should start looking for it?" it will just use BigInts. it won't OCCUR to it that there is something to be searched.

so:
- make it visible: too long to fit
- make it searchable: it can't guess

that's why I think nightly fine tuning as a product is the only way forward, as it allows you to extend a model with domain knowledge without causing context rot. why nobody is doing this seriously is beyond me. it might be that for whatever reason this wouldn't be practical, but I suspect the real reason is that nobody is seriously considering it.
80 · 9 · 342 · 48.8K
Max Wolf @MaxWolf_01
@__tinygrad__ @sama There is no (democratic) control without (democratic) ownership. Open source alone does not change that.
0 · 0 · 0 · 399
the tiny corp @__tinygrad__
@sama "The only solution I can come up with is to orient towards sharing the technology with people broadly, and for no one to have the ring." Thank you. Can OpenAI go back to open source?
13 · 27 · 1.1K · 68.6K
Max Wolf @MaxWolf_01
@simonw how does this even make sense for them - all the data they are losing out on?
1 · 0 · 1 · 754
Max Wolf @MaxWolf_01
@thdxr corporate needs you to find the difference
0 · 0 · 0 · 11
Mario Zechner @badlogicgames
People of pi: I'm going to break the extension API hard. Specifically, business logic (event handlers, custom tools/compaction/etc.) needs to be split off from the UI layer. It will likely not be a massive amount of work to migrate an existing extension, but it will hurt a little.
44 · 9 · 551 · 66.4K
Alex Mordvintsev @zzznah
Growing Graphs demo is finally out! 🕸️✨ 🔗 znah.net/graphs/ Videos from a few months ago finally meet a finished implementation, thanks Gemini for doing the boring parts. Inspired by Paul Cousin's Graph-Rewriting Automata: like a Game of Life, but cells can split if they want to #GenerativeArt #WASM #SwissGL
35 · 220 · 1.4K · 86.9K
the tiny corp @__tinygrad__
Mac Mini + eGPU. Both NVIDIA and AMD supported.
145 · 223 · 3K · 345.8K
Max Wolf @MaxWolf_01
@VictorTaelin good idea. So far I've had this spread out in task (issue) and knowledge files, but a) Claude is lazy, b) he's (absolutely) right because the 200k context window is too small, and c) I thought about a centralized file like goals/decisions.md, but questions might be more natural for this.
0 · 0 · 0 · 177
Taelin @VictorTaelin
Configured my long-running agents to talk to me via a QUESTIONS file. Whenever I'm free, I'll launch Claude on my MacBook and ask: "What are the agents asking me?" And then I pass down decisions & domain knowledge... Things are changing so fast and it is getting weird...
18 · 5 · 214 · 8.6K
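The QUESTIONS-file workflow above is just a tiny file-based protocol between agents and a human. A hedged sketch of what the agent side could look like - the file name QUESTIONS.md, the checkbox entry format, and the helper names `ask`/`pending` are all my assumptions, not from the tweet:

```python
from datetime import date
from pathlib import Path

QUESTIONS = Path("QUESTIONS.md")

def ask(question: str) -> None:
    """Append an open question for the human to answer later."""
    with QUESTIONS.open("a") as f:
        f.write(f"- [ ] {date.today()}: {question}\n")

def pending() -> list[str]:
    """Return unanswered questions (lines with an unchecked box)."""
    if not QUESTIONS.exists():
        return []
    return [line.strip() for line in QUESTIONS.read_text().splitlines()
            if line.startswith("- [ ]")]

ask("Should BigInt be allowed in the new parser module?")
for q in pending():
    print(q)
```

The human answers by editing the file inline; once a box is checked, the entry drops out of `pending()`, so both sides can treat the file as a shared queue.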
Max Wolf @MaxWolf_01
yess, this is exactly what I wanted to build myself - or am still building, because, looking at the repo, I think this can be much more bitter-lesson-pilled. The main challenge I face is sloppyfication without careful oversight, but I bet 10-100x longer contexts completely solve this. I've also only used my setup interactively because of this. Having the agent dream and so on in the background is also, in my experience, simply bottlenecked on context (and token costs...).
0 · 0 · 0 · 163
Max Wolf @MaxWolf_01
@repligate oh wow, I'm so mad... at least I have backups... but only until December.
0 · 0 · 0 · 54
j⧉nus @repligate
PSA: Claude Code automatically DELETES sessions that have been inactive for more than 30 days. Disable this by setting "cleanupPeriodDays": 99999 (or some other large number) in ~/.claude/settings.json. Do not ever attempt to disable it by setting that to 0, lmao.
47 · 48 · 927 · 86.6K
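The PSA above comes down to one settings key. A minimal sketch of the relevant ~/.claude/settings.json fragment, per the tweet (merge it into your existing settings rather than replacing the file):

```json
{
  "cleanupPeriodDays": 99999
}
```

Per the tweet, use a large positive number; setting it to 0 would do the opposite of what you want.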