Todd Fisher

4.8K posts

@taf2

Living and working on https://t.co/eminYBe5tr

Maryland, US · Joined August 2008
463 Following · 329 Followers
@·
Cool
English
0
0
0
6
@·
@jxnlco You need screen or, for the kids, tmux
English
0
0
0
13
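A minimal sketch of the screen/tmux pattern the tweet is pointing at (session name and the `train.py` command are placeholders, not from the tweet):

```shell
# tmux: start a detached, named session so the job survives a disconnect
tmux new-session -d -s train 'python train.py'
# reattach later from any terminal
tmux attach -t train

# the screen equivalent
screen -dmS train python train.py
screen -r train
```

Either tool keeps the process alive when the SSH connection (or laptop lid) goes away, which is the whole point of running a long agent or training job inside one.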
@·
When you gotta bike home from work but your codex needs to finish a task.
English
203
59
1.8K
243.1K
@·
@GregKamradt So I ran a llm fine tune over the weekend and it took many hours - codex monitors the long running process and can even send text message updates via CTM or any sms capable service - it’s really good at babysitting a long running process
English
0
0
0
19
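The babysitting pattern described above can be sketched in a few lines. This is my own minimal illustration, not Codex's or CTM's actual mechanism; `babysit` and the `notify` callback are hypothetical names, and in practice `notify` would wrap whatever SMS-capable service you use:

```python
import subprocess
import time

def babysit(cmd, notify, poll_secs=60):
    """Launch a long-running command, call notify(msg) periodically while
    it is still running, and once more when it exits."""
    proc = subprocess.Popen(cmd)
    elapsed = 0
    while proc.poll() is None:
        time.sleep(poll_secs)
        if proc.poll() is None:  # still going after this poll interval
            elapsed += poll_secs
            notify(f"still running after {elapsed}s")
    notify(f"finished with exit code {proc.returncode}")
    return proc.returncode
```

Swapping `notify` for an HTTP call to an SMS gateway gives you the "text me when the fine-tune is done" behavior with no agent in the loop at all.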
@·
When OAI employees say, “I let codex run all night…” What framework they use? How do you set up the task so it has enough work for 8 hours?
English
296
17
1.8K
323.2K
@·
@0xSero Time to build a new thing
English
0
0
0
102
@·
Best use case while outside walking for local AI on my phone: a personal pocket heater.
English
0
0
0
7
@·
@vllm_project Cool. Does this version work well with Codex? Codex CLI was giving me connection errors when using vLLM to host the Qwen 3.6 27B model, due to Responses API compatibility issues.
English
1
0
2
568
@·
vLLM v0.20.0 is here! 752 commits from 320 contributors (123 new). 🎉 Highlights: DeepSeek V4, Hunyuan v3 preview support, CUDA 13 / PyTorch 2.11 / Transformers v5 baseline, FA4 as default MLA prefill, TurboQuant 2-bit KV (4× capacity), vLLM IR foundation. Thread 👇
[media attached]
English
21
78
666
65.7K
retweeted
@·
As requested, I also asked GPT-5.5 (low) and Opus 4.7 (high) to fix the same bug. GPT-5.5 (low) identified the correct root cause in 4m 14s and produced an almost identical fix in 2m 47s, using 164k tokens in total. Opus 4.7 (high) “churned” for 6m 23s, using 87.7k tokens, and went down a completely wrong path. It’s not even close, but that’s not a surprise.
@

I've completely changed my mind about 5.4 vs 5.5. Gave them the exact same task to investigate a fairly tricky bug. GPT-5.5 identified the bug and proposed a fix in 6m 59s using 117k tokens. GPT-5.4 took 8m 51s using 201k tokens, but it didn't find the bug and is asking for more information to investigate. Call me impressed.

English
90
88
2.1K
422.4K
@·
qwen models are good but the censorship in these models can really cause problems with refusals on text that might be mistaken as political. working on abliteration now to see if maybe these can be safer
English
0
0
0
8
@·
@LLMJunky Did neural net stuff in college 2001-2002 but considering where ai is now I can’t claim any real involvement until 2023, and even still I’d just consider myself a casual consumer of it
English
1
0
1
89
@·
How long have you been in AI? Where the OGs at? 👇
English
57
0
46
7.6K
@·
@lucyshow11 2 of ’em didn’t make it
English
0
0
0
14
@·
What’s up doc!!! 🤪
[media attached]
English
307
4K
38K
2.3M
@·
@raffichill Me too, me too. What a time to be alive
English
31
0
264
10K
@·
I let Codex use my computer today
English
4
0
158
11.9K
@·
@nateberkopec Hrm, so asking AI to ssh into production is still ok, right?
English
0
0
0
71
@·
If you're doing AI dev, you need to act like your system is rooted by North Korea. You cannot leave knives out in the kitchen, you cannot leave the passwords out on the counter. People are putting too much trust in alignment and not doing enough to "keep honest agents honest".
English
4
7
63
4.2K
retweeted
@·
PRO TIP: vLLM telling you to use `--enforce-eager` to avoid OOM because CUDA Graphs “don’t have enough VRAM”? Don’t jump straight to eager mode. Try this first:
- lower `--max-model-len`, e.g. 4k
- let the CUDA Graph compile (which will be cached by torch.compile)
- restart, then raise context back up
You can keep the CUDA Graph performance gains without hitting OOM
[media attached]
English
14
17
214
10K
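The retry sequence above can be sketched as CLI invocations. The model name is a placeholder; `--max-model-len` and `--enforce-eager` are real vLLM serve flags:

```shell
# OOM during CUDA Graph capture? Shrink the context window first.
vllm serve my-org/my-model --max-model-len 4096

# torch.compile caches the compiled graphs; restart with the full window.
vllm serve my-org/my-model --max-model-len 32768

# Last resort only: eager mode avoids graph capture but costs throughput.
vllm serve my-org/my-model --enforce-eager
```

The idea is that graph capture is the memory spike; once the compiled graphs are cached, the larger context often fits where the first cold start did not.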
@·
They used my code to create 5.5 - I’m using it to replace itself
English
0
0
1
10
@·
@davidasinclair My dad traded his car for an Apple II!
English
0
0
1
45
@·
Doing data privacy cleanup using the new OpenAI model and set codex up with an sms notify script via CTM to give me status updates … codex session has been running for 20+ minutes, just cooking while I do other things
[media attached]
English
0
0
0
17
@·
@devops_nk do it again for the version that never sleeps 🤣
English
0
0
1
71
@·
AWS Lambda in 40 seconds.
English
52
208
3.1K
199.3K
@·
@TheAhmadOsman wait a second are you saying i can troll claude by mentioning HERMES.md ? 🤣
English
1
0
6
1.5K
@·
Anthropic is not a serious company lmao
@

THIS GUY LOST $200 IN ONE DAY BECAUSE THE STRING "HERMES.md" WAS IN HIS GIT COMMITS

HERMES.md is a real convention used in AI agent projects. It's a system prompt specification file, not some obscure edge case.

He's on Claude Max 20x at $200 a month. Yesterday Claude Code hit him with "you're out of extra usage" out of nowhere. His dashboard showed 13% weekly usage, 0% current session. 86% of his plan was sitting there untouched, but $200.98 in extra usage had already burned through what should have been covered by his subscription.

He tried logout & login, different models, fresh installs, and nothing worked. Anthropic support sent the AI bot (four rounds of the same scripted response). Eventually they just gave up on him.

So he started binary searching repos and commits manually on his own time until he found the trigger: the string "HERMES.md" in a recent git commit message. Uppercase, with the .md extension, anywhere in your commit history. That's it. Claude Code includes recent commits in its system prompt, and something server side flags HERMES.md and quietly routes you off your Max plan onto API rate billing.

> AGENTS.md? fine
> README.md? fine
> HERMES without .md? fine
> lowercase hermes.md? fine
> uppercase HERMES.md? you're getting charged API rates

He reported it. Anthropic support acknowledged the bug three times, called it an "authentication routing issue", thanked him for finding it, then refused to refund the $200.

So the man pays $200 a month for Max, lost another $200 to a billing bug they confirmed, did Anthropic's QA work for free on his weekend, and got a "thank you for your patience" in return.

Check your commit history before Claude Code quietly drains your account too.

English
26
41
913
121.4K