tim

15.8K posts

tim banner
tim

tim

@NERDDISCO

dx @runpod ⚉ co-org @techeurope_ applied ai conf ⚉ building

germany Katılım Aralık 2011
719 Takip Edilen2.2K Takipçiler
Sabitlenmiş Tweet
tim
tim@NERDDISCO·
new AGENTS​.md --- this document exists for non-obvious, error-prone shortcomings in the codebase, the model, or the tooling that an agent cannot figure out by reading the code alone. no architecture overviews, file trees, build commands, or standard behavior. when you encounter something that belongs here, first consider whether a code change could eliminate it and suggest that to the user. only document it here if it can't be reasonably fixed. ---
tim@NERDDISCO

remove ~90% of toxic & costly context with this prompt: > remove everything from CLAUDE​.md/AGENTS.md that can be inferred from the codebase, including high-level architecture descriptions, file trees, cli usage, build commands, and examples of standard behavior. keep only non-obvious, failure-prone decisions and hidden constraints that are not explicit in the code but would cause mistakes if misunderstood. the final file should read like a sharp-edges and gotchas document, not a project overview i am currently doing this in all my projects and it feels sooo good thx for the awesome research @nielstron, @tibglo & rest of the team

English
0
0
4
682
tim
tim@NERDDISCO·
@0xSero awesome, let’s do that!
English
0
0
1
33
0xSero
0xSero@0xSero·
@NERDDISCO I definitely need help. The codebase is in a real mess so I need to slowly swap out components until it’s scalable. Maybe we can work on the vLLM piece together
English
2
0
3
154
0xSero
0xSero@0xSero·
I am going all in on vllm-studio, in the past my take was that if Claude can do it out then people should figure it out. I've also just been doing whatever comes to mind, but I am going to trim out most of the code and focus on a desktop electron app. Good UX coming soon
0xSero tweet media
English
17
3
169
8.6K
Andrey Cheptsov
Andrey Cheptsov@andrey_cheptsov·
@NERDDISCO I bet you are behind the scene reading the messages and clicking buttons)
English
1
0
1
16
Cris Lenta
Cris Lenta@crislenta·
😵 5 star hotel for a private AI hackathon by @supercell WE HAVE A PRIVATE CHEF > incredible breakfast > snacks, fruits, drinks > claude code credits 😂 > the goat @ipaananen in the house > vibe is off the charts THE FINNS ARE SETTING A NEW STANDARD
Cris Lenta tweet mediaCris Lenta tweet mediaCris Lenta tweet mediaCris Lenta tweet media
English
3
1
16
609
tim
tim@NERDDISCO·
@0xSero as it should be faster than llama.cpp?
English
1
0
1
166
0xSero
0xSero@0xSero·
I am going all in on Exllamav3 This is the middle ground between fast, performant, works on consumer cards, and intelligent. VLLM and Sglang are my go to but they're too finnicky below certain bits.
0xSero tweet media
English
12
3
144
8.4K
@levelsio
@levelsio@levelsio·
Okay let's see who can reply to this
English
2.5K
17
2.2K
1M
Prince Canuma
Prince Canuma@Prince_Canuma·
Just implemented Google’s TurboQuant in MLX and the results are wild! Needle-in-a-haystack using Qwen3.5-35B-A3B across 8.5K, 32.7K, and 64.2K context lengths: → 6/6 exact match at every quant level → TurboQuant 2.5-bit: 4.9x smaller KV cache → TurboQuant 3.5-bit: 3.8x smaller KV cache The best part: Zero accuracy loss compared to full KV cache.
Prince Canuma tweet media
Google Research@GoogleResearch

Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI

English
147
411
5.2K
713.6K
tim
tim@NERDDISCO·
1.0107 15.5mb ttt lr=0.0032, 12ep
English
0
0
0
25
tim
tim@NERDDISCO·
1.0516 15.7mb ttt lr=0.0008, 8ep, 8 blocks
English
1
0
0
51
Boris Cherny
Boris Cherny@bcherny·
Little known fact, the Anthropic Labs team (the team I joined Anthropic to be on) shipped: - MCP - Skills - Claude Desktop app - Claude Code It was just a few of us, shipping fast, trying to keep pace with what the model was capable of. Those early Desktop computer use prototypes, back in the Sonnet 3.6 days, felt clunky and slow. But it was easy to squint and imagine all the ways people might use it once it got really good. Fast forward to today. I am so excited to release full computer use in Cowork and Dispatch. Really excited to see what you do with it!
Claude@claudeai

You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only.

English
463
411
9.3K
986.3K
tim
tim@NERDDISCO·
@0xSero 👀👀👀
QME
0
0
0
211
0xSero
0xSero@0xSero·
For those interested in getting into local AI this is my most important video. youtu.be/Adliwsf2oPE
YouTube video
YouTube
English
13
34
420
29.8K