The Practical Stack

6.2K posts

@thepractstack

Most engineering content is noise. We filter for what matters:
- Backend Architecture
- System Design
- AI in Production
Real decisions. Real trade-offs.

Canada · Joined December 2012
82 Following · 1.1K Followers
The Practical Stack @thepractstack
Most engineers use LLMs daily but can't explain what happens between prompt and response. Here's the full pipeline — 8 concepts, one image. Save this. 🔁 Tokenization → Embeddings → Attention → Transformer → KV Cache → Sampling → Context Window → Inference Optimisation
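Of the eight stages listed, sampling is the easiest to show in a few lines. The sketch below is only an illustration and is not taken from the post's image: it assumes the model has already produced a list of raw scores (logits), one per candidate token, and the function names `softmax` and `sample_token` are hypothetical.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def sample_token(logits, temperature=1.0, rng=random):
    """Pick one token index at random, weighted by its softmax probability."""
    probs = softmax(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

Calling `sample_token([2.0, 1.0, 0.1], temperature=0.5)` favours index 0 more strongly than the default temperature of 1.0 does, which is the usual trade between predictable and varied output.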
The Practical Stack @thepractstack
Pick your cache strategy based on what you’re willing to lose:
Freshness
Latency
Durability
Everything else is implementation detail.
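One way to see the latency versus durability trade is to put write-through next to write-behind. This is a minimal sketch under stated assumptions: a plain `dict` stands in for the backing store, and the class names `WriteThroughCache` and `WriteBehindCache` are hypothetical, not from the post.

```python
class WriteThroughCache:
    """Writes reach the store before put() returns: slower writes, nothing lost on a crash."""

    def __init__(self, store):
        self.store = store  # assumed backing store, here just a dict
        self.cache = {}

    def put(self, key, value):
        self.store[key] = value  # synchronous write to the store
        self.cache[key] = value


class WriteBehindCache:
    """Writes are buffered and flushed later: faster writes, buffered data lost on a crash."""

    def __init__(self, store):
        self.store = store
        self.cache = {}
        self.pending = {}  # writes not yet persisted

    def put(self, key, value):
        self.cache[key] = value
        self.pending[key] = value  # store write is deferred

    def flush(self):
        self.store.update(self.pending)
        self.pending.clear()
```

Anything still sitting in `pending` when the process dies is the durability being given up in exchange for faster writes.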
The Practical Stack reposted
The Practical Stack @thepractstack
Most engineers pick a caching strategy based on what they've used before, not what the system actually needs. 8 strategies. Different tradeoffs. Cache-aside ≠ always the answer. 🧵 Which one do you default to? #DistributedSystems #SystemDesign
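For reference, cache-aside in its simplest form might look like the sketch below. It assumes a `dict` as the backing store and invalidation on write; the `CacheAside` class and its `misses` counter are hypothetical names used only for illustration.

```python
class CacheAside:
    """The application checks the cache first and loads from the store only on a miss."""

    def __init__(self, store):
        self.store = store  # assumed backing store, here just a dict
        self.cache = {}
        self.misses = 0

    def read(self, key):
        if key in self.cache:
            return self.cache[key]  # hit: no store access
        self.misses += 1
        value = self.store[key]  # miss: load from the store
        self.cache[key] = value  # populate the cache for later reads
        return value

    def write(self, key, value):
        self.store[key] = value
        self.cache.pop(key, None)  # invalidate so the next read reloads
```

The first read of each key always pays the store's latency, and every write forces a fresh miss, which is one reason the pattern is not always the right default.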
The Practical Stack @thepractstack
Tier-1 bank: sub-5ms p99 at 1.6M msg/sec with strict ordering intact. The unlock wasn't just tuning. Multi-DC replica placement done right — skip this and tail latency spikes under load. The Practical Stack 👇 thepracticalstack461.substack.com
The Practical Stack @thepractstack
@ppetryszen Cleaner primitives help, but the real question is how this behaves under pressure. Scheduling looks simple—until resource contention forces tradeoffs.