Kartikey ꩜
3.3K posts

Kartikey ꩜
@Kartikey____
Building Hardware 🚶➡️🧘♂️ 🌱




Inference Chips for Agent Workflows @sdianahu Most AI chips are designed for "prompt in, response out." Agents don't work that way. They loop, branch, and hold context across dozens of steps, and current GPUs hit 30–40% utilization as a result. That gap is where purpose-built silicon wins.


Inference Chips for Agent Workflows @sdianahu Most AI chips are designed for "prompt in, response out." Agents don't work that way. They loop, branch, and hold context across dozens of steps, and current GPUs hit 30–40% utilization as a result. That gap is where purpose-built silicon wins.


Did a very different format with @reinerpope – a blackboard lecture where he walks through how frontier LLMs are trained and served. It's shocking how much you can deduce about what the labs are doing from a handful of equations, public API prices, and some chalk. It’s a bit technical, but I encourage you to hang in there - it’s really worth it. There are less than a handful of people who understand the full stack of AI, from chip design to model architecture, as well as Reiner. It was a real delight to learn from him. Recommend watching this one on YouTube so you can see the chalkboard. – How batch size affects token cost and speed – How MoE models are laid out across GPU racks – How pipeline parallelism spreads model layers across racks – Why Ilya said, “As we now know, pipelining is not wise.” – Because of RL, models may be 100x over-trained beyond Chinchilla-optimal – Deducing long context memory costs from API pricing – Convergent evolution between neural nets and cryptography





May 5th will hence forth be known as International 555 Timer Day. May 5th at 5:55 to be precise.

A virtual online outreach session is being organised by BEL in association with iDEX-DIO/DDP-MoD, to disseminate information on BEL’s DRISHTI (DPSU-driven Research & Innovation for Strategic and High-impact Technology Integration) Challenges / Problem Statements. @DefProdnIndia

Robotics firms in Shenzhen by segment: - Control systems: ~3,600 - Cloud “brain”: ~3,000 - Energy & communications systems: ~1,700 - Actuation systems: ~600 - Sensing/perception systems: ~550 - Drive systems: ~420 - Structural systems: ~330 - Transmission systems: ~160 From the Shenzhen Robot Industry Development White Paper (2025) 深圳市机器人产业发展白皮书(2025年) m.thepaper.cn/newsDetail_for…







