
Finn Puklowski
26 posts

Finn Puklowski
@FPuklowski
Chairman @ https://t.co/sVLdM1SsKR Building @ https://t.co/2EB7cIGTCv - the world's fastest ai neocloud - with no GPUs





We raised $15m to build the ASICs-first inference cloud. We're betting big on alternatives to GPUs, and the result is that we are already 5-8x faster on most models. Read more about General Compute on Tech Crunch! @FPuklowski @fastinference techcrunch.com/2026/05/28/has…



Agentic AI changed the conversation. It’s no longer just humans interacting with models. Now models are planning, reviewing, refining, & collaborating with other models to solve complex tasks. 🎧 @SumtiJairath breaks it down in this @dcdnews ep: podcasts.apple.com/us/podcast/epi…

This pricing is not arbitrary. As you move along the Pareto frontier to higher interactivities (faster tokens for your slop), you are able to serve fewer concurrent users (on GPGUs). Therefore the price of your hardware is amortized over fewer users and individual users must pay more.




