
Every AI company should be optimizing for speed of inference.
It’s the difference between a 10fps game and a 60fps game. The same game feels horrible at 10 and amazing at 60.
It’s not just a quality-by-degrees situation, it’s transformative.
The same model feels 100x smarter and can get more done with realtime inference.