Shimon Vainer retweeted

We initially cared about local LLMs, but KV caches appear in more than text.
So we also investigated OCTOPUS for autoregressive video and audio transformers.
Joint work with @VikramVoleti, Simon Donné, @esx2ve






















