Andreas Kurth retweeted

HEROv2's compiler makes software-managed memory hierarchies as easy to program as HW-managed caches. It can tile data structures & insert DMA transfers, achieving up to 90% of the performance of painstakingly hand-tuned code.
github.com/pulp-platform/…
arxiv.org/abs/2201.03861
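
For readers unfamiliar with software-managed memories, here is a rough sketch of the kind of hand-written tiling that such a compiler is meant to spare the programmer. It is not HEROv2 code or its API: `TILE_ELEMS`, `dma_copy`, and `scale_tiled` are hypothetical names, the local buffer only models a scratchpad, and `memcpy` stands in for a real DMA engine.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical tile size (in elements) that fits the modeled scratchpad. */
#define TILE_ELEMS 256

/* Stand-in for a DMA transfer. A real platform would program a DMA engine
 * here; memcpy is used only so that the sketch runs anywhere. */
static void dma_copy(void *dst, const void *src, size_t bytes) {
    memcpy(dst, src, bytes);
}

/* Scale n floats in "main memory" by factor, one tile at a time.
 * Each tile is copied into a small local buffer, processed there,
 * and copied back, as one would do by hand without cache hardware. */
void scale_tiled(float *data, size_t n, float factor) {
    static float local[TILE_ELEMS]; /* models software-managed local memory */

    assert(data != NULL || n == 0);

    for (size_t base = 0; base < n; base += TILE_ELEMS) {
        /* The last tile may be shorter than TILE_ELEMS. */
        size_t len = (n - base < TILE_ELEMS) ? (n - base) : TILE_ELEMS;

        dma_copy(local, data + base, len * sizeof(float)); /* copy in */
        for (size_t i = 0; i < len; i++) {
            local[i] *= factor;                            /* compute */
        }
        dma_copy(data + base, local, len * sizeof(float)); /* copy out */
    }
}
```

With a hardware-managed cache, the same computation would be a single loop over `data`; the tile bounds and the two copies per tile are the part the tweet says the compiler can generate.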
