"SLIDE doesn’t need GPUs because it takes a fundamentally different approach to deep learning. The standard “back-propagation” training technique for deep neural networks requires matrix multiplication, an ideal workload for GPUs. With SLIDE, Shrivastava, Chen and Medini turned neural network training into a search problem that could instead be solved with hash tables"
This seems to rely on an adaptive access pattern that GPUs don't currently support but could be made to. I suspect the next generation of TPUs will add support for it, with GPUs following close behind.
No, their approach changes the fundamental access pattern into something anathema to GPU and TPU architectures.
In ELI5 or layman's terms: current GPU/TPU accelerators are specialized for doing very regular, predictable calculations very fast. In deep learning, many of those calculations are unnecessary, like multiplying by zero. This approach exploits that and performs only the minimal necessary calculations, but doing so makes the workload very irregular and unpredictable. Regular CPUs are better suited to that kind of irregular work, because most general-purpose software behaves the same way.
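To make the contrast concrete, here is a rough sketch (not the paper's actual C++/OpenMP implementation), where active_ids is the candidate set a hash-table lookup like the one above would return:

```python
import numpy as np

def dense_forward(W, x):
    # What a GPU is great at: one big, regular matrix multiply,
    # even though most outputs may be zeroed out by the activation.
    return np.maximum(W @ x, 0.0)

def sparse_forward(W, x, active_ids):
    # The SLIDE-style alternative: compute only the few neurons the
    # hash lookup flagged as likely-active. The gather W[active_ids]
    # is an irregular, data-dependent access pattern -- cheap on a CPU
    # with large caches, hostile to GPU/TPU pipelines built around
    # uniform, predictable memory access.
    out = np.zeros(W.shape[0])
    out[active_ids] = np.maximum(W[active_ids] @ x, 0.0)
    return out
```

The win comes entirely from skipping work: if only a few percent of neurons are active per input, the sparse path does a few percent of the FLOPs, at the cost of exactly the irregularity described above.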
"SLIDE doesn’t need GPUs because it takes a fundamentally different approach to deep learning. The standard “back-propagation” training technique for deep neural networks requires matrix multiplication, an ideal workload for GPUs. With SLIDE, Shrivastava, Chen and Medini turned neural network training into a search problem that could instead be solved with hash tables"
https://insidehpc.com/2020/03/slide-algorithm-for-training-d...