Browse By Tags

Sorry, but there are no more tags available to filter with.
  • Optimised GPU convolution for low memory integrated devices -such as arm processors /GPUs?

    I wish to implement convolution on arm mali GPUs and want it to be optimised for both speed and memory ? What's the best way to do this? GEMM based MCMK convolutions are not suited as they utilise a lot of memory. Also, a direct implementation on GPU…