Browse By Tags

  • Optimised OpenCL SGEMM implementation for ARM Mali Midgard GPUs.

    I wish to implement an optimised sgemm for Mali MidGard Gpu whichas of now only support OpenCL 1.2.  As far as I know, OpenCL 1.2 doesn't support subgroup extensions and Mali GPUs don't have any benefits for local memory tiling. So What should be the best…