I wish to implement an optimised sgemm for Mali MidGard Gpu whichas of now only support OpenCL 1.2. As far as I know, OpenCL 1.2 doesn't support subgroup extensions and Mali GPUs don't have any benefits for local memory tiling. So What should be the best…