Hi,
I am trying to use the texture cache for non-texture data (a binary tree structure) with OpenCL on a Mali-G76,
since L1 seems very slow for random access to the tree data.
As I understand it, data allocated with the OpenCL API clCreateImage is read through the texture cache.
So I put the tree data structures into an Image2D area allocated by that API.
This approach has significantly improved performance so far.
Currently I lay out the structures in the Image2D area in raster-scan order.
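
To make clear what I mean by raster-scan order, here is a minimal sketch in plain C. It is not my actual code: it assumes one tree node per texel, and the names texel_coord and raster_coord are made up for this post.

```c
/* Hypothetical helper type: a texel position inside the Image2D. */
typedef struct {
    unsigned x;
    unsigned y;
} texel_coord;

/* Raster-scan layout: consecutive node indices fill one image row
 * from left to right, then continue at the start of the next row.
 * Assumes one node per texel and width > 0. */
texel_coord raster_coord(unsigned index, unsigned width)
{
    texel_coord c;
    c.x = index % width;  /* column within the row */
    c.y = index / width;  /* row number */
    return c;
}
```

With this layout, nodes whose indices are close together sit next to each other in x, but nodes that differ by about one row width end up in different rows.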
In addition, someone suggested that I try Z-ordering to get more performance out of the texture cache (TC):
https://en.wikipedia.org/wiki/Z-order_curve
So my question is: can performance be improved by laying out the data structure in Z-order?
I have also heard that some GPUs support Z-ordering in hardware; in that case, Z-ordering in kernel code may be pointless or even make things worse.
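
For reference, the software Z-ordering I have in mind would look roughly like the sketch below, written in plain C. It is only an illustration under my own assumptions: one node per texel, node indices below 2^32, and the function names compact_even_bits, zorder_x and zorder_y are invented for this post. Whether this helps or hurts on Mali is exactly what I do not know.

```c
#include <stdint.h>

/* Collect bits 0, 2, 4, ... of v into the low 16 bits of the result. */
uint32_t compact_even_bits(uint32_t v)
{
    v &= 0x55555555u;                   /* keep even-numbered bits only */
    v = (v | (v >> 1)) & 0x33333333u;   /* pack into groups of 2 bits */
    v = (v | (v >> 2)) & 0x0F0F0F0Fu;   /* groups of 4 bits */
    v = (v | (v >> 4)) & 0x00FF00FFu;   /* groups of 8 bits */
    v = (v | (v >> 8)) & 0x0000FFFFu;   /* final 16-bit value */
    return v;
}

/* Z-order (Morton) layout: the even bits of the node index form x,
 * the odd bits form y, so indices 0,1,2,3 map to a 2x2 texel block. */
uint32_t zorder_x(uint32_t index)
{
    return compact_even_bits(index);
}

uint32_t zorder_y(uint32_t index)
{
    return compact_even_bits(index >> 1);
}
```

Here node indices 0 to 3 land on texels (0,0), (1,0), (0,1), (1,1), and indices 4 to 7 fill the neighbouring 2x2 block, so nearby indices stay close in both x and y.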
I could not find any Mali documentation on the texture cache, so any information about the TC would be really appreciated.
Regards