Hi,
I am trying to use the texture cache for non-texture data (a binary tree structure) with OpenCL on Mali-G76,
because the L1 cache seems very slow for random accesses to the tree data.
As I understand it, data allocated with the OpenCL API clCreateImage is read through the texture cache.
So I put the tree data structures into an Image2D area allocated by that API.
This approach has significantly improved performance so far.
Currently I store the structures in the Image2D area in raster-scan order.
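To make the raster-scan placement concrete, here is a minimal sketch in plain C. It assumes one tree node per texel and an image width chosen by the host; the names texel_coord and raster_coord are hypothetical and only for illustration.

```c
#include <stdint.h>

/* Hypothetical type: the (x, y) texel position of one tree node. */
typedef struct {
    uint32_t x;
    uint32_t y;
} texel_coord;

/* Map a linear node index to image coordinates in raster-scan
 * (row-major) order. `width` is the image width in texels and is
 * an assumption of this sketch (one node per texel). */
static texel_coord raster_coord(uint32_t index, uint32_t width)
{
    texel_coord c;
    c.x = index % width;  /* column within the row */
    c.y = index / width;  /* row number */
    return c;
}
```

In a kernel, the resulting (x, y) would be the integer coordinate passed to an image read such as read_imageui.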
In addition, someone suggested that I try z-ordering for better TC performance:
https://en.wikipedia.org/wiki/Z-order_curve
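For reference, z-ordering as described on that page interleaves the bits of x and y. Below is a minimal sketch in plain C, assuming coordinates of at most 16 bits each; the function names are hypothetical. It only shows the index mapping and says nothing about whether it helps on Mali-G76.

```c
#include <stdint.h>

/* Spread the low 16 bits of v so they occupy the even bit positions. */
static uint32_t part1by1(uint32_t v)
{
    v &= 0x0000FFFFu;
    v = (v | (v << 8)) & 0x00FF00FFu;
    v = (v | (v << 4)) & 0x0F0F0F0Fu;
    v = (v | (v << 2)) & 0x33333333u;
    v = (v | (v << 1)) & 0x55555555u;
    return v;
}

/* Inverse of part1by1: gather the even bits back into the low 16 bits. */
static uint32_t compact1by1(uint32_t v)
{
    v &= 0x55555555u;
    v = (v | (v >> 1)) & 0x33333333u;
    v = (v | (v >> 2)) & 0x0F0F0F0Fu;
    v = (v | (v >> 4)) & 0x00FF00FFu;
    v = (v | (v >> 8)) & 0x0000FFFFu;
    return v;
}

/* Z-order (Morton) code: x bits in even positions, y bits in odd ones. */
static uint32_t morton_encode(uint32_t x, uint32_t y)
{
    return part1by1(x) | (part1by1(y) << 1);
}

/* Recover the texel coordinates from a z-order code. */
static uint32_t morton_decode_x(uint32_t code) { return compact1by1(code); }
static uint32_t morton_decode_y(uint32_t code) { return compact1by1(code >> 1); }
```

With this mapping, node i would be stored at (morton_decode_x(i), morton_decode_y(i)) instead of at its raster-scan position, so nodes with nearby indices fall into small square blocks of the image.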
So my question is: can performance be improved further by laying out the data structure in z-order?
I have also heard that some GPUs support z-ordering in hardware; in that case, z-ordering in kernel code may be pointless or even make things worse.
I could not find any Mali documentation on the texture cache, so any information about the TC would be really appreciated.
Regards