• Zero Copy Buffer Allocation on Arm Mali MidGard GPUs Opencl1.2
    I am trying to allocate a zero copy buffer on Mali Midgard GPUs . The OpenCL 1.2 guide mentions that the only sure shot way to do this is to use the flag CL_MEM_ALLOC_HOST_PTR SO, First we need to...
  • Irregular behaviour of vectors in OpenCL(1.2) kernels
    So, I am trying to perform some operation inside an OpenCL kernel. I have this buffer named filter which is a 3x3 matrix initialized with value 1. I pass this as an argument to the OpenCL kernel from...
  • Optimised OpenCL SGEMM implementation for ARM Mali Midgard GPUs.
    I wish to implement an optimised sgemm for Mali MidGard Gpu whichas of now only support OpenCL 1.2. As far as I know, OpenCL 1.2 doesn't support subgroup extensions and Mali GPUs don't have any benefits...
  • Map/Unmap operations with Zero copy buffer.
    When we create physical zero copy buffers using cl_arm_import_memory, do we really need to perform map/unmap operations everytime we make changes to the buffer from the CPU side. Since, both GPU and CPU...
  • Can I manually flush the GPU cache in OpenCL kernel code?
    Hi all, I am playing with Mali T624 and OpenCL. By playing the kernel-space midgard driver, I am now able to access some I/O memory in the OpenCL kernel. However, the I/O memory I am accessing is volatile...