• Zero Copy Buffers using cl_arm_import_memory extension in OpenCL 1.2 - arm mali midgard GPUs.
    Hi, I wish to allocate a vector and use it's data pointer to allocate a zero copy buffer on the GPU. There is this cl_arm_import_memory extension which can be used to do this. But I am not sure wether...
  • Optimised GPU convolution for low memory integrated devices -such as arm processors /GPUs?
    I wish to implement convolution on arm mali GPUs and want it to be optimised for both speed and memory ? What's the best way to do this? GEMM based MCMK convolutions are not suited as they utilise a lot...
  • Irregular behaviour of vectors in OpenCL(1.2) kernels
    So, I am trying to perform some operation inside an OpenCL kernel. I have this buffer named filter which is a 3x3 matrix initialized with value 1. I pass this as an argument to the OpenCL kernel from...
  • Zero Copy Buffer Allocation on Arm Mali MidGard GPUs Opencl1.2
    I am trying to allocate a zero copy buffer on Mali Midgard GPUs . The OpenCL 1.2 guide mentions that the only sure shot way to do this is to use the flag CL_MEM_ALLOC_HOST_PTR SO, First we need to...
  • Can I access host memory directly in OpenCL kernel with Arm cpu and mali gpu?
    I need a way to create an memory object which can be accessed parallelly both in CPU and GPU, this memory will be read only. How can I do it?