• Can I access host memory directly in OpenCL kernel with Arm cpu and mali gpu?

    liddk
    liddk

    I need a way to create an memory object which can be accessed parallelly both in CPU and GPU, this memory will be read only. How can I do it?

    • Answered
    • 2 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Questions About buffer/image use CL_MEM_USE_HOST_PTR flag

    ly700
    ly700

    Hi!when i was using CL_MEM_USE_HOST_PTR flag to create buffer or image, i hvae some questions about the driver's action. Hope for help.
    Q1: Why Copies the data pointed to by the host memory pointer into the buffer when the first kernel using this buffer…

    • 2 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Opencl Kernels overhead(queue time) in Mali-G72

    introspecs33
    introspecs33


    Hi,

    I am using OpenCL 2.0 in Mali-G72 based Android device and I am encountering a very large kernel queue/submit time overhead (CL_PROFILING_COMMAND_START - CL_PROFILING_COMMAND_QUEUED). it is sometimes 10X higher than the kernel execution time (CL_PROFILING_COMMAND_END…

    • Answered
    • 4 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Cache data invalidation between kernel calls in OpenCL

    Yury
    Yury

    Hello.

    Does L2 cache data invalidated after finishing each kernel invocation? If not the case, does invalidation depends on memory type (SVM, old buffers, mapped old buffers, etc)?

    Question related to Bifrost and Valhall architectures.

    • Answered
    • 5 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Strange profiling value are returned when running clGetEventProfilingInfo on G77

    gang.zheng
    gang.zheng
    I am running OpenCL program on Mali-G77 r0p1, I want to query profiling data with OpenCL program. Sample OpenCL program like below:

    /* Enqueue kernel */
    clEnqueueNDRangeKernel(queue, kernel, 1, NULL, &num_items,
    NULL, 0, NULL, &prof_event);
    if(err…
    • 7 months ago
    • High Performance Computing
    • HPC forum
  • The difference between clEnqueueSVMMap and clEnqueueMapBuffer

    willhua
    willhua

    Hi, Is there some difference between clEnqueueSVMMap for coarse SVM and clEnqueueMapBuffer in theory? If there is, will this cause difference about performance?

    • Answered
    • 8 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • about clImportMemoryARM

    Hxxfff
    Hxxfff

    1, clImportMemoryARM 可以导入HAL_PIXEL_FORMAT_YCrCb_420_SP (NV21格式的图)的GraphicBuffer吗?
    2, 支持cl_arm_import_memory_host的话, 任何host 端malloc出来的memory都可以 clImportMemoryARM吗, 有无约束?
    3, GraphicBuffer lock&lockYCbCr 出来的内存地址(CPU可用)可否用cl_arm_import_memory_host 来clImportMemoryARM…

    • 8 months ago
    • 中文社区
    • 中文社区论区
  • Register spilling for different threads count

    Yury
    Yury

    Hi. According to Arm Mali GPU Datasheet 2020.pdf document there are several modes for maximum thread count, for Mali G76 it is 2 such modes, 768 threads for 0-32 work registers, and 384 for 33-64 work registers.

    Is it possible that register spilling can…

    • Answered
    • 8 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • The efficient way to load and store memory in Mali G-72

    Tarun Annapareddy
    Tarun Annapareddy

    Hi,

    I am doing an image crop and writing to the destination. I am using a vector load and store of 8 uchar's. can someone help in optimizing this kernel . any mali G-72 gpu specific changes required?

    uchar* src_y : source pointer to Y data of the…

    • 10 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Buffer create taking 10 ms on mali G-72

    Tarun Annapareddy
    Tarun Annapareddy

    Hi,

    I am working on a video solution code. where I have to provide source image to GPU and do computation and write in the destination. I read that using buffer creates in the loop every time will add GPU overhead.so, I implemented the following. but…

    • Answered
    • 10 months ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • How to pass "-cl-std=CL2" option to malioc?

    ivn
    ivn

    I wan to build OpenCL 2.0 kernel (it uses OpenCL C 2.0 language in kernel code) using malioc (Mali Offline Compiler) .

    Here is my command line:

    malioc --name TestKernel --core Mali-G76 kernels.cl

    In my kernel code I check value of __OPENCL_VERSION__ and it…

    • Answered
    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Map/Unmap operations with Zero copy buffer.

    abhi.verma
    abhi.verma

    When we create physical zero copy buffers using cl_arm_import_memory, do we really need to perform map/unmap operations everytime we make changes to the buffer from the CPU side. Since, both GPU and CPU access the same memory, will not the changes propagate…

    • Answered
    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Optimised OpenCL SGEMM implementation for ARM Mali Midgard GPUs.

    abhi.verma
    abhi.verma

    I wish to implement an optimised sgemm for Mali MidGard Gpu whichas of now only support OpenCL 1.2.  As far as I know, OpenCL 1.2 doesn't support subgroup extensions and Mali GPUs don't have any benefits for local memory tiling. So What should be the best…

    • Answered
    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Irregular behaviour of vectors in OpenCL(1.2) kernels

    abhi.verma
    abhi.verma

    So, I am trying to perform some operation inside an OpenCL kernel. I have this buffer named filter which is a 3x3 matrix initialized with value 1.

    I pass this as an argument to the OpenCL kernel from the host side. The issue is when I try to fetch this…

    • Answered
    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Zero Copy Buffer Allocation on Arm Mali MidGard GPUs Opencl1.2

    abhi.verma
    abhi.verma

    I am trying to allocate a zero copy buffer on  Mali Midgard GPUs . The OpenCL 1.2 guide mentions that the only sure shot way to do this is to use the flag

    CL_MEM_ALLOC_HOST_PTR

    SO, First we need to allocate the Gpu memory using the flag and then perform…

    • Answered
    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • cl_khr_subgroups questions

    Yury
    Yury

    Hello,
    Since lack of local memory in Mali, I am trying to use subgroups as Intel does in clDNN library, although they have local memory but registers exchange even faster than local memory. I have three questions about subgroups in Bifrost and Valhall implementation…

    • Answered
    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • New online training course - Machine Learning using Arm

    NickT
    NickT

    We are very pleased to announce a new online training topic - Machine Learning using Arm.

    About the course

    This training topic covers essential information on Arm’s IP solutions for optimizing Machine Learning (ML) applications for Arm hardware. The…

    • over 1 year ago
    • Processors
    • Machine Learning IP blog
  • IWOCL - Enhanced Support for OpenCL Open Source Tools and Resources

    Anastasia Stulova
    Anastasia Stulova

    The LLVM project is an open source compiler framework that supports code-generation for many hardware platforms. Major platform vendors produce toolchains based on the LLVM Project due to its permissive free software license model as well as the modular…

    • over 1 year ago
    • Software Tools
    • Tools, Software and IDEs blog
  • Android 9 Mali G72 OpenCL library for C++ build

    hterrolle
    hterrolle

    hi,

    I was able to run JNI Opencl kernel on my mali T830 by coping the system/vendor/lib/libGLES_mali.so and libOpenCL.so to the JNI folder on android studio. of course we to load some library through

    System.loadLibrary(xxx).


    I hace done the same with my…
    • over 1 year ago
    • Open Source Software and Platforms
    • Android forum
  • Arm Compute Library 19.05 is coming!

    Gian Marco Iodice
    Gian Marco Iodice

    I know, I know… it’s been a while since I wrote my last Arm Compute Library blog, but I promise that your patience will be rewarded. There’s a whole host of freshly integrated functions, features and performance optimizations that I want to share with…

    • over 1 year ago
    • Graphics and Gaming
    • Graphics and Gaming blog
  • Is there another way to install opencl or to use arm compute library without opencl?

    teak
    teak

    I'd like to manipulate GPU to neural net.

    My environment is odroid-xu4 that constructed with exynos5422(cortex-15, cortex-7, mali-t628GPU) and TIZEN OS based linux kernel

    i googled about GPU, they recommended using OpenCL and Arm compute library 

    …
    • over 2 years ago
    • Open Source Software and Platforms
    • Machine Learning forum
  • NEON Driver

    SirineKOUADA
    SirineKOUADA

    Hi,

    I have seen the ARM webpage to download the NEON driver but I can't find it.

    In fact, I want to run OpenCl on ARM Cortex A but I can't because I need to install the NEON tool.

    Is this driver available for download?

    Thanks

    Sirine.

    • over 2 years ago
    • Open Source Software and Platforms
    • Machine Learning forum
  • OpenCL mode in ARM Streamline

    xinkuokuo
    xinkuokuo

    Hi,
    I find OpenCL mode in ARM DS-5 Streamline user guide, but can't get it when using it. I am using DS-5 5.27.1 and r7p0 driver.
    The user guide says "To enable OpenCL mode, you must create an instrumentation configuration file. For details of the…

    • over 3 years ago
    • Software Tools
    • Arm Development Studio forum
  • What is the specific structure of Mali-T880 on Helio X20?

    david
    david

    Hi,all

    I am now based on MediaTek's Helio X20 development Opencl algorithm, but it is not clear that the specific structure of the X20 mali-T880, including the number of shadercore, the size of the L1 cach, L2 cach size, etc.. Can you provide specific…

    • over 3 years ago
    • Graphics and Gaming
    • Graphics and Gaming forum
  • Where is the OpenCL SDK for Mali T604

    Tim Legrand
    Tim Legrand
    Note: This was originally posted on 30th January 2013 at http://forums.arm.com

    Hi,

    Where is the OpenCL SDK for Mali GPUs if available, and when will it be if not?

    Many thanks,
    Tim
    • over 7 years ago
    • Graphics and Gaming
    • Graphics and Gaming forum
>