This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

OpenCL support for Mali-T628 MP6 on Arndale Octa?

Summary

Is OpenCL support for the Mali-T628 (for example as found in the Exynos 5420 SoC on the Arndale Octa board) available? If so, how to set it up?

More details

According to the vendor, OpenCL should be supported, but the Arndale Octa Wiki does not state how this can be achieved.

I am using the latest Linaro developer build and installed Mali drivers that contain OpenCL libraries for Mali T604. According to this guide, the driver actually contains references to the Mali T628. So I tried to create the udev rule as specified, which is supposed to solve a permission problem with /dev/mali0, but I found that there is no /dev/mali0 on my installation at all. So my conclusion is that the driver indeed does not support T628.

When I execute a clinfo utility, clGetDeviceInfo returns CL_OUT_OF_HOST_MEMORY for some device properties. Why can I query the GPU for some characteristics, but does this fail for some others? When running a normal application, the same error appears when trying to create an OpenCL Context.

I was surprised to find this topic, where yoshi seems to have OpenCL working and can run benchmarks on his Arndale Octa board. How is this possible if there is no driver available? Or am I just missing something? I hope that you can help me to also establish a working OpenCL development environment.

Parents
  • Hi Chris,

    I put all files from the binary userspace driver in the /usr/lib/ directory and compiled just using: "g++ program.cpp -lOpenCL". This gives the same result as compiling like this: "g++ program.cpp /usr/lib/libOpenCL.so". The output is too long to show here.

    Compiling using "g++ clpeak-arndale-octa.cpp /usr/lib/libmali.so" works just fine: But now that I am further looking into the compilation, I noticed the following:

    # ldd a.out

      /usr/lib/libmali.so (0xb5e03000)

      libstdc++.so.6 => /usr/lib/arm-linux-gnueabihf/libstdc++.so.6 (0xb5d4a000)

      libm.so.6 => /lib/arm-linux-gnueabihf/libm.so.6 (0xb5cde000)

      libgcc_s.so.1 => /lib/arm-linux-gnueabihf/libgcc_s.so.1 (0xb5cbd000)

      libc.so.6 => /lib/arm-linux-gnueabihf/libc.so.6 (0xb5bd6000)

      /lib/ld-linux-armhf.so.3 (0xb6f6d000)

      librt.so.1 => /lib/arm-linux-gnueabihf/librt.so.1 (0xb5bc8000)

      libpthread.so.0 => /lib/arm-linux-gnueabihf/libpthread.so.0 (0xb5bac000)

      libdl.so.2 => /lib/arm-linux-gnueabihf/libdl.so.2 (0xb5ba1000)

    The resulting binary does not contain any reference to a OpenCL or Mali library. Does this mean that the program runs on the CPU instead of GPU? But then, why does it report the two Mali-T628 devices? I am confused.

Reply
  • Hi Chris,

    I put all files from the binary userspace driver in the /usr/lib/ directory and compiled just using: "g++ program.cpp -lOpenCL". This gives the same result as compiling like this: "g++ program.cpp /usr/lib/libOpenCL.so". The output is too long to show here.

    Compiling using "g++ clpeak-arndale-octa.cpp /usr/lib/libmali.so" works just fine: But now that I am further looking into the compilation, I noticed the following:

    # ldd a.out

      /usr/lib/libmali.so (0xb5e03000)

      libstdc++.so.6 => /usr/lib/arm-linux-gnueabihf/libstdc++.so.6 (0xb5d4a000)

      libm.so.6 => /lib/arm-linux-gnueabihf/libm.so.6 (0xb5cde000)

      libgcc_s.so.1 => /lib/arm-linux-gnueabihf/libgcc_s.so.1 (0xb5cbd000)

      libc.so.6 => /lib/arm-linux-gnueabihf/libc.so.6 (0xb5bd6000)

      /lib/ld-linux-armhf.so.3 (0xb6f6d000)

      librt.so.1 => /lib/arm-linux-gnueabihf/librt.so.1 (0xb5bc8000)

      libpthread.so.0 => /lib/arm-linux-gnueabihf/libpthread.so.0 (0xb5bac000)

      libdl.so.2 => /lib/arm-linux-gnueabihf/libdl.so.2 (0xb5ba1000)

    The resulting binary does not contain any reference to a OpenCL or Mali library. Does this mean that the program runs on the CPU instead of GPU? But then, why does it report the two Mali-T628 devices? I am confused.

Children
  • All the errors are undefined reference errors at link time, the symbols should be there so offhand not sure why that's happening, but in any case linking against libmali.so is working

    In your above output, ldd a.out is reporting /usr/lib/libmali.so on the first line, so thats working as expected.

    Hth,

    Chris

  • Whoops, I missed that line. In that case I can conclude the following:

    I am running a kernel containing the proper Mali kernel driver, my program is linked to the corresponding userspace binary and T628-MP6 is correctly recognized.

    Is it indeed DVFS that is preventing the benchmark to achieve the expected performance level? Why is /sys/class/misc/mali0/device/dvfs missing?

  • Hi Bramv,

    Taken shamelessly from an answer by peterharris in another thread:


    "The DVFS code for the GPU is not directly managed by our drivers - it is part of the platform integration provided in the BSP from Insignal. This style of integration occurs because the DVFS analogue parts which control F and V for the power domains are not part of the ARM IP. This question is probably best asked to Samsung or Insignal, as they maintain the BSP for that platform."

    It is possible to disable features such as DVFS by recompiling the linux kernel and mali kernel module with the correct configuration.  The reason this reduced performance (normally) happens is because DVFS ties the GPU frequencies to the workload of the CPU.  As you are running your intensive GPU test, the CPU is left to idle and so DVFS drops the CPU core speed, unfortunately, also dropping the GPU frequency.

    A means of stopping this happening would be to add some CPU intensive code to run whilst the GPU code is running to stop DVFS dropping the frequencies.

    Hope this Helps,

    Rich

  • With regards to the Linker errors, you should be able to fix the issue by linking against both mali and OpenCL.  OpenCL will provide the runtime linker target (even in the absence of mali) whilst mali will provide the symbols at compile time, stopping the errors in your shared paste.

    Hope this helps,

    Rich

  • Hi Rich,

    I just tried to run the GPU benchmark while the CPU was busy compiling a new kernel. There are however no significant differences in the results.

    To see whether I could make DVFS work, I enabled "CONFIG_MALI_MIDGARD_DVFS=y", but this results in undefined references while compiling:

      LD      init/built-in.o

    drivers/built-in.o: In function `mali_dvfs_update_asv':

    :(.text+0x517a8): undefined reference to `exynos_lot_id'

    :(.text+0x517ac): undefined reference to `exynos_lot_id'

    drivers/built-in.o: In function `mali_sysfs_show_asv':

    :(.text+0x51ab6): undefined reference to `exynos_asv_group_get'

    :(.text+0x51b2a): undefined reference to `exynos5420_is_g3d_mp6'

    :(.text+0x51b50): undefined reference to `exynos_lot_id'

    :(.text+0x51b58): undefined reference to `exynos_lot_id'

    drivers/built-in.o: In function `mali_dvfs_event_proc':

    :(.text+0x51fc8): undefined reference to `exynos_result_of_asv'

    :(.text+0x51fcc): undefined reference to `exynos_result_of_asv'

    make: *** [vmlinux] Error 1

    Are you using the same kernel configuration that Guillaume mentioned?

    ./scripts/kconfig/merge_config.sh linaro/configs/linaro-base.conf linaro/configs/distribution.conf linaro/configs/arndale_octa.conf linaro/configs/lt-arndale_octa.conf linaro/configs/mali-arndale-octa.conf
  • Hi bramv,

    I've not been able to look into this a huge amount for you unfortunately.

    I was able to compile your code and run it on an Odroid X-U3 which is running a very similar SoC and I didn't see performance numbers anywhere near what you see, they are somewhat better.  I can only assume this is a BSP issue, but I haven't had an Arndale Octa at hand to create my own BSP and test.

    I will try and investigate at this end when I get the opportunity ,

    Rich