I try to printf logs within the kernel of opencl, but it does not work. And then I try the example code of <program guide of opencl>, it does not work too. What I did wrong? please give me some suggestions, the example code are as follows:
#include…
I try to printf logs within the kernel of opencl, but it does not work. And then I try the example code of <program guide of opencl>, it does not work too. What I did wrong? please give me some suggestions, the example code are as follows:
#include…
Hello, I am jang-hyun Son, from seoul korea
I am now studying OpenCL using Mali GPU Midgard, and I have a question.
In Mali GPU OpenCL Development guide, I found a concept, Shader core group.
I know that Shader core is equal to OpenCL Compute unit, and There…

Hi, I am jang-hyun Son, a college student from South Korea.
I will study Mali GPU with OpenCL, and before that, I am now studying about the Mali G71 architecture.
My question is that, Why are there multiple(actually 4) L2 Caches in the image above.
…I am trying to use Huawei Mate9 to test ARM compute library, since it has been mentioned in the blog. However, I cannot run any OpenCL apps. I was wondering if OpenCL is really supported on Huawei Mate9. Do I need to modify the configuration to enable…
Hi
It's me again!! I've moved on from compiling via scons, sorting out the libOpenCL stub libs and running the examples. The inacantation shown in the documentation under Building for linux-> How to manually build the examples has some issues that I…
Hi,
I'm trying to run arm_compute_benchmark on an Android phone with mali GPU supported, follow the steps provided in https://arm-software.github.io/ComputeLibrary/latest/tests.xhtml#writing_tests.
But get following error:
WARNING: linker: /data/local…
Is there any way to get an offline asm dump for OpenCL kernels?
We have a convolution shader generator that we use to generate for everything from iOS Metal to Android OpenCL.
The kernels work with reasonable perf on NVidia, AMD, Intel, Apple and Adreno…
The Mali-G71 GPU is the latest and greatest offering in the Mali high-performance family of GPUs. Built on the brand new Bifrost architecture, Mali-G71 represents a whole new level of high-end mobile graphics capabilities whilst still maintaining Mali…
Hi folks,
I'm trying to use OpenCL the above mentioned platform and am getting the following error message:
clBuildProgram is missing from OpenCL shared library.
The OS is Ubuntu 18.04 LTS, and the package providing the OpenCL library is:
libmali-rk…
If you’d like to develop your Convolutional Neural Networks using just the Compute Library and a Raspberry Pi, this step-by-step guide will show you how… and it comes complete with all the tools you’ll need to get up and running.
If…
I am trying to optimize my OpenCL code running on a Samsung Galaxy S7 (SM-G930F) with an Exynos Octa 8890 and a Mali-T880 using Android. According to OpenCL-Z the gpu supports OpenCL 1.2 full profile. I have a lot of OpenCL code correctly compiling and…
Hi :
I use RK3288 SoC and force the Mali T760MP4 work at 600Mhz. "clpeak" program from git hub is used for testing performance. "clpeak" always show mali works at 200Mhz not 600Mhz.
(1) OS is TinkerOS_Debian V1.8. It can download from…
Hi all:
I use RK3288 with Linux 4.4 and opencl program running on it. I saw some error message print of the console.
[ 6247.837157] mali ffa30000.gpu: JS: Job Hard-Stopped (took more than 50 ticks at 100 ms/tick)
[ 6247.845711] mali ffa30000.gpu: JS: Job…
Hi all:
I use a RK3288 with mali T760 MP4 board. T760 has 16 shader core.
(1) Is there 64 shader core (16*4) in mali T760 MP4?
(2) Does one opencl work-group use more than 1 shader core ?
Thank you
-Jack
Hi all:
I got the following result by compiling a opencl program. I do not know what is "work registers" and "uniform registers". Mali T600 a register bank that contains 1024 registers. Are the 1024 registers the sum of "wor resigters…
Hi folks!
Here we are! For the first hands-on guide of the new Computer Vision and Machine Learning software library developed at Arm: Compute Library!
Compute Library is a rich collection of functions for image processing, computer vision and machine…
I'am trying use opencl to replace some matrix multiplication and vector compute. but the gpu always slower cpu 2--4 times.
In cpu we use neon simd,in gpu i also use vector like float4,float16 .
I do those test on MT6753(ARM-A53 @ 1.5GHz,Mali T720…
I wonder how portable is OpenCL code. If I write something for T-628 MP6 will it run on T-880 etc.? For sure min. OpenCL standard must be supported by HW but what about possibly different GPU architecture that may make some code unique on one GPU but…
Hi everyone,
I'm using OpenCL on an Exynos 8890 Octacore CPU with ARM Mali-T880 MP12 GPU (Samsung S7 edge). And it is taking a high overhead when creating a buffer from the call clCreateBuffer. I'd like to know more about this issue. Is anything related…
Hi,
I'm working an an Android image processing app which uses OpenCL on Mali GPU.
I have a problem where I get a random seg faults on clEnqueueUnmapMemObject or a following clReleaseMemObject
The program just seg faults and I can't catch any of the OpenCL…
I was working on an Android app which uses OpenCL on the Galaxy Note 5 which has the Mali T-760 GPU. I pulled the library from the device and got the headers from the Mali OpenCL SDK and everything worked.
I'm trying to build and run the same app on the…
I'm currently porting vision algorithms to OpenCL that is specifically target for the Mali T800 gpus. For this particular problem I'm running on the T-880 series.
I have several contiguous buffers of sizes 512x512 * (1, 2 and 4).
After the four…
I'm trying to run the parboil benchmark suite on a Mali-T628 using the Mali OpenCL SDK.
I created Makefile.conf with the following values:
OPENCL_PATH =/PATH/TO/Mali_OpenCL_SDK_V1.1.0
OPENCL_INC_PATH = $(OPENCL_PATH)/include
OPENCL_LIB_PATH = $(OPENCL_PATH…
Hi,
I'm trying to profile GPU utilization of the Mali T628 on an Odroid-XU4 board using Streamline. When I run some graphics workloads, sometime I can see "GPU Fragment Activity", "GPU Vertex Compute" and "GPU Vertex-Tiling-Compute Activity" adds up…
I'm trying to run an OpenCL kernel multiple times, in sequence. When doing this I cannot get a consistent timing between the first execution and every other execution of the kernel. Each time the kernel is run I reset the kernel arguments and call clEnqueueNDRangekernel…