Hi,
I got new xiaomi 13T pro with Mali-G715-Immortalis MC11 r1p2 and méditek 9200+.
And the speed of my program is 20% less than on my old hauwei honnor play with mali G72 MC24.
I added
#pragma OPENCL EXTENSION cl_khr_priority_hints : enable
And this improved the speed from 140ms to 110 ms but still superior to the 80ms of the hauwei.Does using CL_Buffer with Mali-G715 could drop the speed.
Without knowing what you are doing, it's going to be very hard to provide any specific advice. Have you profiled both platforms with our Streamline profiler? It's free-of-charge as part of Arm Performance Studio.
Kind regards, Pete
thanks for the information. I run streamline but i got this error.
Could not initialize class com.arm.streamline.jni.elfdwarf.ElfDwarfParser java.lang.NoClassDefFoundError: Could not initialize class com.arm.streamline.jni.elfdwarf.ElfDwarfParser at com.arm.streamline.analysis.elfdwarf.ElfDwarf.isProcessingNeeded(ElfDwarf.java:102) at com.arm.streamline.analysis.session.SessionProcessor.produceReport(SessionProcessor.java:486) at com.arm.streamline.capture.apc.APCCapture.lambda$17(APCCapture.java:366) at com.arm.streamline.capture.apc.APCCapture.doIfValidCaptureSettings(APCCapture.java:430) at com.arm.streamline.capture.apc.APCCapture.analyze(APCCapture.java:339) at com.arm.streamline.live.LiveCaptureUiUtils.lambda$1(LiveCaptureUiUtils.java:59) at com.arm.streamline.common.utility.Task.run(Task.java:291) at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:572) at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:317) at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642) at java.base/java.lang.Thread.run(Thread.java:1583) Caused by: java.lang.ExceptionInInitializerError: Exception java.lang.AssertionError: Failed to load Streamline JNI lib [in thread "Main thread for report"] at com.arm.streamline.jni.elfdwarf.ElfDwarfParser.<clinit>(Unknown Source) ... 13 more
I was not able to execute those command line.
1. Run the following command on the device:setprop dalvik.vm.dex2oat-flags --no-strip-symbols2. Re-install the APK file3. To verify the options for dex2oat are set correctly, run the command:getprop dalvik.vm.dex2oat-flags4. To check whether DEX files contain .debug_* sections, you can use the GNU tools readelfcommand, for example:readelf -S .../images/*.dex
the problem of NoClassDefFoundErro is because i use windows7.
But i could see the graphics. And xiaomi only use 4 core using OpenCL and all the core using only OpenGL.
That could be the problem ?
If you are only seeing 4 cores used for OpenCL that could certainly explain it. This sounds like a customization from the OEM outside of our standard driver, so you might be able to find out more on the OEM forums.
i made a new test and if i use OpenCL and OpenGL in the same APK. xiaomi use only 4 core and the kernel became even slower.
and where is the OEM forum ;))
It is possible to install your standart driver at the place of the xiaomi driver. Can i do something except buying another phonne for testing.
And may be that you could what is the best phonne to run OpenCL. ;))
It is definitly not a problem of OEM. It is a problem with android toolchain and nkd version. It is the mess.
I managed to compile android APK in 64bit under the xiaomi 13T pro and all the core are running with OpenCL. I can get good performance nearly the same as the hauwei honnor play.
But honnor play is still more stable in time processing. The xiaomi is very fluctuing from 35 ms to 80 ms from one frame to another.