This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Mali Graphics Debugger : libGLES_mgd.so makes crash application

Hello ARM Community,

I am trying to use the mali graphics debbuger on an android device Samsung Galaxy Tab 3 8" SM_T310 powered by an ARM gpu.

I've managed to install the library and mgddaemon on the device. But as soon as I change the egl.cfg file to use the mgd lib (which I had to rename libEGL_mgd.so to be found by my device)

all application I start crash after a black screen.

In the debugger, the connection with the deamon is ok. It manages to capture one frame, which I put the trace in attachment, before the device app crashes.

The tab is running under Android 4.2.2 and has been factory reset right before doing all the manipulations for mali graphics debugger.

The version of the Mali Graphics Debugger is 1.2.1.97319195.

Could anyone help me on this problem? Is it a known issue on this tab?

Thank you in advance!

CrashTrace.mgd.zip
Parents
  • Ok so the story so far is:

    In the symbolicLink.txt file we can see these 2 lines:

       

         D/libEGL  (27973): loaded /system/lib/egl/libEGL_mgd.so

         D/libEGL  (27973): loaded /system/lib/egl/libGLESv2_mgd.so

    so Android EGL is managing to find MGD when looking for libEGL and libGLESv2, good stuff. I can also see in the output that MGD attempts to and succeeds in opening libEGL_mali.so. This is triggered on the first call to an egl function, and from the AppSettingTrace file I can see a whole bunch of eglGetProcAddress calls which is the Android EGL layer making calls to the library to get common extensions. Good stuff, Android can call into the real libs through MGD and MGD is tracing these.

    I see no other activity for EGL or GLES, which implies that the application is making no other calls to either EGL or GLES through the Android EGL layer. This seems a bit weird.

    Are you sure they're symlinks and havent ended up being copied as 2 separate libraries to the directory? Can you make a symlink from libGLES_mgd.so to mgd.so and send me logcat and mgd trace for that? Having EGL load libGLES_*.so is the ideal way and the way MGD expects to be loaded, so we might be hitting a strange bug by not doing this. The fact that it seems to be creating a different socket to the daemon for EGL and GLES at present is ringing alarm bells for me, so I'll double check that.

    Hopefully we can get this working

    Thanks,

    Chris

Reply
  • Ok so the story so far is:

    In the symbolicLink.txt file we can see these 2 lines:

       

         D/libEGL  (27973): loaded /system/lib/egl/libEGL_mgd.so

         D/libEGL  (27973): loaded /system/lib/egl/libGLESv2_mgd.so

    so Android EGL is managing to find MGD when looking for libEGL and libGLESv2, good stuff. I can also see in the output that MGD attempts to and succeeds in opening libEGL_mali.so. This is triggered on the first call to an egl function, and from the AppSettingTrace file I can see a whole bunch of eglGetProcAddress calls which is the Android EGL layer making calls to the library to get common extensions. Good stuff, Android can call into the real libs through MGD and MGD is tracing these.

    I see no other activity for EGL or GLES, which implies that the application is making no other calls to either EGL or GLES through the Android EGL layer. This seems a bit weird.

    Are you sure they're symlinks and havent ended up being copied as 2 separate libraries to the directory? Can you make a symlink from libGLES_mgd.so to mgd.so and send me logcat and mgd trace for that? Having EGL load libGLES_*.so is the ideal way and the way MGD expects to be loaded, so we might be hitting a strange bug by not doing this. The fact that it seems to be creating a different socket to the daemon for EGL and GLES at present is ringing alarm bells for me, so I'll double check that.

    Hopefully we can get this working

    Thanks,

    Chris

Children
  • Hello Chris!

    I checked if symlinks was good and they seems ok to me. I've put screenshots in attachment so you can check it.

    I tried what you've suggested by adding a symlink for libGLES_mgd.so. But it results in no changes in the behaviour of the application.

    You will find in attachment the logcat and the mgd trace.

    Thank you!

    Ybe42

    Edit : Interesting fact! With the same configuration (same symlinks and egl.cfg), if I disable mgddaemon and close mgd on the host, the application managed to be launched. I've added the logcat in attachment (logcat-24-02-2014 - No mgddaemon.txt).

  • Hmm.. in the log without mgddaemon running, MGD appears to load libEGL_mali.so from /system/lib/egl, and libGLESv2_mali.so from /vendor/lib/egl. Can you confirm this is the case? Was it like this from stock or did you copy some things around?

    That still doesn't explain why it works without the daemon running however... But, here's what I have so far:

    1. Your version of Android doesn't appear to be attempting to load a monolithic library (libGLES_*.so) at all. It seems to only look for the non-monolithic libraries, libEGL, libGLESv2, and libGLESv1_CM. This explains why you got further when you renamed it to libEGL.so, and is likely the root cause of the problem. If you look at the Android source code, it always looks for the monolithic library first, but it looks like this path has been removed in your Android build.
    2. Symlinking libEGL and libGLESv2 to mgd.so fixes the above, and this works as long as the daemon is not running (i.e. passing straight through, no attempt to trace).
    3. With the mgddaemon running, the application seems to fall over before any GLES activity occurs, because in the log we never see any "Attempting to load libGLESv2" or similar from mgd_interceptor. The load should happen on the first call to a GLES API function. We do see activity for libEGL and get trace for that in the host when it is connected.
    4. When symlinking the non-monolithic libraries to mgd.so, there are 2 connections made from the interceptor to the daemon, one for each library.

    This all seems to stem from your platform not looking for a monolithic library at all, but debate is ongoing internally I notice that the mali driver does not use symlinks, and instead has proper shims, so we will probably have to do the same and provide some non-monolithic shims to solve this problem. This week is MWC and I don't have a Tab3 to test on at present, but we'll get this on the backlog and it should be fixed in a future release. For now unfortunately I don't think it will work on a system that does not load monolithic libraries.


    EDIT: Assuming you can build Android for this device, doing so might be an option, it would result in the standard Android EGL loading mechanism.

    Thanks,

    Chris

  • Hmm.. in the log without mgddaemon running, MGD appears to load libEGL_mali.so from /system/lib/egl, and libGLESv2_mali.so from /vendor/lib/egl. Can you confirm this is the case? Was it like this from stock or did you copy some things around?

    Strangely, no. I don't see any libGLESv2_mali.so in the vendor directory. I don't know how it can find it there. In attachment, the list of files in the vendor/lib directory.

    Well, I understand your explanation on libraries loading. It is weird that I am the first facing this problem, isn't it? Maybe Samsung has just change their kernel recently...(The tab was bought just a month ago)?

    So I'll wait till your next release hopping it will be fixed !

    In any case, thank you very much for your help and explanations! Don't hesitate to contact me if you have any new information on this problem or if you want me to test some procedure on the galaxy tab.

    Bye,

    Ybe42

  • Bit of an update, the core issue here is that recent ROMs for certain Samsung devices remove the ability to load monolithic libraries, and they instead only look for the fragmented libEGL_mgd, libGLESv2_mgd, and libGLESv1_CM_mgd libraries. The solution therefore is to create symlinks from these to libGLES_mgd and the problem should be solved. There are some separate issues on the Tab3 apparently but this has worked fine on everything else I've tested it on, so let us know if you hit troubles. These steps will be included in the user guide going forward also, so if you're getting here from google be sure to check that for up-to-date instructions and troubleshooting.