[ 
https://issues.apache.org/jira/browse/IMPALA-12643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17804431#comment-17804431
 ] 

ASF subversion and git services commented on IMPALA-12643:
----------------------------------------------------------

Commit dac7f409ba0619180835ea908349ac5108a58c3a in impala's branch 
refs/heads/master from Joe McDonnell
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=dac7f409b ]

IMPALA-12643 (part 2): Fallback to safe libraries on error in 
resolve_minidumps.py

Since resolve_minidumps.py's call to minidump_stackwalk can go haywire
due to bad symbols in shared libraries, this adds a fallback mechanism
where it tries again with a "safe" list of shared libraries. These are
limited to the ones that make the most difference in resolving minidumps
(libc, libstdc++, and libjvm). The list of safe libraries can be
customized via the --safe_library_list.

Testing:
 - Verified that this uses the fallback on Centos 7 and resolves
   the minidumps successfully.

Change-Id: I6bb4c9f65f9c27bb3b86c7ff2f3a6a48e258ef01
Reviewed-on: http://gerrit.cloudera.org:8080/20863
Reviewed-by: Michael Smith <[email protected]>
Tested-by: Joe McDonnell <[email protected]>


> resolve_minidumps.py's minidump_stackwalk call hits OOM on Centos7
> ------------------------------------------------------------------
>
>                 Key: IMPALA-12643
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12643
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Infrastructure
>    Affects Versions: Impala 4.4.0
>            Reporter: Joe McDonnell
>            Assignee: Joe McDonnell
>            Priority: Major
>
> When looking into fixing IMPALA-11909 to enable running resolve_minidumps.py 
> from finalize.sh, testing on Centos 7 showed that the minidump_stackwalk 
> invocation from resolve_minidumps.py went haywire and used all the system's 
> memory.
> resolve_minidumps.py is unique in processing all applicable system libraries. 
> If some system library had corrupt symbols, it could cause minidump_stackwalk 
> to get into an infinite loop.
> We should track down the offending library, but we should also run 
> minidump_stackwalk with a limit on its memory usage to avoid using all the 
> system memory.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to