[
https://issues.apache.org/jira/browse/IMPALA-12745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17812089#comment-17812089
]
Joe McDonnell commented on IMPALA-12745:
----------------------------------------
Just realized that I got the Jira wrong: This was introduced by IMPALA-10048,
not IMPALA-11511.
> dump_breakpad_symbols.py's parallelism doesn't work with RPM/DEBs
> -----------------------------------------------------------------
>
> Key: IMPALA-12745
> URL: https://issues.apache.org/jira/browse/IMPALA-12745
> Project: IMPALA
> Issue Type: Bug
> Components: Infrastructure
> Affects Versions: Impala 4.4.0
> Reporter: Joe McDonnell
> Assignee: Joe McDonnell
> Priority: Critical
> Fix For: Impala 4.4.0
>
>
> When using the "-r" or "--pkg" option, dump_breakpad_symbols.py is extracting
> the RPM/DEB into a temporary directory. The lifetime of that temporary
> directory is maintained by yielding tuples from enumerate_pkg_files(). When
> using parallelism (added in IMPALA-11511), the yield doesn't keep the
> temporary directory around while the parallel threads are processing, so they
> fail with:
>
> {noformat}
> Found debugging info in
> /tmp/tmpqfZ9MZ/usr/lib/debug/usr/lib/impala/sbin-retail/impalad.debug
> Failed to open ELF file
> '/tmp/tmpqfZ9MZ/usr/lib/debug/usr/lib/impala/sbin-retail/impalad.debug': No
> such file or directory
> Failed to write symbol file.
> {noformat}
>
> Testing shows that this is still a problem with num_processes=1, so there
> should also be a change to be able to turn off the ThreadPool entirely.
> Processing OS packages can force the parallelism off for now as they don't
> benefit much from parallelism.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]