[jira] [Commented] (IMPALA-12564) TSAN test run fails to start Hive Server on Ubuntu 22.04

2024-04-10 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17835885#comment-17835885
 ] 

ASF subversion and git services commented on IMPALA-12564:
--

Commit 408c119f7da022e2f6de2df2b5af4546eefdb5bc in impala's branch 
refs/heads/master from Laszlo Gaal
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=408c119f7 ]

IMPALA-12564: Prevent Hive loading libfesupport.so in the minicluster during 
TSAN runs

During TSAN runs all Impala binaries (including libfesupport.so) are
built with TSAN options, which include a reference to the external
symbol __tsan_init. This causes a problem for libfesupport.so when it is
loaded into Hive during minicluster startup, because the Java VM running
Hive's code cannot supply this symbol (the stock JVM is obviously not
built with TSAN).

Unfortunately this symbol resolution failure causes Hive's JVM simply to
abort on Red Hat 8 (or later) and on Ubuntu 20.04 (or later).
On earlier versions of the same platforms the JVM turned the same
failure into an UnsatisfiedLinkError exception, which is actually
handled by Hive.

This patch prevents libfesupport.so from being loaded into Hive for TSAN
runs so that the minicluster can actually be started. This is achieved
by not adding the directory containing libfesupport.so to
JAVA_LIBRARY_PATH, preventing the JVM from finding it.

Change-Id: Ie030d9876c297d6e9dae80eba37e525ee2bccb20
Reviewed-on: http://gerrit.cloudera.org:8080/21191
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> TSAN test run fails to start Hive Server on Ubuntu 22.04
> 
>
> Key: IMPALA-12564
> URL: https://issues.apache.org/jira/browse/IMPALA-12564
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Laszlo Gaal
>Priority: Major
>
> Hive Server startup hangs during ARM TSAN test runs. Logs were not 
> particularly illuminating.
> hive-server2.out shows
> {code}
> /usr/lib/jvm/java-8-openjdk-amd64/bin/java: symbol lookup error: 
> /home/michael/Impala/be/build/debug/service/libfesupport.so: undefined 
> symbol: __tsan_init
> {code}
> Can work around it locally by commenting out 
> https://github.com/apache/impala/blob/4.3.0/testdata/bin/run-hive-server.sh#L140-L146.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-12564) TSAN test run fails to start Hive Server on Ubuntu 22.04

2024-03-22 Thread Laszlo Gaal (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829995#comment-17829995
 ] 

Laszlo Gaal commented on IMPALA-12564:
--

https://gerrit.cloudera.org/c/21191

> TSAN test run fails to start Hive Server on Ubuntu 22.04
> 
>
> Key: IMPALA-12564
> URL: https://issues.apache.org/jira/browse/IMPALA-12564
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Laszlo Gaal
>Priority: Major
>
> Hive Server startup hangs during ARM TSAN test runs. Logs were not 
> particularly illuminating.
> hive-server2.out shows
> {code}
> /usr/lib/jvm/java-8-openjdk-amd64/bin/java: symbol lookup error: 
> /home/michael/Impala/be/build/debug/service/libfesupport.so: undefined 
> symbol: __tsan_init
> {code}
> Can work around it locally by commenting out 
> https://github.com/apache/impala/blob/4.3.0/testdata/bin/run-hive-server.sh#L140-L146.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-12564) TSAN test run fails to start Hive Server on Ubuntu 22.04

2024-03-20 Thread Laszlo Gaal (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829302#comment-17829302
 ] 

Laszlo Gaal commented on IMPALA-12564:
--

Since TSAN runs conclude successfully even in the absence of a loaded 
libfesupport.so, the suggested workaround is to suppress the attempt to load it 
into Hive during a TSAN run.
A test run of TSAN-specific tests with the removal hardcoded was successful, so 
I'll submit a patch for this workaround.

> TSAN test run fails to start Hive Server on Ubuntu 22.04
> 
>
> Key: IMPALA-12564
> URL: https://issues.apache.org/jira/browse/IMPALA-12564
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Major
>
> Hive Server startup hangs during ARM TSAN test runs. Logs were not 
> particularly illuminating.
> hive-server2.out shows
> {code}
> /usr/lib/jvm/java-8-openjdk-amd64/bin/java: symbol lookup error: 
> /home/michael/Impala/be/build/debug/service/libfesupport.so: undefined 
> symbol: __tsan_init
> {code}
> Can work around it locally by commenting out 
> https://github.com/apache/impala/blob/4.3.0/testdata/bin/run-hive-server.sh#L140-L146.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-12564) TSAN test run fails to start Hive Server on Ubuntu 22.04

2024-03-20 Thread Laszlo Gaal (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17829300#comment-17829300
 ] 

Laszlo Gaal commented on IMPALA-12564:
--

This is actually interesting.
I started chasing the symbol load failure by inserting 
{{LD_DEBUG=libs,files,symbols}} into {{testdata/bin/run-hive-server.sh}}, the 
script that launches Hive for the minicluster. The dynamic linker's debug 
output confirmed that the above symbol, {{__tsan_init}} fails to get resolved 
when libfesupport.so is loaded by the JVM as Hive tries to load it in 
https://github.com/apache/impala/blob/master/fe/src/main/java/org/apache/impala/service/FeSupport.java#L549-L555,
 via 
https://github.infra.cloudera.com/CDH/hive/blob/cdpd-master/ql/src/java/org/apache/hadoop/hive/ql/engine/EngineLoader.java

What is interesting is that the symbol resolution failure, and the subsequent 
failure to load libfesupport.so into Hive's JVM is actually present on all the 
platforms, even on the ones that seem to survive the load attempt (Centos 7 and 
Ubuntu 18.04). The difference between the platforms is in the JVM's subsequent 
reaction:
* on older platforms (Centos 7, Ubuntu 18.04) the JVM swallows the load failure 
and turns it into a Java exception that can be caught in FeSupport.java. This 
allows execution to continue normally.
* on newer platforms the JVM just emits the message about the symbol resolution 
failure and terminates, causing the minicluster's helper script that waits for 
the Hive startup to time out eventually.

This doesn't even seem to be a difference between the JVM/JDK versions native 
to the platform: I tried importing an older platform's JDK8 version to a newer 
OS (downloading it as a tarball and setting it up in its own directory tree, 
then pointing the PATH at this tree as well as setting JAVA_HOME for Impala), 
but the failure remained.

> TSAN test run fails to start Hive Server on Ubuntu 22.04
> 
>
> Key: IMPALA-12564
> URL: https://issues.apache.org/jira/browse/IMPALA-12564
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Major
>
> Hive Server startup hangs during ARM TSAN test runs. Logs were not 
> particularly illuminating.
> hive-server2.out shows
> {code}
> /usr/lib/jvm/java-8-openjdk-amd64/bin/java: symbol lookup error: 
> /home/michael/Impala/be/build/debug/service/libfesupport.so: undefined 
> symbol: __tsan_init
> {code}
> Can work around it locally by commenting out 
> https://github.com/apache/impala/blob/4.3.0/testdata/bin/run-hive-server.sh#L140-L146.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-12564) TSAN test run fails to start Hive Server on Ubuntu 22.04

2024-03-07 Thread Laszlo Gaal (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17824358#comment-17824358
 ] 

Laszlo Gaal commented on IMPALA-12564:
--

I don't think this is ARM-related: I saw exactly the same failure on RHEL 8.6 
in an Intel-based run.
To me it looks more like a difference between older and newer platforms: the 
fact that existing code works on Ubuntu 20.04 and on Centos 7.9 works, while 
the same code fails on Red Hat 8.6 and Ubuntu 22.04 suggest a platform 
difference caused by major version upgrades.

> TSAN test run fails to start Hive Server on Ubuntu 22.04
> 
>
> Key: IMPALA-12564
> URL: https://issues.apache.org/jira/browse/IMPALA-12564
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Major
>
> Hive Server startup hangs during ARM TSAN test runs. Logs were not 
> particularly illuminating.
> hive-server2.out shows
> {code}
> /usr/lib/jvm/java-8-openjdk-amd64/bin/java: symbol lookup error: 
> /home/michael/Impala/be/build/debug/service/libfesupport.so: undefined 
> symbol: __tsan_init
> {code}
> Can work around it locally by commenting out 
> https://github.com/apache/impala/blob/4.3.0/testdata/bin/run-hive-server.sh#L140-L146.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org