Kandarpa created ARROW-11075:
--------------------------------
Summary: Getting reference not found with OCR enabled pyarrow
Key: ARROW-11075
URL: https://issues.apache.org/jira/browse/ARROW-11075
Project: Apache Arrow
Issue Type: Bug
Components: C++
Affects Versions: 1.0.1
Environment: PPC64LE
Reporter: Kandarpa
Generated the pyarrow with OCR enabled on Power using following steps:
{code:java}
export ARROW_HOME=$CONDA_PREFIX
mkdir cpp/build
cd cpp/build
cmake -DCMAKE_INSTALL_PREFIX=$ARROW_HOME \
-DCMAKE_INSTALL_LIBDIR=lib \
-DARROW_WITH_BZ2=ON \
-DARROW_WITH_ZLIB=ON \
-DARROW_WITH_ZSTD=ON \
-DARROW_WITH_LZ4=ON \
-DARROW_WITH_SNAPPY=ON \
-DARROW_WITH_BROTLI=ON \
-DARROW_PARQUET=ON \
-DARROW_PYTHON=ON \
-DARROW_BUILD_TESTS=ON \
-DARROW_CUDA=ON \
-DCUDA_CUDA_LIBRARY=/usr/local/cuda/lib64/stubs/libcuda.so \
-DARROW_ORC=ON \
..
make -j
make install
cd ../../python
python setup.py build_ext --bundle-arrow-cpp --with-orc --with-cuda
--with-parquet bdist_wheel
{code}
With the generated whl package installed, ran CUDF tests and observed following
error:
*_ERROR cudf - ImportError:
/conda/envs/rmm/lib/python3.7/site-packages/pyarrow/_orc.cpython-37m-powerpc64le-linux-gnu.so:
undefined symbol: _ZN5arrow8adapters3orc13OR..._*
Please find the whole error log below:
================================================================================
ERRORS
================================================================================
____________________________________________________________________ ERROR
collecting test session
_____________________________________________________________________
/conda/envs/rmm/lib/python3.7/importlib/__init__.py:127: in import_module
return _bootstrap._gcd_import(name[level:], package, level)
<frozen importlib._bootstrap>:1006: in _gcd_import
???
<frozen importlib._bootstrap>:983: in _find_and_load
???
<frozen importlib._bootstrap>:953: in _find_and_load_unlocked
???
<frozen importlib._bootstrap>:219: in _call_with_frames_removed
???
<frozen importlib._bootstrap>:1006: in _gcd_import
???
<frozen importlib._bootstrap>:983: in _find_and_load
???
<frozen importlib._bootstrap>:953: in _find_and_load_unlocked
???
<frozen importlib._bootstrap>:219: in _call_with_frames_removed
???
<frozen importlib._bootstrap>:1006: in _gcd_import
???
<frozen importlib._bootstrap>:983: in _find_and_load
???
<frozen importlib._bootstrap>:967: in _find_and_load_unlocked
???
<frozen importlib._bootstrap>:677: in _load_unlocked
???
<frozen importlib._bootstrap_external>:728: in exec_module
???
<frozen importlib._bootstrap>:219: in _call_with_frames_removed
???
cudf/cudf/__init__.py:60: in <module>
from cudf.io import (
cudf/cudf/io/__init__.py:8: in <module>
from cudf.io.orc import read_orc, read_orc_metadata, to_orc
cudf/cudf/io/orc.py:6: in <module>
from pyarrow import orc as orc
/conda/envs/rmm/lib/python3.7/site-packages/pyarrow/orc.py:24: in <module>
import pyarrow._orc as _orc
{color:#de350b}E ImportError:
/conda/envs/rmm/lib/python3.7/site-packages/pyarrow/_orc.cpython-37m-powerpc64le-linux-gnu.so:
undefined symbol:
_ZN5arrow8adapters3orc13ORCFileReader4ReadEPSt10shared_ptrINS_5TableEE{color}
======================================================================= short
test summary info
========================================================================
*_ERROR cudf - ImportError:
/conda/envs/rmm/lib/python3.7/site-packages/pyarrow/_orc.cpython-37m-powerpc64le-linux-gnu.so:
undefined symbol: _ZN5arrow8adapters3orc13OR..._*
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Interrupted:
1 error during collection
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
=========================================================================== 1
error in 1.54s
===========================================================================
Fatal Python error: Segmentation fault
--
This message was sent by Atlassian Jira
(v8.3.4#803005)