[ 
https://issues.apache.org/jira/browse/ARROW-17292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17616715#comment-17616715
 ] 

Weston Pace commented on ARROW-17292:
-------------------------------------

The asof join test failure is very useful.  I dug into it further and unearthed 
ARROW-18018.  It's possible that ARROW-18018 is the cause for the hash join 
test failure as well.  By introducing delays and stress I was able to trigger 
the hash join test to segfault as a result of ARROW-18018.  However, without 
being able to reproduce it and get a stack trace, it is pretty much impossible 
to tell for sure.  As a result, I have left ARROW-18018 as a separate JIRA.

Once it merges in, we should see if this failure continues to occur.

Either way, we have likely instances of this failure going back as far as I can 
go.  E.g. we started tracking nightly failures in Zulip in May and I still see 
this test failure sporadically though I cannot confirm the cause because Github 
no longer has the logs.  My suspicion is that, whatever this bug is, we have 
probably already released several releases with it, and it should not be a 
blocker for 10.0.0.

> [C++] Segmentation fault on arrow-compute-hash-join-node-test on macos 
> nightlies
> --------------------------------------------------------------------------------
>
>                 Key: ARROW-17292
>                 URL: https://issues.apache.org/jira/browse/ARROW-17292
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>            Reporter: Raúl Cumplido
>            Assignee: Vibhatha Lakmal Abeykoon
>            Priority: Blocker
>              Labels: Nightly, pull-request-available
>             Fix For: 10.0.0
>
>          Time Spent: 17.5h
>  Remaining Estimate: 0h
>
> Some of our nightly builds are failing due to a segmentation fault on 
> hash-join tests:
> {code:java}
>  33/90 Test #35: arrow-compute-hash-join-node-test .........***Failed    1.21 
> sec
> Running arrow-compute-hash-join-node-test, redirecting output into 
> /var/folders/24/8k48jl6d249_n_qfxwsl6xvm0000gn/T/arrow-HEAD.XXXXX.W72iCJcj/cpp-build/build/test-logs/arrow-compute-hash-join-node-test.txt
>  (attempt 1/1)
> /Users/runner/work/crossbow/crossbow/arrow/cpp/build-support/run-test.sh: 
> line 88: 78018 Segmentation fault: 11  $TEST_EXECUTABLE "$@" > $LOGFILE.raw 
> 2>&1
> Running main() from 
> /var/folders/24/8k48jl6d249_n_qfxwsl6xvm0000gn/T/arrow-HEAD.XXXXX.W72iCJcj/cpp-build/googletest_ep-prefix/src/googletest_ep/googletest/src/gtest_main.cc
> [==========] Running 29 tests from 4 test suites.
> [----------] Global test environment set-up.
> [----------] 10 tests from HashJoin
> [ RUN      ] HashJoin.Suffix
> [       OK ] HashJoin.Suffix (4 ms)
> [ RUN      ] HashJoin.Random
> /private/var/folders/24/8k48jl6d249_n_qfxwsl6xvm0000gn/T/arrow-HEAD.XXXXX.W72iCJcj/cpp-build/src/arrow/compute/exec
>  {code}
> The failures can be seen. It seems to be only related to macos from the 
> failed jobs:
> [verify-rc-source-cpp-macos-conda-amd64|https://github.com/ursacomputing/crossbow/runs/7631965199?check_suite_focus=true]
> [verify-rc-source-integration-macos-conda-amd64|https://github.com/ursacomputing/crossbow/runs/7631969879?check_suite_focus=true]
> [verify-rc-source-python-macos-amd64|https://github.com/ursacomputing/crossbow/runs/7631926429?check_suite_focus=true]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to