[
https://issues.apache.org/jira/browse/KUDU-3438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xixu Wang updated KUDU-3438:
----------------------------
Description:
The unit test of TabletCopyClientAbortTest maybe core. See the core stack
information.
{code:java}
/root/kudu/src/kudu/tserver/tablet_server-test-base.cc:130: FailureFailedBad
status: IO error: Couldn't create tablet metadata: Failed to create
TabletMetadata: All healthy data directories are full (error 28)
W20230123 18:02:20.993072 869956 reactor.cc:684] Failed to create an outbound
connection to 255.255.255.255:1 because connect() failed: Network error:
connect(2) error: Network is unreachable (error 101)
/root/kudu/src/kudu/tserver/tablet_copy-test-base.h:49: FailureExpected:
StartTabletServer(kNumDataDirs) doesn't generate new fatal failures in the
current thread.Actual: it does.
*** Aborted at 1674468140 (unix time) try "date -d @1674468140" if you are
using GNU date ***
PC: @ 0x0 (unknown)
*** SIGSEGV (@0x0) received by PID 868247 (TID 0x7f2d76bb8a00) from PID 0;
stack trace: ***
@ 0x7f2d7964e9f6 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7f2d7d4c6630 (unknown)
@ 0x4a32d0 kudu::tserver::TabletCopyClientTest::StartCopy()
@ 0x4a51c8 kudu::tserver::TabletCopyClientAbortTest::SetUp()
@ 0x7f2d81704bfe testing::internal::HandleExceptionsInMethodIfSupported<>()
@ 0x7f2d816f9566 testing::Test::Run()
@ 0x7f2d816f9795 testing::TestInfo::Run()
@ 0x7f2d816f9cdf testing::TestSuite::Run()
@ 0x7f2d816fa29f testing::internal::UnitTestImpl::RunAllTests()
@ 0x7f2d8170513e testing::internal::HandleExceptionsInMethodIfSupported<>()
@ 0x7f2d816f983d testing::UnitTest::Run()@ 0x7f2d81cc7f76 RUN_ALL_TESTS()
@ 0x7f2d81cc72e6 main
@ 0x7f2d77f17555 __libc_start_main
@ 0x48e879 (unknown)Segmentation fault (core dumped)
{code}
The reason is TabletCopyClientTest::SetUp() of TabletCopyClientAbortTest may
fail, for example, because of the full disk. TabletCopyClient will be not
initialized. Therefore using TabletCopyClient in StartCopy() will cause
coredump.
!image-2023-01-30-10-10-40-439.png!
was:
The unit test of TabletCopyClientAbortTest maybe core. See the core stack
information.
{code:java}
/root/kudu/src/kudu/tserver/tablet_server-test-base.cc:130: FailureFailedBad
status: IO error: Couldn't create tablet metadata: Failed to create
TabletMetadata: All healthy data directories are full (error 28)
W20230123 18:02:20.993072 869956 reactor.cc:684] Failed to create an outbound
connection to 255.255.255.255:1 because connect() failed: Network error:
connect(2) error: Network is unreachable (error 101)
/root/kudu/src/kudu/tserver/tablet_copy-test-base.h:49: FailureExpected:
StartTabletServer(kNumDataDirs) doesn't generate new fatal failures in the
current thread.Actual: it does.
/root/kudu/src/kudu/tserver/tablet_copy_client-test.cc:112: FailureExpected:
TabletCopyTest::SetUp() doesn't generate new fatal failures in the current
thread.Actual: it does.
W20230123 18:02:20.993108 870018 heartbeater.cc:399] Failed 3 heartbeats in a
row: no longer allowing fast heartbeat attempts.
*** Aborted at 1674468140 (unix time) try "date -d @1674468140" if you are
using GNU date ***
PC: @ 0x0 (unknown)
*** SIGSEGV (@0x0) received by PID 868247 (TID 0x7f2d76bb8a00) from PID 0;
stack trace: ***
@ 0x7f2d7964e9f6 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7f2d7d4c6630 (unknown)
@ 0x4a32d0 kudu::tserver::TabletCopyClientTest::StartCopy()
@ 0x4a51c8 kudu::tserver::TabletCopyClientAbortTest::SetUp()
@ 0x7f2d81704bfe testing::internal::HandleExceptionsInMethodIfSupported<>()
@ 0x7f2d816f9566 testing::Test::Run()
@ 0x7f2d816f9795 testing::TestInfo::Run()
@ 0x7f2d816f9cdf testing::TestSuite::Run()
@ 0x7f2d816fa29f testing::internal::UnitTestImpl::RunAllTests()
@ 0x7f2d8170513e testing::internal::HandleExceptionsInMethodIfSupported<>()
@ 0x7f2d816f983d testing::UnitTest::Run()@ 0x7f2d81cc7f76 RUN_ALL_TESTS()
@ 0x7f2d81cc72e6 main
@ 0x7f2d77f17555 __libc_start_main
@ 0x48e879 (unknown)Segmentation fault (core dumped)
{code}
The reason is TabletCopyClientTest::SetUp() of TabletCopyClientAbortTest may
fail, for example, because of the full disk. TabletCopyClient will be not
initialized. Therefore using TabletCopyClient in StartCopy() will cause
coredump.
!image-2023-01-30-10-10-40-439.png!
> The unit test of TabletCopyClientAbortTest maybe core
> -----------------------------------------------------
>
> Key: KUDU-3438
> URL: https://issues.apache.org/jira/browse/KUDU-3438
> Project: Kudu
> Issue Type: Bug
> Reporter: Xixu Wang
> Priority: Major
> Attachments: image-2023-01-30-10-10-40-439.png
>
>
> The unit test of TabletCopyClientAbortTest maybe core. See the core stack
> information.
> {code:java}
> /root/kudu/src/kudu/tserver/tablet_server-test-base.cc:130: FailureFailedBad
> status: IO error: Couldn't create tablet metadata: Failed to create
> TabletMetadata: All healthy data directories are full (error 28)
> W20230123 18:02:20.993072 869956 reactor.cc:684] Failed to create an outbound
> connection to 255.255.255.255:1 because connect() failed: Network error:
> connect(2) error: Network is unreachable (error 101)
> /root/kudu/src/kudu/tserver/tablet_copy-test-base.h:49: FailureExpected:
> StartTabletServer(kNumDataDirs) doesn't generate new fatal failures in the
> current thread.Actual: it does.
> *** Aborted at 1674468140 (unix time) try "date -d @1674468140" if you are
> using GNU date ***
> PC: @ 0x0 (unknown)
> *** SIGSEGV (@0x0) received by PID 868247 (TID 0x7f2d76bb8a00) from PID 0;
> stack trace: ***
> @ 0x7f2d7964e9f6 google::(anonymous namespace)::FailureSignalHandler()
> @ 0x7f2d7d4c6630 (unknown)
> @ 0x4a32d0 kudu::tserver::TabletCopyClientTest::StartCopy()
> @ 0x4a51c8 kudu::tserver::TabletCopyClientAbortTest::SetUp()
> @ 0x7f2d81704bfe testing::internal::HandleExceptionsInMethodIfSupported<>()
> @ 0x7f2d816f9566 testing::Test::Run()
> @ 0x7f2d816f9795 testing::TestInfo::Run()
> @ 0x7f2d816f9cdf testing::TestSuite::Run()
> @ 0x7f2d816fa29f testing::internal::UnitTestImpl::RunAllTests()
> @ 0x7f2d8170513e testing::internal::HandleExceptionsInMethodIfSupported<>()
> @ 0x7f2d816f983d testing::UnitTest::Run()@ 0x7f2d81cc7f76 RUN_ALL_TESTS()
> @ 0x7f2d81cc72e6 main
> @ 0x7f2d77f17555 __libc_start_main
> @ 0x48e879 (unknown)Segmentation fault (core dumped)
> {code}
>
>
> The reason is TabletCopyClientTest::SetUp() of TabletCopyClientAbortTest may
> fail, for example, because of the full disk. TabletCopyClient will be not
> initialized. Therefore using TabletCopyClient in StartCopy() will cause
> coredump.
> !image-2023-01-30-10-10-40-439.png!
--
This message was sent by Atlassian Jira
(v8.20.10#820010)