[
https://issues.apache.org/jira/browse/ARROW-15793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17498690#comment-17498690
]
Yibo Cai commented on ARROW-15793:
----------------------------------
Turn on GRPC_VERBOSITY=DEBUG
{code:bash}
[==========] Running 1 test from 1 test suite.
[----------] Global test environment set-up.
[----------] 1 test from TestDoPut
[ RUN ] TestDoPut.DoPutLargeBatch
D0228 02:51:30.645780167 2893580 ev_posix.cc:173] Using polling
engine: epollex
D0228 02:51:30.645883567 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "grpclb"
D0228 02:51:30.645890327 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "priority_experimental"
D0228 02:51:30.645897087 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "weighted_target_experimental"
D0228 02:51:30.645903847 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "pick_first"
D0228 02:51:30.645906887 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "round_robin"
D0228 02:51:30.645910967 2893580 dns_resolver_ares.cc:500] Using ares dns
resolver
D0228 02:51:30.645924767 2893580 certificate_provider_registry.cc:33]
registering certificate provider factory for "file_watcher"
D0228 02:51:30.645929047 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "cds_experimental"
D0228 02:51:30.645931967 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "xds_cluster_impl_experimental"
D0228 02:51:30.645938127 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "xds_cluster_resolver_experimental"
D0228 02:51:30.645942327 2893580 lb_policy_registry.cc:42] registering LB
policy factory for "xds_cluster_manager_experimental"
I0228 02:51:30.646008368 2893580 server_builder.cc:319] Synchronous
server. Num CQs: 1, Min pollers: 1, Max Pollers: 2, CQ timeout (msec): 10000
I0228 02:51:30.646368128 2893580 socket_utils_common_posix.cc:352]
TCP_USER_TIMEOUT is available. TCP_USER_TIMEOUT will be used thereafter
I0228 02:51:30.648109410 2893580 subchannel.cc:1113] New connected
subchannel at 0x7002400 for subchannel 0x6ffc420
I0228 02:51:30.702831602 2893580 chttp2_transport.cc:1715] ipv6:[::1]:59224:
Sending goaway err={"created":"@1646016690.702800442","description":"Server
shutdown","file":"/home/cyb/arrow/cpp/relwithdebinfo/grpc_ep-prefix/src/grpc_ep/src/core/lib/surface/server.cc","file_line":468,"grpc_status":0}
D0228 02:51:30.710433651 2893580 init.cc:227] grpc_shutdown
starts clean-up now
// stuck here for 10 seconds ...
[ OK ] TestDoPut.DoPutLargeBatch (10018 ms)
[----------] 1 test from TestDoPut (10018 ms total)
[----------] Global test environment tear-down
[==========] 1 test from 1 test suite ran. (10018 ms total)
[ PASSED ] 1 test.
{code}
> [C++][FlightRPC] DoPutLargeBatch test sometimes stucks for 10 seconds
> ---------------------------------------------------------------------
>
> Key: ARROW-15793
> URL: https://issues.apache.org/jira/browse/ARROW-15793
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++, FlightRPC
> Reporter: Yibo Cai
> Priority: Major
>
> Normally the test finishes in 100ms. But it often costs 10s on my test
> machine.
> Debug build is good.
> I did brief debug, looks it's related to
> [https://github.com/apache/arrow/pull/12302].
> It stuck 10 seconds in destructing grpc::Server at code
> [https://github.com/apache/arrow/blob/master/cpp/src/arrow/flight/server.cc#L863]
> To reproduce:
> {code:bash}
> $ cmake -GNinja -DARROW_BUILD_TESTS=ON -DCMAKE_BUILD_TYPE=RelWithDebInfo
> -DARROW_FLIGHT=ON ..
> $ ninja arrow-flight-test
> $ relwithdebinfo/arrow-flight-test --gtest_filter="*DoPutLargeBatch*"
> [==========] Running 1 test from 1 test suite.
> [----------] Global test environment set-up.
> [----------] 1 test from TestDoPut
> [ RUN ] TestDoPut.DoPutLargeBatch
> [ OK ] TestDoPut.DoPutLargeBatch (10017 ms)
> [----------] 1 test from TestDoPut (10017 ms total)
> [----------] Global test environment tear-down
> [==========] 1 test from 1 test suite ran. (10017 ms total)
> [ PASSED ] 1 test.
> {code}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)