[ 
https://issues.apache.org/jira/browse/ARROW-15793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17498690#comment-17498690
 ] 

Yibo Cai commented on ARROW-15793:
----------------------------------

Turn on GRPC_VERBOSITY=DEBUG
{code:bash}
[==========] Running 1 test from 1 test suite.
[----------] Global test environment set-up.
[----------] 1 test from TestDoPut
[ RUN      ] TestDoPut.DoPutLargeBatch
D0228 02:51:30.645780167 2893580 ev_posix.cc:173]            Using polling 
engine: epollex
D0228 02:51:30.645883567 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "grpclb"
D0228 02:51:30.645890327 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "priority_experimental"
D0228 02:51:30.645897087 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "weighted_target_experimental"
D0228 02:51:30.645903847 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "pick_first"
D0228 02:51:30.645906887 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "round_robin"
D0228 02:51:30.645910967 2893580 dns_resolver_ares.cc:500]   Using ares dns 
resolver
D0228 02:51:30.645924767 2893580 certificate_provider_registry.cc:33] 
registering certificate provider factory for "file_watcher"
D0228 02:51:30.645929047 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "cds_experimental"
D0228 02:51:30.645931967 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "xds_cluster_impl_experimental"
D0228 02:51:30.645938127 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "xds_cluster_resolver_experimental"
D0228 02:51:30.645942327 2893580 lb_policy_registry.cc:42]   registering LB 
policy factory for "xds_cluster_manager_experimental"
I0228 02:51:30.646008368 2893580 server_builder.cc:319]      Synchronous 
server. Num CQs: 1, Min pollers: 1, Max Pollers: 2, CQ timeout (msec): 10000
I0228 02:51:30.646368128 2893580 socket_utils_common_posix.cc:352] 
TCP_USER_TIMEOUT is available. TCP_USER_TIMEOUT will be used thereafter
I0228 02:51:30.648109410 2893580 subchannel.cc:1113]         New connected 
subchannel at 0x7002400 for subchannel 0x6ffc420
I0228 02:51:30.702831602 2893580 chttp2_transport.cc:1715]   ipv6:[::1]:59224: 
Sending goaway err={"created":"@1646016690.702800442","description":"Server 
shutdown","file":"/home/cyb/arrow/cpp/relwithdebinfo/grpc_ep-prefix/src/grpc_ep/src/core/lib/surface/server.cc","file_line":468,"grpc_status":0}
D0228 02:51:30.710433651 2893580 init.cc:227]                grpc_shutdown 
starts clean-up now

// stuck here for 10 seconds ...

[       OK ] TestDoPut.DoPutLargeBatch (10018 ms)
[----------] 1 test from TestDoPut (10018 ms total)

[----------] Global test environment tear-down
[==========] 1 test from 1 test suite ran. (10018 ms total)
[  PASSED  ] 1 test.
{code}

> [C++][FlightRPC] DoPutLargeBatch test sometimes stucks for 10 seconds
> ---------------------------------------------------------------------
>
>                 Key: ARROW-15793
>                 URL: https://issues.apache.org/jira/browse/ARROW-15793
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++, FlightRPC
>            Reporter: Yibo Cai
>            Priority: Major
>
> Normally the test finishes in 100ms. But it often costs 10s on my test 
> machine.
> Debug build is good.
> I did brief debug, looks it's related to 
> [https://github.com/apache/arrow/pull/12302].
> It stuck 10 seconds in destructing grpc::Server at code 
> [https://github.com/apache/arrow/blob/master/cpp/src/arrow/flight/server.cc#L863]
> To reproduce:
> {code:bash}
> $ cmake -GNinja -DARROW_BUILD_TESTS=ON -DCMAKE_BUILD_TYPE=RelWithDebInfo 
> -DARROW_FLIGHT=ON ..
> $ ninja arrow-flight-test
> $ relwithdebinfo/arrow-flight-test --gtest_filter="*DoPutLargeBatch*"
> [==========] Running 1 test from 1 test suite.
> [----------] Global test environment set-up.
> [----------] 1 test from TestDoPut
> [ RUN      ] TestDoPut.DoPutLargeBatch
> [       OK ] TestDoPut.DoPutLargeBatch (10017 ms)
> [----------] 1 test from TestDoPut (10017 ms total)
> [----------] Global test environment tear-down
> [==========] 1 test from 1 test suite ran. (10017 ms total)
> [  PASSED  ] 1 test.
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to