[
https://issues.apache.org/jira/browse/HAWQ-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chunling Wang closed HAWQ-559.
------------------------------
> QD hangs when QE is killed after connected to QD
> ------------------------------------------------
>
> Key: HAWQ-559
> URL: https://issues.apache.org/jira/browse/HAWQ-559
> Project: Apache HAWQ
> Issue Type: Bug
> Components: Dispatcher
> Affects Versions: 2.0.0.0-incubating
> Environment: mac os X 10.10
> Reporter: Chunling Wang
> Assignee: Lili Ma
> Fix For: 2.0.0.0-incubating
>
>
> When the first query finishes, the QE is still alive. Then we run the second
> query. After the thread of QD is created and bind to QE but not send data to
> QE, we kill this QE and find QD hangs.
> Here is the backtrace when QD hangs:
> {code}
> * thread #1: tid = 0x1c4afd, 0x00007fff890355be libsystem_kernel.dylib`poll +
> 10, queue = 'com.apple.main-thread', stop reason = signal SIGSTOP
> * frame #0: 0x00007fff890355be libsystem_kernel.dylib`poll + 10
> frame #1: 0x000000010745692c postgres`receiveChunksUDP [inlined]
> udpSignalPoll + 42 at ic_udp.c:2882
> frame #2: 0x0000000107456902 postgres`receiveChunksUDP + 26 at
> ic_udp.c:2715
> frame #3: 0x00000001074568e8 postgres`receiveChunksUDP [inlined]
> waitOnCondition(timeout_us=250000) + 82 at ic_udp.c:1599
> frame #4: 0x0000000107456896
> postgres`receiveChunksUDP(pTransportStates=0x00007ff2a381ae48,
> pEntry=0x00007ff2a18f2230, motNodeID=<unavailable>,
> srcRoute=0x00007fff58c0ce96, conn=<unavailable>, inTeardown='\0') + 726 at
> ic_udp.c:4039
> frame #5: 0x0000000107452a86 postgres`RecvTupleChunkFromAnyUDP [inlined]
> RecvTupleChunkFromAnyUDP_Internal + 498 at ic_udp.c:4146
> frame #6: 0x0000000107452894
> postgres`RecvTupleChunkFromAnyUDP(mlStates=<unavailable>,
> transportStates=<unavailable>, motNodeID=1, srcRoute=0x00007fff58c0ce96) +
> 100 at ic_udp.c:4167
> frame #7: 0x0000000107442254 postgres`RecvTupleFrom [inlined]
> processIncomingChunks(mlStates=0x00007ff2a3812a30,
> transportStates=0x00007ff2a381ae48, motNodeID=1, srcRoute=<unavailable>) + 34
> at cdbmotion.c:684
> frame #8: 0x0000000107442232
> postgres`RecvTupleFrom(mlStates=0x00007ff2a3812a30,
> transportStates=<unavailable>, motNodeID=1, tup_i=0x00007fff58c0cf00,
> srcRoute=-100) + 370 at cdbmotion.c:610
> frame #9: 0x00000001071c8778 postgres`ExecMotion [inlined]
> execMotionUnsortedReceiver(node=<unavailable>) + 57 at nodeMotion.c:466
> frame #10: 0x00000001071c873f postgres`ExecMotion(node=<unavailable>) +
> 1071 at nodeMotion.c:298
> frame #11: 0x00000001071a4835
> postgres`ExecProcNode(node=0x00007ff2a38164b8) + 613 at execProcnode.c:999
> frame #12: 0x00000001071b9f82 postgres`ExecAgg + 104 at nodeAgg.c:1163
> frame #13: 0x00000001071b9f1a postgres`ExecAgg + 316 at nodeAgg.c:1693
> frame #14: 0x00000001071b9dde postgres`ExecAgg(node=0x00007ff2a3815348) +
> 126 at nodeAgg.c:1138
> frame #15: 0x00000001071a4803
> postgres`ExecProcNode(node=0x00007ff2a3815348) + 563 at execProcnode.c:979
> frame #16: 0x000000010719ecfd
> postgres`ExecutePlan(estate=0x00007ff2a3814e30, planstate=0x00007ff2a3815348,
> operation=CMD_SELECT, numberTuples=0, direction=<unavailable>,
> dest=0x00007ff2a28db178) + 1181 at execMain.c:3218
> frame #17: 0x000000010719e619
> postgres`ExecutorRun(queryDesc=0x00007ff2a3811f00,
> direction=ForwardScanDirection, count=0) + 569 at execMain.c:1213
> frame #18: 0x00000001072e7fc2 postgres`PortalRun + 14 at pquery.c:1649
> frame #19: 0x00000001072e7fb4
> postgres`PortalRun(portal=0x00007ff2a1893e30, count=<unavailable>,
> isTopLevel='\x01', dest=<unavailable>, altdest=0x00007ff2a28db178,
> completionTag=0x00007fff58c0d530) + 1124 at pquery.c:1471
> frame #20: 0x00000001072e4a8e
> postgres`exec_simple_query(query_string=0x00007ff2a380fe30,
> seqServerHost=0x0000000000000000, seqServerPort=-1) + 2078 at postgres.c:1745
> frame #21: 0x00000001072e0c4c postgres`PostgresMain(argc=<unavailable>,
> argv=<unavailable>, username=0x00007ff2a201bcf0) + 9404 at postgres.c:4754
> frame #22: 0x000000010729a002 postgres`ServerLoop [inlined] BackendRun +
> 105 at postmaster.c:5889
> frame #23: 0x0000000107299f99 postgres`ServerLoop at postmaster.c:5484
> frame #24: 0x0000000107299f99 postgres`ServerLoop + 9593 at
> postmaster.c:2163
> frame #25: 0x0000000107296f3b postgres`PostmasterMain(argc=<unavailable>,
> argv=<unavailable>) + 5019 at postmaster.c:1454
> frame #26: 0x0000000107200ca9 postgres`main(argc=9,
> argv=0x00007ff2a141eef0) + 1433 at main.c:209
> frame #27: 0x00007fff95e8c5c9 libdyld.dylib`start + 1
> thread #2: tid = 0x1c4afe, 0x00007fff890355be libsystem_kernel.dylib`poll +
> 10
> frame #0: 0x00007fff890355be libsystem_kernel.dylib`poll + 10
> frame #1: 0x000000010744d8e3 postgres`rxThreadFunc(arg=<unavailable>) +
> 2163 at ic_udp.c:6251
> frame #2: 0x00007fff95e822fc libsystem_pthread.dylib`_pthread_body + 131
> frame #3: 0x00007fff95e82279 libsystem_pthread.dylib`_pthread_start + 176
> frame #4: 0x00007fff95e804b1 libsystem_pthread.dylib`thread_start + 13
> thread #3: tid = 0x1c4b02, 0x00007fff890343f6
> libsystem_kernel.dylib`__select + 10
> frame #0: 0x00007fff890343f6 libsystem_kernel.dylib`__select + 10
> frame #1: 0x00000001074ec47e postgres`pg_usleep(microsec=<unavailable>) +
> 78 at pgsleep.c:43
> frame #2: 0x0000000107400c26
> postgres`generateResourceRefreshHeartBeat(arg=0x00007ff2a141ce90) + 166 at
> rmcomm_QD2RM.c:1519
> frame #3: 0x00007fff95e822fc libsystem_pthread.dylib`_pthread_body + 131
> frame #4: 0x00007fff95e82279 libsystem_pthread.dylib`_pthread_start + 176
> frame #5: 0x00007fff95e804b1 libsystem_pthread.dylib`thread_start + 13
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)