[jira] [Updated] (TRAFODION-1107) LP Bug: 1438466 - Multiple tdm_arkcmp child processes started after receipt of HBase error
[ https://issues.apache.org/jira/browse/TRAFODION-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suresh Subbiah updated TRAFODION-1107: -- Fix Version/s: (was: 2.2.0) > LP Bug: 1438466 - Multiple tdm_arkcmp child processes started after receipt > of HBase error > -- > > Key: TRAFODION-1107 > URL: https://issues.apache.org/jira/browse/TRAFODION-1107 > Project: Apache Trafodion > Issue Type: Bug > Components: sql-cmp >Reporter: Joanie Cooper >Assignee: Qifan Chen >Priority: Critical > > During a fresh test of running the compGeneral regression suite > while artificially interjecting an error return from the TrxRegionEndpoint > coprocessor, numerous tdm_arkcmp child processes were started. > Before the error hit, we seemed to have a normal number of compilers > [$Z0005MG] 000,3170 001 GEN ES--A-- $Z0002KK$Z0002IVtdm_arkcmp > > [$Z0005MG] 000,3292 001 GEN ES--A-- $Z0002P2$Z0002KKtdm_arkcmp > > [$Z0005MG] 000,3816 001 GEN ES--A-- $Z000341$Z0002P2tdm_arkcmp > > [$Z0005MG] 000,3886 001 GEN ES--A-- $Z000361$Z000341tdm_arkcmp > After forcing the error, it looks like we have new compilers being generated, > all ultimately part of the original tdm_arkcmp parent off of the sqlci > session. > This is a result of a drop statement. From the sqlci window, the > statement appears hung, as it never returns. But, it appears the > compilers keep generating new children and the query ultimately never returns. > When I killed the query, it had 174 compilers running. > I tried a pstack for one of the compilers, I’ve attached it below. > g4t3037{joaniec}3: sqps > Processing cluster.conf on local host g4t3037.houston.hp.com > [$Z000AF9] Shell/shell Version 1.0.1 Release 1.1.0 (Build release [joaniec], > date 26Mar15) > [$Z000AF9] %ps > [$Z000AF9] NID,PID(os) PRI TYPE STATES NAMEPARENT PROGRAM > [$Z000AF9] --- --- --- --- > --- > [$Z000AF9] 000,00018562 000 WDG ES--A-- $WDG000 NONEsqwatchdog > > [$Z000AF9] 000,00018563 000 PSD ES--A-- $PSD000 NONEpstartd > > [$Z000AF9] 000,00018592 001 DTM ES--A-- $TM0NONEtm > > [$Z000AF9] 000,00019243 001 GEN ES--A-- $ZSC000 NONEmxsscp > > [$Z000AF9] 000,00019274 001 SSMP ES--A-- $ZSM000 NONEmxssmp > > [$Z000AF9] 000,00020982 001 GEN ES--A-- $ZLOBSRV0 NONEmxlobsrvr > > [$Z000AF9] 000,7356 001 GEN ES--A-- $Z000606NONEsqlci > > [$Z000AF9] 000,7416 001 GEN ES--A-- $Z00061W$Z000606tdm_arkcmp > > [$Z000AF9] 000,7960 001 GEN ES--A-- $Z0006HF$Z00061Wtdm_arkcmp > > [$Z000AF9] 000,8021 001 GEN ES--A-- $Z0006J6$Z0006HFtdm_arkcmp > > [$Z000AF9] 000,8079 001 GEN ES--A-- $Z0006KU$Z0006J6tdm_arkcmp > > [$Z000AF9] 000,8137 001 GEN ES--A-- $Z0006MH$Z0006KUtdm_arkcmp > > [$Z000AF9] 000,8194 001 GEN ES--A-- $Z0006P4$Z0006MHtdm_arkcmp > > [$Z000AF9] 000,8252 001 GEN ES--A-- $Z0006QS$Z0006P4tdm_arkcmp > > [$Z000AF9] 000,8312 001 GEN ES--A-- $Z0006SH$Z0006QStdm_arkcmp > > [$Z000AF9] 000,8369 001 GEN ES--A-- $Z0006U4$Z0006SHtdm_arkcmp > > [$Z000AF9] 000,8427 001 GEN ES--A-- $Z0006VS$Z0006U4tdm_arkcmp > > [$Z000AF9] 000,8491 001 GEN ES--A-- $Z0006XL$Z0006VStdm_arkcmp > > [$Z000AF9] 000,9023 001 GEN ES--A-- $Z0007CT$Z0006XLtdm_arkcmp > > [$Z000AF9] 000,9081 001 GEN ES--A-- $Z0007EG$Z0007CTtdm_arkcmp > > [$Z000AF9] 000,9141 001 GEN ES--A-- $Z0007G6$Z0007EGtdm_arkcmp > > [$Z000AF9] 000,9202 001 GEN ES--A-- $Z0007HX$Z0007G6tdm_arkcmp > > [$Z000AF9] 000,9262 001 GEN ES--A-- $Z0007JM$Z0007HXtdm_arkcmp > > [$Z000AF9] 000,9320 001 GEN ES--A-- $Z0007LA$Z0007JMtdm_arkcmp > > [$Z000AF9] 000,9489 001 GEN ES--A-- $Z0007R4$Z0007LAtdm_arkcmp > > [$Z000AF9] 000,9547 001 GEN ES--A-- $Z0007SS$Z0007R4tdm_arkcmp > > [$Z000AF9] 000,9604 001 GEN ES--A-- $Z0007UE$Z0007SStdm_arkcmp > > [$Z000AF9] 000,9661 001 GEN ES--A-- $Z0007W1$Z0007UEtdm_arkcmp > > [$Z000AF9] 000,9728 001 GEN ES--A-- $Z0007XY$Z0007W1tdm_arkcmp > > [$Z000AF9] 000,00010268 001 GEN ES--A-- $Z0008DD$Z0007XYtdm_arkcmp > > [$Z000AF9] 000,00010364 001 GEN ES--A-- $Z0008G4$Z0008DDtdm_arkcmp > > [$Z000AF9] 000,00010421 001 GEN ES--A-- $Z0008HR$Z0008G4tdm_arkcmp > > [$
[jira] [Updated] (TRAFODION-1107) LP Bug: 1438466 - Multiple tdm_arkcmp child processes started after receipt of HBase error
[ https://issues.apache.org/jira/browse/TRAFODION-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pierre Smits updated TRAFODION-1107: Fix Version/s: (was: 2.1-incubating) 2.2.0 > LP Bug: 1438466 - Multiple tdm_arkcmp child processes started after receipt > of HBase error > -- > > Key: TRAFODION-1107 > URL: https://issues.apache.org/jira/browse/TRAFODION-1107 > Project: Apache Trafodion > Issue Type: Bug > Components: sql-cmp >Reporter: Joanie Cooper >Assignee: Qifan Chen >Priority: Critical > Fix For: 2.2.0 > > > During a fresh test of running the compGeneral regression suite > while artificially interjecting an error return from the TrxRegionEndpoint > coprocessor, numerous tdm_arkcmp child processes were started. > Before the error hit, we seemed to have a normal number of compilers > [$Z0005MG] 000,3170 001 GEN ES--A-- $Z0002KK$Z0002IVtdm_arkcmp > > [$Z0005MG] 000,3292 001 GEN ES--A-- $Z0002P2$Z0002KKtdm_arkcmp > > [$Z0005MG] 000,3816 001 GEN ES--A-- $Z000341$Z0002P2tdm_arkcmp > > [$Z0005MG] 000,3886 001 GEN ES--A-- $Z000361$Z000341tdm_arkcmp > After forcing the error, it looks like we have new compilers being generated, > all ultimately part of the original tdm_arkcmp parent off of the sqlci > session. > This is a result of a drop statement. From the sqlci window, the > statement appears hung, as it never returns. But, it appears the > compilers keep generating new children and the query ultimately never returns. > When I killed the query, it had 174 compilers running. > I tried a pstack for one of the compilers, I’ve attached it below. > g4t3037{joaniec}3: sqps > Processing cluster.conf on local host g4t3037.houston.hp.com > [$Z000AF9] Shell/shell Version 1.0.1 Release 1.1.0 (Build release [joaniec], > date 26Mar15) > [$Z000AF9] %ps > [$Z000AF9] NID,PID(os) PRI TYPE STATES NAMEPARENT PROGRAM > [$Z000AF9] --- --- --- --- > --- > [$Z000AF9] 000,00018562 000 WDG ES--A-- $WDG000 NONEsqwatchdog > > [$Z000AF9] 000,00018563 000 PSD ES--A-- $PSD000 NONEpstartd > > [$Z000AF9] 000,00018592 001 DTM ES--A-- $TM0NONEtm > > [$Z000AF9] 000,00019243 001 GEN ES--A-- $ZSC000 NONEmxsscp > > [$Z000AF9] 000,00019274 001 SSMP ES--A-- $ZSM000 NONEmxssmp > > [$Z000AF9] 000,00020982 001 GEN ES--A-- $ZLOBSRV0 NONEmxlobsrvr > > [$Z000AF9] 000,7356 001 GEN ES--A-- $Z000606NONEsqlci > > [$Z000AF9] 000,7416 001 GEN ES--A-- $Z00061W$Z000606tdm_arkcmp > > [$Z000AF9] 000,7960 001 GEN ES--A-- $Z0006HF$Z00061Wtdm_arkcmp > > [$Z000AF9] 000,8021 001 GEN ES--A-- $Z0006J6$Z0006HFtdm_arkcmp > > [$Z000AF9] 000,8079 001 GEN ES--A-- $Z0006KU$Z0006J6tdm_arkcmp > > [$Z000AF9] 000,8137 001 GEN ES--A-- $Z0006MH$Z0006KUtdm_arkcmp > > [$Z000AF9] 000,8194 001 GEN ES--A-- $Z0006P4$Z0006MHtdm_arkcmp > > [$Z000AF9] 000,8252 001 GEN ES--A-- $Z0006QS$Z0006P4tdm_arkcmp > > [$Z000AF9] 000,8312 001 GEN ES--A-- $Z0006SH$Z0006QStdm_arkcmp > > [$Z000AF9] 000,8369 001 GEN ES--A-- $Z0006U4$Z0006SHtdm_arkcmp > > [$Z000AF9] 000,8427 001 GEN ES--A-- $Z0006VS$Z0006U4tdm_arkcmp > > [$Z000AF9] 000,8491 001 GEN ES--A-- $Z0006XL$Z0006VStdm_arkcmp > > [$Z000AF9] 000,9023 001 GEN ES--A-- $Z0007CT$Z0006XLtdm_arkcmp > > [$Z000AF9] 000,9081 001 GEN ES--A-- $Z0007EG$Z0007CTtdm_arkcmp > > [$Z000AF9] 000,9141 001 GEN ES--A-- $Z0007G6$Z0007EGtdm_arkcmp > > [$Z000AF9] 000,9202 001 GEN ES--A-- $Z0007HX$Z0007G6tdm_arkcmp > > [$Z000AF9] 000,9262 001 GEN ES--A-- $Z0007JM$Z0007HXtdm_arkcmp > > [$Z000AF9] 000,9320 001 GEN ES--A-- $Z0007LA$Z0007JMtdm_arkcmp > > [$Z000AF9] 000,9489 001 GEN ES--A-- $Z0007R4$Z0007LAtdm_arkcmp > > [$Z000AF9] 000,9547 001 GEN ES--A-- $Z0007SS$Z0007R4tdm_arkcmp > > [$Z000AF9] 000,9604 001 GEN ES--A-- $Z0007UE$Z0007SStdm_arkcmp > > [$Z000AF9] 000,9661 001 GEN ES--A-- $Z0007W1$Z0007UEtdm_arkcmp > > [$Z000AF9] 000,9728 001 GEN ES--A-- $Z0007XY$Z0007W1tdm_arkcmp > > [$Z000AF9] 000,00010268 001 GEN ES--A-- $Z0008DD$Z0007XYtdm_arkcmp > > [$Z000AF9] 000,00010364 001 GEN ES--A-- $Z0008G4$Z0008DDtdm_arkcmp > > [$Z000AF9] 000,00010421 001