[jira] [Updated] (TRAFODION-1107) LP Bug: 1438466 - Multiple tdm_arkcmp child processes started after receipt of HBase error

2018-03-06 Thread Suresh Subbiah (JIRA)

 [ https://issues.apache.org/jira/browse/TRAFODION-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suresh Subbiah updated TRAFODION-1107:
--
Fix Version/s: (was: 2.2.0)

> LP Bug: 1438466 - Multiple tdm_arkcmp child processes started after receipt 
> of HBase error
> --
>
> Key: TRAFODION-1107
> URL: https://issues.apache.org/jira/browse/TRAFODION-1107
> Project: Apache Trafodion
> Issue Type: Bug
> Components: sql-cmp
> Reporter: Joanie Cooper
> Assignee: Qifan Chen
> Priority: Critical
>
> During a fresh run of the compGeneral regression suite, with an error
> return artificially injected from the TrxRegionEndpoint coprocessor,
> numerous tdm_arkcmp child processes were started.
> Before the error hit, we seemed to have a normal number of compilers:
> [$Z0005MG] 000,3170 001 GEN  ES--A-- $Z0002KK $Z0002IV tdm_arkcmp
> [$Z0005MG] 000,3292 001 GEN  ES--A-- $Z0002P2 $Z0002KK tdm_arkcmp
> [$Z0005MG] 000,3816 001 GEN  ES--A-- $Z000341 $Z0002P2 tdm_arkcmp
> [$Z0005MG] 000,3886 001 GEN  ES--A-- $Z000361 $Z000341 tdm_arkcmp
> After forcing the error, it looks like new compilers are being generated,
> all ultimately descended from the original tdm_arkcmp parent of the sqlci
> session.
> This is the result of a drop statement. From the sqlci window, the
> statement appears hung, as it never returns. The compilers appear to keep
> generating new children, and the query never returns.
> When I killed the query, it had 174 compilers running.
> I tried a pstack for one of the compilers; I've attached it below.
> g4t3037{joaniec}3: sqps
> Processing cluster.conf on local host g4t3037.houston.hp.com
> [$Z000AF9] Shell/shell Version 1.0.1 Release 1.1.0 (Build release [joaniec], date 26Mar15)
> [$Z000AF9] %ps
> [$Z000AF9] NID,PID(os)  PRI TYPE STATES  NAME        PARENT      PROGRAM
> [$Z000AF9] ------------ --- ---- ------- ----------- ----------- ----------
> [$Z000AF9] 000,00018562 000 WDG  ES--A-- $WDG000     NONE        sqwatchdog
> [$Z000AF9] 000,00018563 000 PSD  ES--A-- $PSD000     NONE        pstartd
> [$Z000AF9] 000,00018592 001 DTM  ES--A-- $TM0        NONE        tm
> [$Z000AF9] 000,00019243 001 GEN  ES--A-- $ZSC000     NONE        mxsscp
> [$Z000AF9] 000,00019274 001 SSMP ES--A-- $ZSM000     NONE        mxssmp
> [$Z000AF9] 000,00020982 001 GEN  ES--A-- $ZLOBSRV0   NONE        mxlobsrvr
> [$Z000AF9] 000,7356     001 GEN  ES--A-- $Z000606    NONE        sqlci
> [$Z000AF9] 000,7416     001 GEN  ES--A-- $Z00061W    $Z000606    tdm_arkcmp
> [$Z000AF9] 000,7960     001 GEN  ES--A-- $Z0006HF    $Z00061W    tdm_arkcmp
> [$Z000AF9] 000,8021     001 GEN  ES--A-- $Z0006J6    $Z0006HF    tdm_arkcmp
> [$Z000AF9] 000,8079     001 GEN  ES--A-- $Z0006KU    $Z0006J6    tdm_arkcmp
> [$Z000AF9] 000,8137     001 GEN  ES--A-- $Z0006MH    $Z0006KU    tdm_arkcmp
> [$Z000AF9] 000,8194     001 GEN  ES--A-- $Z0006P4    $Z0006MH    tdm_arkcmp
> [$Z000AF9] 000,8252     001 GEN  ES--A-- $Z0006QS    $Z0006P4    tdm_arkcmp
> [$Z000AF9] 000,8312     001 GEN  ES--A-- $Z0006SH    $Z0006QS    tdm_arkcmp
> [$Z000AF9] 000,8369     001 GEN  ES--A-- $Z0006U4    $Z0006SH    tdm_arkcmp
> [$Z000AF9] 000,8427     001 GEN  ES--A-- $Z0006VS    $Z0006U4    tdm_arkcmp
> [$Z000AF9] 000,8491     001 GEN  ES--A-- $Z0006XL    $Z0006VS    tdm_arkcmp
> [$Z000AF9] 000,9023     001 GEN  ES--A-- $Z0007CT    $Z0006XL    tdm_arkcmp
> [$Z000AF9] 000,9081     001 GEN  ES--A-- $Z0007EG    $Z0007CT    tdm_arkcmp
> [$Z000AF9] 000,9141     001 GEN  ES--A-- $Z0007G6    $Z0007EG    tdm_arkcmp
> [$Z000AF9] 000,9202     001 GEN  ES--A-- $Z0007HX    $Z0007G6    tdm_arkcmp
> [$Z000AF9] 000,9262     001 GEN  ES--A-- $Z0007JM    $Z0007HX    tdm_arkcmp
> [$Z000AF9] 000,9320     001 GEN  ES--A-- $Z0007LA    $Z0007JM    tdm_arkcmp
> [$Z000AF9] 000,9489     001 GEN  ES--A-- $Z0007R4    $Z0007LA    tdm_arkcmp
> [$Z000AF9] 000,9547     001 GEN  ES--A-- $Z0007SS    $Z0007R4    tdm_arkcmp
> [$Z000AF9] 000,9604     001 GEN  ES--A-- $Z0007UE    $Z0007SS    tdm_arkcmp
> [$Z000AF9] 000,9661     001 GEN  ES--A-- $Z0007W1    $Z0007UE    tdm_arkcmp
> [$Z000AF9] 000,9728     001 GEN  ES--A-- $Z0007XY    $Z0007W1    tdm_arkcmp
> [$Z000AF9] 000,00010268 001 GEN  ES--A-- $Z0008DD    $Z0007XY    tdm_arkcmp
> [$Z000AF9] 000,00010364 001 GEN  ES--A-- $Z0008G4    $Z0008DD    tdm_arkcmp
> [$Z000AF9] 000,00010421 001 GEN  ES--A-- $Z0008HR    $Z0008G4    tdm_arkcmp
> 
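The process listing in the report shows each tdm_arkcmp parented by the previous one, so the growth is a single chain rather than a fan-out from one parent. A minimal sketch of how that chain length could be measured from captured `sqps` output follows. It assumes whitespace-separated NAME, PARENT and PROGRAM columns; the helper names `parse_rows` and `chain_depth` and the `SAMPLE` text are hypothetical and are not part of Trafodion.

```python
import re

# Hypothetical sample in the column order
# NID,PID(os) PRI TYPE STATES NAME PARENT PROGRAM.
# Assumption: columns are separated by whitespace.
SAMPLE = """\
[$Z000AF9] 000,7356 001 GEN  ES--A-- $Z000606 NONE     sqlci
[$Z000AF9] 000,7416 001 GEN  ES--A-- $Z00061W $Z000606 tdm_arkcmp
[$Z000AF9] 000,7960 001 GEN  ES--A-- $Z0006HF $Z00061W tdm_arkcmp
[$Z000AF9] 000,8021 001 GEN  ES--A-- $Z0006J6 $Z0006HF tdm_arkcmp
"""

# One process row; an optional leading "> " email quote marker is tolerated.
ROW = re.compile(
    r"^(?:>\s*)?\[\$\w+\]\s+(?P<nid>\d+),(?P<pid>\d+)\s+\d+\s+\w+\s+\S+\s+"
    r"(?P<name>\$\w+)\s+(?P<parent>\$\w+|NONE)\s+(?P<program>\S+)\s*$"
)


def parse_rows(text):
    """Return one dict per process row; header and banner lines are skipped."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            rows.append(match.groupdict())
    return rows


def chain_depth(rows, program="tdm_arkcmp"):
    """Length of the longest parent-to-child chain made only of `program`."""
    parent_of = {r["name"]: r["parent"] for r in rows}
    program_of = {r["name"]: r["program"] for r in rows}
    best = 0
    for name in parent_of:
        depth, current, seen = 0, name, set()
        # Walk upward until the parent is a different program or unknown.
        while program_of.get(current) == program and current not in seen:
            seen.add(current)
            depth += 1
            current = parent_of[current]
        best = max(best, depth)
    return best


if __name__ == "__main__":
    rows = parse_rows(SAMPLE)
    compilers = sum(1 for r in rows if r["program"] == "tdm_arkcmp")
    print(compilers, chain_depth(rows))
```

A chain depth that keeps rising between successive `sqps` snapshots would match the behaviour described above, where every compiler starts another compiler.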

[jira] [Updated] (TRAFODION-1107) LP Bug: 1438466 - Multiple tdm_arkcmp child processes started after receipt of HBase error

2018-03-04 Thread Pierre Smits (JIRA)

 [ https://issues.apache.org/jira/browse/TRAFODION-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pierre Smits updated TRAFODION-1107:

Fix Version/s: (was: 2.1-incubating)
   2.2.0

> LP Bug: 1438466 - Multiple tdm_arkcmp child processes started after receipt 
> of HBase error
> --
>
> Key: TRAFODION-1107
> URL: https://issues.apache.org/jira/browse/TRAFODION-1107
> Project: Apache Trafodion
> Issue Type: Bug
> Components: sql-cmp
> Reporter: Joanie Cooper
> Assignee: Qifan Chen
> Priority: Critical
> Fix For: 2.2.0
>
>