[ 
https://issues.apache.org/jira/browse/HBASE-11341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14031430#comment-14031430
 ] 

Jerry He commented on HBASE-11341:
----------------------------------

This enhancement is a good fix for the problem!

I have not seen that strange warning message previously on live cluster, 
including 0.94, 0.96, and 0.98.

In theory, it seems that it should not happen in the procedure framework. 
The master procedure coordinator creates /hbase/<proc>/reached/<instance>,  
then watch for create events on /hbase/<proc>/reached/<instance>/<member>.
It does not listen for create event on /hbase/<proc>/reached/<instance>.

On the other hand, region server member listens for create event on 
/hbase/<proc>/reached/<instance>.  Then creates 
/hbase/<proc>/reached/<instance>/<member>

My guess is still that the master and the region server sides are mixed up now.

Does it sound right?


> ZKProcedureCoordinatorRpcs should respond only to members
> ---------------------------------------------------------
>
>                 Key: HBASE-11341
>                 URL: https://issues.apache.org/jira/browse/HBASE-11341
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.99.0, 0.94.20, 0.98.3
>            Reporter: Matteo Bertozzi
>            Assignee: Matteo Bertozzi
>            Priority: Minor
>             Fix For: 0.99.0, 0.94.21, 0.98.4
>
>         Attachments: HBASE-11341-v0.patch
>
>
> ZKProcedureCoordinatorRpcs nodeCreated() responds to events of every znode 
> under the procedure and not only members.
> {code}
> INFO: Received created event: /hbase/<proc>/reached/<instance>
> INFO: Received created event: /hbase/<proc>/reached/<instance>/<member>
> {code}
> the result is a warning (no other side effects)
> {code}
> WARN [main-EventThread] procedure.ProcedureCoordinator: Member '<instance>' 
> is trying to release an unknown procedure 'reached
> {code}
> [~apurtell] posted in HBASE-10926 the steps to reproduce it:
> {noformat}
> $ cd ./src/hbase
> $ git checkout master
> $ mvn -DskipTests clean install
> {noformat}
> In one console:
> {noformat}
> $ ./bin/hbase master start
> {noformat}
> In another console:
> {noformat}
> $ ./bin/hbase org.apache.hadoop.hbase.util.LoadTestTool -num_keys 100000 \
>     -read 10:100 -write 1:100:10
> {noformat}
> In a third console:
> {noformat}
> $ ./bin/hbase shell
> hbase> while true do ; flush 'cluster_test' ; sleep 10 ; end
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to