[jira] Updated: (HBASE-1878) BaseScanner results can't be trusted at all (Related to hbase-1784)

stack (JIRA) Wed, 30 Sep 2009 16:39:59 -0700

     [ 
https://issues.apache.org/jira/browse/HBASE-1878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


stack updated HBASE-1878:
-------------------------

    Attachment: basescanner.patch

Here's a patch.

I've been testing it on a cluster by restarting individual regionservers.

The below happened at least once:

{code}
2009-09-30 19:54:40,784 [IPC Server handler 1 on 60001] INFO org.apache.hadoop.hbase.master.ServerManager: Processing MSG_REPORT_OPEN: TestTable,0195089359,1254340471703 from XX.XX.XX.XX,60020,1254339974414; 1 of 1
2009-09-30 19:54:40,784 [HMaster] DEBUG org.apache.hadoop.hbase.master.HMaster: Processing todo: PendingOpenOperation from  XX.XX.XX.XX,60020,1254339974414
2009-09-30 19:54:40,784 [HMaster] INFO org.apache.hadoop.hbase.master.RegionServerOperation: TestTable,0195089359,1254340471703 open on  XX.XX.XX.XX:60020
2009-09-30 19:54:40,840 [HMaster] INFO org.apache.hadoop.hbase.master.RegionServerOperation: Updated row TestTable,0195089359,1254340471703 in region .META.,,1 with startcode=1254339974414, server= XX.XX.XX.XX:60020
2009-09-30 19:54:41,389 [main-SendThread] DEBUG org.apache.zookeeper.ClientCnxn: Got ping response for sessionid:0x1240c6fb09c0000 after 1ms
2009-09-30 19:54:42,043 [RegionManager.metaScanner] DEBUG org.apache.hadoop.hbase.master.BaseScanner: GET got different address than SCAN: sa= XX.XX.XX.XX:60020, serverAddress= XX.XX.XX.XX:60020
2009-09-30 19:54:42,043 [RegionManager.metaScanner] DEBUG org.apache.hadoop.hbase.master.BaseScanner: GET got different startcode than SCAN: sc=1254339974414, serverAddress=1254339099457
{code}

The above shows the GET returning a different result than the scan.  You can see 
the row being updated just over a second beforehand (at 19:54:40,840).
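
As a sanity check on the timings, here is a small Java sketch that computes the gap between two log timestamps. `LogGap` and `millisBetween` are made-up names for illustration and are not part of HBase; the only assumption is the `yyyy-MM-dd HH:mm:ss,SSS` timestamp prefix seen in the master logs.

```java
import java.time.Duration;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

// Hypothetical helper, not part of HBase: measures the gap between
// two timestamps written in the format used by the master log lines.
final class LogGap {
    // Matches prefixes such as "2009-09-30 19:54:40,840".
    private static final DateTimeFormatter FMT =
        DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss,SSS");

    // Returns the number of milliseconds from 'earlier' to 'later'.
    static long millisBetween(String earlier, String later) {
        LocalDateTime a = LocalDateTime.parse(earlier, FMT);
        LocalDateTime b = LocalDateTime.parse(later, FMT);
        return Duration.between(a, b).toMillis();
    }
}
```

Given the "Updated row" and "GET got different address" timestamps above (19:54:40,840 and 19:54:42,043) it returns 1203 ms. Given the "Updated row" and "not valid" timestamps in the original report quoted below (15:19:01,196 and 15:19:01,230) it returns 34 ms.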

I'm going to commit this change.  It effectively makes each scan heavier, since 
it is now a scan followed by a get on each row.
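
To illustrate what the extra get buys, here is a minimal sketch in Java that uses a plain in-memory map as a stand-in for .META.. `Assignment`, `MetaRecheck` and both of its methods are hypothetical names, and this is not the code in basescanner.patch; it only models the idea of re-reading a row before ruling its assignment invalid.

```java
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for the server/startcode pair stored in a .META. row.
final class Assignment {
    final String server;
    final long startCode;

    Assignment(String server, long startCode) {
        this.server = server;
        this.startCode = startCode;
    }
}

// Hypothetical model of the check; not an HBase class.
final class MetaRecheck {
    // Trusts whatever the scanner returned: the assignment is ruled invalid
    // if the row names no server or a startcode the master does not know.
    static boolean invalidByScanOnly(Assignment scanned, Set<Long> liveStartCodes) {
        return scanned == null || !liveStartCodes.contains(scanned.startCode);
    }

    // Re-reads the row first (the map lookup stands in for a GET on .META.)
    // and applies the same rule to the fresh value instead of the scanned one.
    static boolean invalidAfterGet(String region,
                                   Map<String, Assignment> meta,
                                   Set<Long> liveStartCodes) {
        Assignment fresh = meta.get(region);
        return invalidByScanOnly(fresh, liveStartCodes);
    }
}
```

With the values from the report quoted below (the scanner still seeing startcode 1253572119095 while .META. already holds 1254251818253), and assuming the master knows the newer server, `invalidByScanOnly` would rule the assignment invalid while `invalidAfterGet` would not. The price is one extra read per row.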

> BaseScanner results can't be trusted at all (Related to hbase-1784)
> -------------------------------------------------------------------
>
>                 Key: HBASE-1878
>                 URL: https://issues.apache.org/jira/browse/HBASE-1878
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>         Attachments: basescanner.patch
>
>
> Gario got a few double-assignments this morning while doing a rolling restart.
> {code}
> Sep 29 15:18:24 hnn0.int.meetup.com [hbase] 2009-09-29 15:18:24,010 [IPC Server handler 177 on 60000] INFO  org.apache.hadoop.hbase.master.ServerManager  - Processing MSG_REPORT_CLOSE: MemberFeedIndex,8828808,1252839447949 from hdn8.int.meetup.com,60020,1253572119095
> Sep 29 15:18:39 hnn0.int.meetup.com [hbase] 2009-09-29 15:18:39,369 [IPC Server handler 4 on 60000] INFO  org.apache.hadoop.hbase.master.RegionManager  - Assigning region MemberFeedIndex,8828808,1252839447949 to hdn7.int.meetup.com,60020,1254251818253
> Sep 29 15:18:42 hnn0.int.meetup.com [hbase] 2009-09-29 15:18:42,399 [IPC Server handler 32 on 60000] INFO  org.apache.hadoop.hbase.master.ServerManager  - Processing MSG_REPORT_PROCESS_OPEN: MemberFeedIndex,8828808,1252839447949 from hdn7.int.meetup.com,60020,1254251818253; 3 of 19
> Sep 29 15:18:42 hnn0.int.meetup.com [hbase] 2009-09-29 15:18:42,400 [IPC Server handler 32 on 60000] INFO  org.apache.hadoop.hbase.master.ServerManager  - Processing MSG_REPORT_OPEN: MemberFeedIndex,8828808,1252839447949 from hdn7.int.meetup.com,60020,1254251818253; 13 of 19
> Sep 29 15:19:01 hnn0.int.meetup.com [hbase] 2009-09-29 15:19:01,195 [HMaster] INFO  org.apache.hadoop.hbase.master.RegionServerOperation  - MemberFeedIndex,8828808,1252839447949 open on 192.168.60.213:60020
> Sep 29 15:19:01 hnn0.int.meetup.com [hbase] 2009-09-29 15:19:01,196 [HMaster] INFO  org.apache.hadoop.hbase.master.RegionServerOperation  - Updated row MemberFeedIndex,8828808,1252839447949 in region .META.,,1 with startcode=1254251818253, server=192.168.60.213:60020
> Sep 29 15:19:01 hnn0.int.meetup.com [hbase] 2009-09-29 15:19:01,230 [RegionManager.metaScanner] DEBUG org.apache.hadoop.hbase.master.BaseScanner  - Current assignment of MemberFeedIndex,8828808,1252839447949 is not valid;  serverAddress=192.168.60.214:60020, startCode=1253572119095 unknown.
> Sep 29 15:19:03 hnn0.int.meetup.com [hbase] 2009-09-29 15:19:03,059 [IPC Server handler 88 on 60000] INFO  org.apache.hadoop.hbase.master.RegionManager  - Assigning region MemberFeedIndex,8828808,1252839447949 to hdn8.int.meetup.com,60020,1254251940990
> Sep 29 15:19:06 hnn0.int.meetup.com [hbase] 2009-09-29 15:19:06,089 [IPC Server handler 97 on 60000] INFO  org.apache.hadoop.hbase.master.ServerManager  - Processing MSG_REPORT_OPEN: MemberFeedIndex,8828808,1252839447949 from hdn8.int.meetup.com,60020,1254251940990; 10 of 19
> Sep 29 15:19:06 hnn0.int.meetup.com [hbase] 2009-09-29 15:19:06,091 [HMaster] INFO  org.apache.hadoop.hbase.master.RegionServerOperation  - MemberFeedIndex,8828808,1252839447949 open on 192.168.60.214:60020
> Sep 29 15:19:06 hnn0.int.meetup.com [hbase] 2009-09-29 15:19:06,092 [HMaster] INFO  org.apache.hadoop.hbase.master.RegionServerOperation  - Updated row MemberFeedIndex,8828808,1252839447949 in region .META.,,1 with startcode=1254251940990, server=192.168.60.214:60020
> {code}
> There's about 35ms between the update of the row and the ruling that the 
> region is unassigned.
> HBASE-1784 added a GET on .META. row before ruling a region unassigned IFF 
> the server cell was empty.
> Need to do this GET every time.  BaseScanner can't be trusted.  Its info is 
> stale.   Need to hurry up with 0.21 rewrite.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
