[ 
https://issues.apache.org/jira/browse/HBASE-10000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13844592#comment-13844592
 ] 

Ted Yu commented on HBASE-10000:
--------------------------------

There was one unclosed log, shown below.
{code}
2013-12-10 19:13:22,630 INFO  [MASTER_SERVER_OPERATIONS-hor13n02:60000-2] 
master.SplitLogManager: started splitting 2 logs in 
[hdfs://hor13n01.gq1.ygridcore.net:8020/apps/hbase/data/WALs/hor13n05.gq1.ygridcore.net,60020,1386702460286-splitting]
2013-12-10 19:13:22,636 INFO  [pool-19-thread-2] util.FSHDFSUtils: 
recoverLease=false, attempt=0 on 
file=hdfs://hor13n01.gq1.ygridcore.net:8020/apps/hbase/data/WALs/hor13n05.gq1.ygridcore.net,60020,1386702460286-splitting/hor13n05.gq1.ygridcore.net%2C60020%2C1386702460286.1386702750923
 after 1386702802636ms
2013-12-10 19:13:22,636 INFO  [pool-19-thread-1] util.FSHDFSUtils: 
recoverLease=true, attempt=0 on 
file=hdfs://hor13n01.gq1.ygridcore.net:8020/apps/hbase/data/WALs/hor13n05.gq1.ygridcore.net,60020,1386702460286-splitting/hor13n05.gq1.ygridcore.net%2C60020%2C1386702460286.1386702686049
 after 1386702802636ms
2013-12-10 19:13:22,650 INFO  
[hor13n02.gq1.ygridcore.net,60000,1386702649564.splitLogManagerTimeoutMonitor] 
master.SplitLogManager: resubmitting task 
/hbase/splitWAL/WALs%2Fhor13n04.gq1.ygridcore.net%2C60020%2C1386702205712-splitting%2Fhor13n04.gq1.ygridcore.net%252C60020%252C1386702205712.1386702769323
{code}
Log from NN to follow.

> Initiate lease recovery for outstanding WAL files at the very beginning of 
> recovery
> -----------------------------------------------------------------------------------
>
>                 Key: HBASE-10000
>                 URL: https://issues.apache.org/jira/browse/HBASE-10000
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Ted Yu
>            Assignee: Ted Yu
>             Fix For: 0.98.1
>
>         Attachments: 10000-0.96-v5.txt, 10000-0.96-v6.txt, 
> 10000-recover-ts-with-pb-2.txt, 10000-recover-ts-with-pb-3.txt, 
> 10000-recover-ts-with-pb-4.txt, 10000-recover-ts-with-pb-5.txt, 
> 10000-recover-ts-with-pb-6.txt, 10000-v4.txt, 10000-v5.txt, 10000-v6.txt
>
>
> At the beginning of recovery, master can send lease recovery requests 
> concurrently for outstanding WAL files using a thread pool.
> Each split worker would first check whether the WAL file it processes is 
> closed.
> Thanks to Nicolas Liochon and Jeffery discussion with whom gave rise to this 
> idea. 



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

Reply via email to