[ 
https://issues.apache.org/jira/browse/HBASE-15100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15109274#comment-15109274
 ] 

Enis Soztutar commented on HBASE-15100:
---------------------------------------

Some more info from the logs. I think the first remaining entry was 168.log and
the last is 280.log.

It seems 279 is not complete:
{code}
2016-01-20 11:31:52,741 INFO  [cn017:16000.activeMasterManager] util.FSHDFSUtils: recoverLease=true, attempt=0 on file=hdfs://cn017.l42scl.hortonworks.com:8020/apps/hbase/data/MasterProcWALs/state-00000000000000000279.log after 1ms
2016-01-20 11:31:52,744 WARN  [cn017:16000.activeMasterManager] wal.WALProcedureStore: Unable to read tracker for hdfs://cn017.l42scl.hortonworks.com:8020/apps/hbase/data/MasterProcWALs/state-00000000000000000279.log - Missing trailer: size=10 startPos=10
2016-01-20 11:31:52,800 INFO  [cn017:16000.activeMasterManager] wal.WALProcedureStore: Lease acquired for flushLogId: 280
2016-01-20 11:31:52,805 WARN  [cn017:16000.activeMasterManager] wal.ProcedureWALFormatReader: nothing left to decode. exiting with missing EOF
2016-01-20 11:31:52,805 INFO  [cn017:16000.activeMasterManager] wal.ProcedureWALFormatReader: No active entry found in state log hdfs://cn017.l42scl.hortonworks.com:8020/apps/hbase/data/MasterProcWALs/state-00000000000000000279.log. removing it
{code}

After this, all logs between 168 and 280 got deleted, but not 168 and 280
themselves, although the master says that it is deleting them:
{code}
2016-01-20 11:31:53,757 INFO  [cn017:16000.activeMasterManager] wal.WALProcedureStore: Remove log: hdfs://cn017.l42scl.hortonworks.com:8020/apps/hbase/data/MasterProcWALs/state-00000000000000000169.log
2016-01-20 11:31:53,757 INFO  [cn017:16000.activeMasterManager] wal.WALProcedureStore: Removed logs: [hdfs://cn017.l42scl.hortonworks.com:8020/apps/hbase/data/MasterProcWALs/state-00000000000000000168.log, hdfs://cn017.l42scl.hortonworks.com:8020/apps/hbase/data/MasterProcWALs/state-00000000000000000280.log]
{code}

The logs are still there: 
{code}
[root@cn017 cluster-perf]# hadoop fs -ls /apps/hbase/data/MasterProcWALs 
Found 2 items
-rw-r--r--   3 hbase hdfs       9572 2016-01-15 20:27 /apps/hbase/data/MasterProcWALs/state-00000000000000000168.log
-rw-r--r--   3 hbase hdfs         39 2016-01-20 11:31 /apps/hbase/data/MasterProcWALs/state-00000000000000000280.log
{code}
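
The mismatch between the "Removed logs" message and the directory listing can
be checked mechanically. The following is only a rough sketch, not anything
from the HBase code base: it assumes the state-<zero-padded id>.log naming seen
in the output above, and the helper name log_ids is made up for illustration.
It reports the IDs that appear both in the "Removed logs" message and in the
listing.

```python
import re

# Assumption: procedure WAL files are named state-<zero-padded id>.log,
# as in the log output and listing quoted in this comment.
LOG_NAME = re.compile(r"state-(\d+)\.log")


def log_ids(text):
    """Return the sorted, de-duplicated numeric log IDs mentioned in text."""
    return sorted({int(m) for m in LOG_NAME.findall(text)})


# The "Removed logs" message from the master log (paths copied from above).
removed_msg = (
    "Removed logs: "
    "[hdfs://cn017.l42scl.hortonworks.com:8020/apps/hbase/data/"
    "MasterProcWALs/state-00000000000000000168.log, "
    "hdfs://cn017.l42scl.hortonworks.com:8020/apps/hbase/data/"
    "MasterProcWALs/state-00000000000000000280.log]"
)

# File names from the `hadoop fs -ls` output above.
listing = """\
/apps/hbase/data/MasterProcWALs/state-00000000000000000168.log
/apps/hbase/data/MasterProcWALs/state-00000000000000000280.log
"""

reported_removed = log_ids(removed_msg)
still_present = log_ids(listing)

# IDs named in the "Removed logs" message that are still on disk.
leftover = sorted(set(reported_removed) & set(still_present))
print(leftover)  # [168, 280]
```

Both IDs named in the message are still present in the directory, which matches
what the listing shows.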

Is it normal that logs are deleted out of order?  
For what it is worth, 168 contains a ServerCrashProcedure, so maybe that is
something to look at.


> Master WALProcs still never clean up
> ------------------------------------
>
>                 Key: HBASE-15100
>                 URL: https://issues.apache.org/jira/browse/HBASE-15100
>             Project: HBase
>          Issue Type: Bug
>          Components: master, proc-v2
>    Affects Versions: 1.2.0
>            Reporter: Elliott Clark
>            Assignee: Matteo Bertozzi
>            Priority: Blocker
>         Attachments: HBASE-15100-v0.patch, TestWalProcedure.java, procs.log
>
>
> {code}
> bin/hdfs dfs -ls /hbase/MasterProcWALs | wc -l
> 218631
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)