[
https://issues.apache.org/jira/browse/HBASE-9651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lars Hofhansl resolved HBASE-9651.
----------------------------------
Resolution: Fixed
Hadoop Flags: Reviewed
Was committed some time back. Marking fixed.
> Backport HBASE-3890 'Scheduled tasks in distributed log splitting not in sync
> with ZK' to 0.94
> ----------------------------------------------------------------------------------------------
>
> Key: HBASE-9651
> URL: https://issues.apache.org/jira/browse/HBASE-9651
> Project: HBase
> Issue Type: Bug
> Reporter: Ted Yu
> Assignee: Ted Yu
> Fix For: 0.94.13
>
> Attachments: 9651.patch
>
>
> HBASE-3890 was fixed in 0.96 and trunk. This issue is to backport to 0.94
> Note that there must be more slightly off here. Although the splitlogs znode
> is now empty the master is still stuck here:
> {code}
> Doing distributed log split in
> hdfs://localhost:8020/hbase/.logs/10.0.0.65,60020,1305406356765
> - Waiting for distributed tasks to finish. scheduled=2 done=1 error=0 4380s
> Master startup
> - Splitting logs after master startup 4388s
> {code}
> There seems to be an issue with what is in ZK and what the TaskBatch holds.
> In my case it could be related to the fact that the task was already in ZK
> after many faulty restarts because of the NPE. Maybe it was added once (since
> that is keyed by path, and that is unique on my machine), but the reference
> count upped twice? Now that the real one is done, the done counter has been
> increased, but will never match the scheduled.
--
This message was sent by Atlassian JIRA
(v6.1#6144)