[
https://issues.apache.org/jira/browse/KYLIN-4016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16965556#comment-16965556
]
nichunen commented on KYLIN-4016:
---------------------------------
Seems duplicate with https://issues.apache.org/jira/browse/KYLIN-3555
> Garbage Collection on HBase hbase数据存储在s3但是却找的是hdfs
> ---------------------------------------------------
>
> Key: KYLIN-4016
> URL: https://issues.apache.org/jira/browse/KYLIN-4016
> Project: Kylin
> Issue Type: Bug
> Components: Storage - HBase
> Affects Versions: v2.5.0
> Reporter: Tom_yj
> Priority: Major
>
> Garbage Collection on HBase
> hbase数据存储在s3上,清理数据时却找的是hdfs,报文件不存在异常。
> 需要清理的文件在s3上真实存在,hdfs上不存在
>
> kylin.properties
> kylin.env.hdfs-working-dir=s3://XXX-hive/kylin
> kylin.storage.hbase.cluster-fs=s3://XXX-hive/hbase
>
>
>
> log
> java.io.FileNotFoundException: File
> /kylin/kylin_metadata/kylin-d3926099-21bb-6893-1055-6d52f2fe17b7/XXX does not
> exist.
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:904)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:114)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$22.doCall(DistributedFileSystem.java:964)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$22.doCall(DistributedFileSystem.java:961)
> at
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:971)
> at
> org.apache.kylin.storage.hbase.steps.HDFSPathGarbageCollectionStep.dropHdfsPathOnCluster(HDFSPathGarbageCollectionStep.java:95)
> at
> org.apache.kylin.storage.hbase.steps.HDFSPathGarbageCollectionStep.doWork(HDFSPathGarbageCollectionStep.java:65)
> at
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:163)
> at
> org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:69)
> at
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:163)
> at
> org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:113)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)