In the 0.8 branch, Kylin started reading Hive tables via the HCatalog (HCat)
APIs, so the single-HDFS-file limitation is gone. However, the assumption that
dimension tables are small still exists.
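
For illustration, the kind of check behind the "Expect 1 and only 1 non-zero
file" error quoted below can be sketched roughly as follows. This is not
Kylin's actual code: it uses the local filesystem (java.nio.file) as a
stand-in for HDFS, and the class and method names are hypothetical.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not Kylin source: a local-filesystem analogue of a
// "one and only one non-empty file per table directory" check.
public class SingleFileCheck {

    // Returns the single non-empty regular file directly under dir,
    // or throws if there are zero or more than one such files.
    static Path findOnlyNonEmptyFile(Path dir) throws IOException {
        List<Path> nonEmpty = new ArrayList<>();
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(dir)) {
            for (Path entry : entries) {
                // Subdirectories and zero-length files are ignored.
                if (Files.isRegularFile(entry) && Files.size(entry) > 0) {
                    nonEmpty.add(entry);
                }
            }
        }
        if (nonEmpty.size() != 1) {
            throw new IllegalStateException(
                "Expect 1 and only 1 non-zero file under " + dir
                    + ", but find " + nonEmpty.size());
        }
        return nonEmpty.get(0);
    }
}
```

Under this sketch, a table directory that is empty, or that holds its data
only in subdirectories, yields "find 0"; a table split across several
non-empty files would fail the same check with a higher count.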

On Wed, May 27, 2015 at 4:19 PM, hongbin ma <[email protected]> wrote:

> For now, Kylin assumes that dimension tables are relatively small and that
> exactly one HDFS file exists for each such table.
>
> On Wed, May 27, 2015 at 4:17 PM, Luke Han <[email protected]> wrote:
>
> > Forwarding to the mailing list for further support.
> >
> > Thanks.
> >
> > On Wednesday, May 27, 2015 at 4:11:09 PM UTC+8, donald fossouo wrote:
> >>
> >> Hi, I have just started with Kylin, building the test cube and then my
> >> own cube on an HDP environment:
> >>
> >> HDP 2.2
> >>
> >> Hive 0.14, Hadoop 2.6
> >>
> >> I got the following error in the 3rd step:
> >>
> >>
> >>
> >> java.lang.IllegalStateException: Expect 1 and only 1 non-zero file under hdfs://my-name-node:8020/apps/hive/warehouse/dim_organisations, but find 0
> >>         at org.apache.kylin.dict.lookup.HiveTable.findOnlyFile(HiveTable.java:123)
> >>         at org.apache.kylin.dict.lookup.HiveTable.computeHDFSLocation(HiveTable.java:107)
> >>         at org.apache.kylin.dict.lookup.HiveTable.getHDFSLocation(HiveTable.java:83)
> >>         at org.apache.kylin.dict.lookup.HiveTable.getFileTable(HiveTable.java:76)
> >>         at org.apache.kylin.dict.lookup.HiveTable.getSignature(HiveTable.java:71)
> >>         at org.apache.kylin.dict.DictionaryManager.buildDictionary(DictionaryManager.java:164)
> >>         at org.apache.kylin.cube.CubeManager.buildDictionary(CubeManager.java:154)
> >>         at org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:53)
> >>         at org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:42)
> >>         at org.apache.kylin.job.hadoop.dict.CreateDictionaryJob.run(CreateDictionaryJob.java:53)
> >>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
> >>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
> >>         at org.apache.kylin.job.common.HadoopShellExecutable.doWork(HadoopShellExecutable.java:63)
> >>         at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
> >>         at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
> >>         at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
> >>         at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:132)
> >>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> >>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> >>         at java.lang.Thread.run(Thread.java:745)
> >>
> >> result code:2
> >>
> >> What is the cause of the problem?
> >>
> >>
>
>
> --
> Regards,
>
> *Bin Mahone | 马洪宾*
> Apache Kylin: http://kylin.io
> Github: https://github.com/binmahone
>
