Yes, copy it from the Hadoop environment. We haven't tested on HDP 2.5, so I
can't be sure that is the problem. If you're new to Kylin, I suggest you
downgrade to HDP 2.4 or even HDP 2.2, which are much easier to get working.
You can also open a JIRA for this problem (
https://issues.apache.org/jira/secure/Dashboard.jspa), and we will
investigate it.
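
To check which 'fs.defaultFS' a given core-site.xml declares before copying
it, something like the sketch below may help. It is only an illustration:
the helper names and the KYLIN_HOME/conf default path are my assumptions,
not part of Kylin. It relies on Hadoop falling back to "file:///" when the
property is not set, which would explain a "RawLocalFileSystem" error.

```python
import os
import sys
import xml.etree.ElementTree as ET

# Hadoop's built-in default when fs.defaultFS is not configured.
HADOOP_DEFAULT_FS = "file:///"


def read_default_fs(core_site_path):
    """Return fs.defaultFS from a core-site.xml.

    Falls back to Hadoop's default if the file or the property is missing.
    (Hypothetical helper for illustration; not a Kylin or Hadoop API.)
    """
    if not os.path.isfile(core_site_path):
        return HADOOP_DEFAULT_FS
    root = ET.parse(core_site_path).getroot()
    for prop in root.iter("property"):
        name = (prop.findtext("name") or "").strip()
        if name == "fs.defaultFS":
            value = (prop.findtext("value") or "").strip()
            return value or HADOOP_DEFAULT_FS
    return HADOOP_DEFAULT_FS


def is_hdfs(default_fs):
    """True if the default file system URI points at HDFS."""
    return default_fs.startswith("hdfs://")


if __name__ == "__main__":
    # Assumed location: $KYLIN_HOME/conf/core-site.xml; pass another path
    # as the first argument to check the cluster's own copy instead.
    default_path = os.path.join(
        os.environ.get("KYLIN_HOME", "."), "conf", "core-site.xml"
    )
    target = sys.argv[1] if len(sys.argv) > 1 else default_path
    fs = read_default_fs(target)
    print(f"{target}: fs.defaultFS = {fs}")
    if not is_hdfs(fs):
        print("Warning: default file system is not HDFS")
```

Running it once against the file under KYLIN_HOME/conf and once against the
cluster's own core-site.xml should show whether the two disagree.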

2017-02-05 22:00 GMT+08:00 磊 王 <[email protected]>:

> Thanks again for your reply.
>
> I will follow your suggestion and try it. Some points for discussion:
> I did not touch core-site.xml under the Kylin conf folder, so it should be
> the default. I do not think customization is always needed to make Kylin
> work, so why did this issue occur in my environment?
>
> Where can I get the correct core-site.xml? Can I copy it from my Hadoop
> environment? Or which specific conf items do you check to judge its
> correctness, say 'fs.defaultFS'?
>
>
> Sent from my iPhone
>
> > On Feb 5, 2017, at 21:38, ShaoFeng Shi <[email protected]> wrote:
> >
> > No, from the output of "hdfs dfs -ls" the file does exist on HDFS. In
> > Hive's context the default fs is HDFS, so it works as expected. The
> > behavior of Kylin is wrong, as it goes to "RawLocalFileSystem". I think
> > this is usually because core-site.xml is wrongly configured or absent
> > from Kylin's classpath. A quick workaround is to find the right
> > core-site.xml manually, copy it to KYLIN_HOME/conf, restart Kylin, and
> > then resume the job. Please give it a try and let us know whether it
> > solves the problem. (We haven't tried HDP 2.5.)
> >
> > 2017-02-05 21:22 GMT+08:00 磊 王 <[email protected]>:
> >
> >> Thanks for your reply.
> >> I see the log shows it first moved that file from HDFS to local and
> >> then read it from local. But the file to move does not exist in HDFS at
> >> the move step, so the local file is not available at the read step.
> >> Is my understanding correct?
> >>
> >> Sent from my iPhone
> >>
> >>> On Feb 5, 2017, at 20:54, ShaoFeng Shi <[email protected]> wrote:
> >>>
> >>> From the error, it tried to find the file on the local file system
> >>> instead of HDFS. Please check whether "fs.defaultFS" in your
> >>> environment was set to the local file system by mistake.
> >>>
> >>>
> >>> 2017-02-04 22:58 GMT+08:00 磊 王 <[email protected]>:
> >>>
> >>>> Hi Sir,
> >>>>
> >>>> When I built the sample cube (and my own cube), I got the error below.
> >>>> The issue seems to be at the moving step: I confirmed that
> >>>> ‘hdfs://sandbox.hortonworks.com:8020/kylin/kylin_metadata/kylin-6a392cfd-a903-4763-89cf-1ce44302c394/row_count/000000_0’
> >>>> exists, but what Kylin was trying to move is
> >>>> hdfs://sandbox.hortonworks.com:8020/kylin/kylin_metadata/kylin-6a392cfd-a903-4763-89cf-1ce44302c394/row_count/.hive-staging_hive_2017-02-04_09-38-21_841_1217104325959051054-5/-ext-10000.
> >>>>
> >>>> [root@ip-10-9-255-49 ec2-user]# hdfs dfs -ls hdfs://sandbox.hortonworks.com:8020/kylin/kylin_metadata/kylin-6a392cfd-a903-4763-89cf-1ce44302c394/row_count
> >>>> Found 1 items
> >>>> -rwxr-xr-x   1 hive hdfs          3 2017-02-04 09:38 hdfs://sandbox.hortonworks.com:8020/kylin/kylin_metadata/kylin-6a392cfd-a903-4763-89cf-1ce44302c394/row_count/000000_0
> >>>>
> >>>> Environment:
> >>>> Kylin 1.6.0 + HDP 2.5
> >>>> I am not sure whether HDP 2.5 is too new a version, because only HDP 2.4
> >>>> is mentioned in the Kylin docs.
> >>>>
> >>>>
> >>>> Build error:
> >>>> 2017-02-04 09:38:17,947 INFO  [pool-5-thread-2]
> >>>> manager.ExecutableManager:292 : job id:6a392cfd-a903-4763-89cf-
> >> 1ce44302c394-01
> >>>> from READY to RUNNING
> >>>> 2017-02-04 09:38:17,947 INFO  [pool-5-thread-2]
> hive.HiveCmdBuilder:81 :
> >>>> The statements to execute in beeline:
> >>>> USE default;
> >>>> SET hive.exec.compress.output=true;
> >>>> SET hive.auto.convert.join.noconditionaltask=true;
> >>>> SET hive.auto.convert.join.noconditionaltask.size=100000000;
> >>>> SET mapreduce.output.fileoutputformat.compress.type=BLOCK;
> >>>> SET mapreduce.job.split.metainfo.maxsize=-1;
> >>>>
> >>>> set hive.exec.compress.output=false;
> >>>>
> >>>> set hive.exec.compress.output=false;
> >>>> INSERT OVERWRITE DIRECTORY '/kylin/kylin_metadata/kylin-
> >>>> 6a392cfd-a903-4763-89cf-1ce44302c394/row_count' SELECT count(*) FROM
> >>>> kylin_intermediate_kylin_sales_cube_desc_0eb9faec_6a22_
> >>>> 4b49_889f_c283d82d72dd;
> >>>>
> >>>>
> >>>> 2017-02-04 09:38:17,947 DEBUG [pool-5-thread-2]
> hive.HiveCmdBuilder:83 :
> >>>> The SQL to execute in beeline:
> >>>>
> >>>> 2017-02-04 09:38:17,947 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : Compute row count of flat hive
> table,
> >>>> cmd:
> >>>> 2017-02-04 09:38:17,947 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : beeline -n root -u 'jdbc:hive2://
> >>>> sandbox.hortonworks.com:2181/;serviceDiscoveryMode=
> >>>> zooKeeper;zooKeeperNamespace=hiveserver2' -f
> >>>> /root/apache-kylin-1.6.0-hbase1.x-bin/bin/../tomcat/temp/beeline_
> >> 3013188987573907903.hql;rm
> >>>> -f /root/apache-kylin-1.6.0-hbase1.x-bin/bin/../tomcat/temp/beeline_
> >>>> 3013188987573907903.hql
> >>>> 2017-02-04 09:38:19,832 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : WARNING: Use "yarn jar" to launch
> >> YARN
> >>>> applications.
> >>>> 2017-02-04 09:38:20,342 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : Connecting to jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/;serviceDiscoveryMode=
> >> zooKeeper;zooKeeperNamespace=
> >>>> hiveserver2
> >>>> 2017-02-04 09:38:21,629 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : Connected to: Apache Hive (version
> >>>> 1.2.1000.2.5.0.0-1245)
> >>>> 2017-02-04 09:38:21,630 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : Driver: Hive JDBC (version
> >>>> 1.2.1.2.3.2.0-2950)
> >>>> 2017-02-04 09:38:21,630 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : Transaction isolation:
> >>>> TRANSACTION_REPEATABLE_READ
> >>>> 2017-02-04 09:38:21,679 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> USE default;
> >>>> 2017-02-04 09:38:21,732 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (0.051 seconds)
> >>>> 2017-02-04 09:38:21,746 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> SET hive.exec.compress.output=true;
> >>>> 2017-02-04 09:38:21,757 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (0.011 seconds)
> >>>> 2017-02-04 09:38:21,766 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> SET hive.auto.convert.join.
> >> noconditionaltask=true;
> >>>> 2017-02-04 09:38:21,768 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (0.002 seconds)
> >>>> 2017-02-04 09:38:21,777 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> SET hive.auto.convert.join.
> >> noconditionaltask.size=
> >>>> 100000000;
> >>>> 2017-02-04 09:38:21,779 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (0.002 seconds)
> >>>> 2017-02-04 09:38:21,787 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> SET mapreduce.output.
> fileoutputformat.compress.
> >>>> type=BLOCK;
> >>>> 2017-02-04 09:38:21,790 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (0.003 seconds)
> >>>> 2017-02-04 09:38:21,796 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> SET mapreduce.job.split.metainfo.maxsize=-1;
> >>>> 2017-02-04 09:38:21,801 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (0.005 seconds)
> >>>> 2017-02-04 09:38:21,804 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/>
> >>>> 2017-02-04 09:38:21,808 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> set hive.exec.compress.output=false;
> >>>> 2017-02-04 09:38:21,811 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (0.003 seconds)
> >>>> 2017-02-04 09:38:21,812 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/>
> >>>> 2017-02-04 09:38:21,816 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> set hive.exec.compress.output=false;
> >>>> 2017-02-04 09:38:21,818 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (0.002 seconds)
> >>>> 2017-02-04 09:38:21,833 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/> INSERT OVERWRITE DIRECTORY
> >>>> '/kylin/kylin_metadata/kylin-6a392cfd-a903-4763-89cf-1c
> >>>> e44302c394/row_count' SELECT count(*) FROM k
> >>>> 2017-02-04 09:38:21,840 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : ylin_intermediate_kylin_sales_
> >>>> cube_desc_0eb9faec_6a22_4b49_889f_c283d82d72dd;
> >>>> 2017-02-04 09:38:30,452 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Tez session hasn't been
> >> created
> >>>> yet. Opening session
> >>>> 2017-02-04 09:38:30,452 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Dag name: INSERT OVERWRITE
> >>>> DIRE...49_889f_c283d82d72dd(Stage-1)
> >>>> 2017-02-04 09:38:30,452 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  :
> >>>> 2017-02-04 09:38:30,452 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 :
> >>>> 2017-02-04 09:38:30,452 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Status: Running (Executing
> on
> >>>> YARN cluster with App id application_1486198519344_0004)
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 :
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Map 1: -/-    Reducer 2:
> 0/1
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Map 1: 0/1    Reducer 2:
> 0/1
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Map 1: 0(+1)/1
> Reducer
> >>>> 2: 0/1
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Map 1: 1/1    Reducer 2:
> 0/1
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Map 1: 1/1    Reducer 2:
> >> 0(+1)/1
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Map 1: 1/1    Reducer 2:
> 1/1
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : INFO  : Moving data to directory
> >>>> /kylin/kylin_metadata/kylin-6a392cfd-a903-4763-89cf-
> >> 1ce44302c394/row_count
> >>>> from hdfs://sandbox.h
> >>>> ortonworks.com:8020/kylin/kylin_metadata/kylin-6a392cfd-
> >>>> a903-4763-89cf-1ce44302c394/row_count/.hive-staging_hive_
> >>>> 2017-02-04_09-38-21_841_1217104325959051054-5/-ext-10000
> >>>> 2017-02-04 09:38:30,453 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : No rows affected (8.613 seconds)
> >>>> 2017-02-04 09:38:30,457 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/>
> >>>> 2017-02-04 09:38:30,457 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/>
> >>>> 2017-02-04 09:38:30,458 INFO  [pool-5-thread-2]
> >>>> execution.AbstractExecutable:36 : Closing: 0: jdbc:hive2://sandbox.
> >>>> hortonworks.com:2181/;serviceDiscoveryMode=
> >> zooKeeper;zooKeeperNamespace=
> >>>> hiveserver2
> >>>> 2017-02-04 09:38:30,573 ERROR [pool-5-thread-2]
> >>>> execution.AbstractExecutable:370 : job:6a392cfd-a903-4763-89cf-
> >> 1ce44302c394-01
> >>>> execute finished with exception
> >>>> java.io.FileNotFoundException: File /kylin/kylin_metadata/kylin-
> >>>> 6a392cfd-a903-4763-89cf-1ce44302c394/row_count does not exist
> >>>>       at org.apache.hadoop.fs.RawLocalFileSystem.listStatus(
> >>>> RawLocalFileSystem.java:429)
> >>>>       at org.apache.hadoop.fs.FileSystem.listStatus(
> >>>> FileSystem.java:1515)
> >>>>       at org.apache.hadoop.fs.FileSystem.listStatus(
> >>>> FileSystem.java:1555)
> >>>>       at org.apache.hadoop.fs.ChecksumFileSystem.listStatus(
> >>>> ChecksumFileSystem.java:574)
> >>>>       at org.apache.kylin.source.hive.HiveMRInput$
> >>>> RedistributeFlatHiveTableStep.doWork(HiveMRInput.java:338)
> >>>>       at org.apache.kylin.job.execution.AbstractExecutable.
> >>>> execute(AbstractExecutable.java:113)
> >>>>       at org.apache.kylin.job.execution.DefaultChainedExecutable.
> >> doWork(
> >>>> DefaultChainedExecutable.java:57)
> >>>>       at org.apache.kylin.job.execution.AbstractExecutable.
> >>>> execute(AbstractExecutable.java:113)
> >>>>       at org.apache.kylin.job.impl.threadpool.DefaultScheduler$
> >>>> JobRunner.run(DefaultScheduler.java:136)
> >>>>
> >>>>
> >>>>
> >>>> Thx
> >>>> Lei Wang
> >>>>
> >>>>
> >>>
> >>>
> >>> --
> >>> Best regards,
> >>>
> >>> Shaofeng Shi 史少锋
> >>
> >
> >
> >
> > --
> > Best regards,
> >
> > Shaofeng Shi 史少锋
>



-- 
Best regards,

Shaofeng Shi 史少锋
