date:20190219

Hbase table is always empty when build with spark

2019-02-19 Thread mailpig

In kylin-2.5.2, the result hbase table is always table is always empty when I
build cube with spark.
I found that the step "Load HFile to HBase Table" has some warn log:
/2019-01-27 00:49:30,067 WARN [Scheduler 448149092 Job
89a25959-e12d-7a5e-0ecb-80c978533eab-6419]
mapreduce.LoadIncrementalHFiles:204 : Skipping non-directory
hdfs://test/kylin/kylin_metadata/kylin-89a25959-e12d-7a5e-0ecb-80c978533eab/test_UUID_spark/hfile/_SUCCESS
2019-01-27 00:49:30,068 WARN [Scheduler 448149092 Job
89a25959-e12d-7a5e-0ecb-80c978533eab-6419]
mapreduce.LoadIncrementalHFiles:204 : Skipping non-directory
hdfs://test/kylin/kylin_metadata/kylin-89a25959-e12d-7a5e-0ecb-80c978533eab/test_UUID_spark/hfile/part-r-0
2019-01-27 00:49:30,068 WARN [Scheduler 448149092 Job
89a25959-e12d-7a5e-0ecb-80c978533eab-6419]
mapreduce.LoadIncrementalHFiles:204 : Skipping non-directory
hdfs://test/kylin/kylin_metadata/kylin-89a25959-e12d-7a5e-0ecb-80c978533eab/test_UUID_spark/hfile/part-r-1/

After read the source code, I found the step "Convert Cuboid Data to HFile"
with spark has bug. The above step's outputdir should has subdirectory with
column family. Indeed, SparkCubeHFile must set
mapreduce.job.outputformat.class with HFileOutputFormat2.class. 

Please check if I am correct！

--
Sent from: http://apache-kylin.74782.x6.nabble.com/

[jira] [Created] (KYLIN-3819) kylin.sh run in the foreground

2019-02-19 Thread Temple Zhou (JIRA)

Temple Zhou created KYLIN-3819:
--

 Summary: kylin.sh run in the foreground
 Key: KYLIN-3819
 URL: https://issues.apache.org/jira/browse/KYLIN-3819
 Project: Kylin
  Issue Type: Improvement
  Components: Tools, Build and Test
Affects Versions: v2.6.0, v2.5.0, v2.4.0
Reporter: Temple Zhou
Assignee: Temple Zhou


The "kylin.sh start" can only run kylin in the background, but it should be run 
in the foreground as well. Because only in this way, both streams can be 
captured if using a supervision system like daemontools.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Re: Re: [Discuss] Moving toward Apache Kylin 3.0

2019-02-19 Thread ShaoFeng Shi

Thanks for the feedbacks; Since there was no objection on this, the version
on the current master branch has been updated to 3.0.0-SNAPSHOT.

Best regards,

Shaofeng Shi 史少锋
Apache Kylin PMC
Work email: shaofeng@kyligence.io
Kyligence Inc: https://kyligence.io/

Apache Kylin FAQ: https://kylin.apache.org/docs/gettingstarted/faq.html
Join Kylin user mail group: user-subscr...@kylin.apache.org
Join Kylin dev mail group: dev-subscr...@kylin.apache.org




Yichen Zhou  于2019年1月25日周五 下午4:14写道：

> +1
>
> Regards,
> Yichen
>
> On Thu, Jan 24, 2019 at 11:49 PM nichunen  wrote:
>
> > +1
> >
> >
> > --
> >
> >
> > Best regards,
> >
> >
> >
> > Ni Chunen / George
> >
> >
> >
> >
> > At 2019-01-25 15:48:21, "Billy Liu"  wrote:
> > >+1
> > >
> > >That's cool. Let's move to the real-time scenario.
> > >
> > >With Warm regards
> > >
> > >Billy Liu
> > >
> > >
> > >Temple Zhou  于2019年1月25日周五 下午3:32写道：
> > >
> > >> +1
> > >> Real-time streaming feature may be expected by many people.
> > >> ———
> > >> Best wishes~
> > >> Temple Zhou
> > >>
> > >>  Original Message
> > >> *Sender:* zhan shaoxiong
> > >> *Recipient:* dev@kylin.apache.org
> > >> *Date:* Friday, Jan 25, 2019 15:27
> > >> *Subject:* Re: [Discuss] Moving toward Apache Kylin 3.0
> > >>
> > >> +1
> > >> thanks
> > >>
> > >> 在 2019/1/23 下午3:57，“ShaoFeng Shi” 写入:
> > >>
> > >> Hi Kylin developers,
> > >>
> > >> In last week, Kylin released v2.6.0, with the enhanced &
> > distributed query
> > >> cache and JDBC data source SDK. After this release, the next batch
> > >> candidate features include real-time streaming, parquet storage,
> > and druid
> > >> storage. These features were developed in the past 1-2 years by
> > different
> > >> Kylin players and were open sourced in the past 6 months. They
> have
> > already
> > >> been staged in separate branches and are under evaluation by the
> > community.
> > >> We have received much feedback from the community.
> > >>
> > >> These candidate features are big supplements to as-is Kylin
> > functions; For
> > >> example, the real-time streaming feature will bring Kylin from
> > batch &
> > >> historical analytics into real-time analytics. The parquet storage
> > will
> > >> make the deployment more flexible and more cloud-friendly. Of
> > course,
> > >> stabilizing and improving these features need additional time and
> > effort.
> > >>
> > >> So, when we merging and releasing them, we'd better give it a new
> > version
> > >> number so that user can clearly know the difference with current
> 2.x
> > >> versions. I discussed this with several developers offline, we
> > think it is
> > >> time to move toward Kylin 3.0. So, if one of the above features is
> > merged,
> > >> the version will be 3.0. The current 2.6 will be maintained until
> > 3.x is
> > >> ready for production use.
> > >>
> > >> Your comments, ideas, and suggestions are welcomed!
> > >>
> > >> Best regards,
> > >>
> > >> Shaofeng Shi 史少锋
> > >> Apache Kylin PMC
> > >> Work email: shaofeng@kyligence.io
> > >> Kyligence Inc: https://kyligence.io/
> > >>
> > >> Apache Kylin FAQ:
> > https://kylin.apache.org/docs/gettingstarted/faq.html
> > >> Join Kylin user mail group: user-subscr...@kylin.apache.org
> > >> Join Kylin dev mail group: dev-subscr...@kylin.apache.org
> > >>
> > >>
> >
>

Re: Kylin go to hdfs to find jar file

2019-02-19 Thread ShaoFeng Shi

Did you configure to use HDFS as the defaultFS in core-site.xml?

Best regards,

Shaofeng Shi 史少锋
Apache Kylin PMC
Work email: shaofeng@kyligence.io
Kyligence Inc: https://kyligence.io/

Apache Kylin FAQ: https://kylin.apache.org/docs/gettingstarted/faq.html
Join Kylin user mail group: user-subscr...@kylin.apache.org
Join Kylin dev mail group: dev-subscr...@kylin.apache.org




jiangxiaoma111 <369806...@qq.com> 于2019年2月19日周二 下午6:02写道：

> Did you find the solution of this exception
>
> --
> Sent from: http://apache-kylin.74782.x6.nabble.com/
>

Re: Kylin go to hdfs to find jar file

2019-02-19 Thread jiangxiaoma111

Did you find the solution of this exception

--
Sent from: http://apache-kylin.74782.x6.nabble.com/

Build cube exception: java.io.FileNotFoundException

2019-02-19 Thread jiangxiaoma111

hi, all. 
I am a beginner of kylin. My deployment env:
os: max os 10.12.6
hadoop: 3.0.0
hive: 2.3.1
hbase: 1.2.9
kylin: apache-kylin-2.6.0-bin-hbase1x

All project is deployed standlone. 

When I build a cube,there is a error occured in Step 3:
2019-02-19 17:22:33,757 ERROR [pool-10-thread-3]
threadpool.DefaultScheduler:116 : ExecuteException
job:f2fdcdd5-7c2b-df7c-89ea-8a61a85f8975
org.apache.kylin.job.exception.ExecuteException:
org.apache.kylin.job.exception.ExecuteException:
java.io.FileNotFoundException: File does not exist:
hdfs://localhost:8020/usr/local/var/log/hadoop/hadoop-didi/mapred/staging/didi2113494170/.staging/job_local2113494170_0001/libjars/hive-hcatalog-core-2.3.1.jar
at
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:180)
at
org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:114)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: org.apache.kylin.job.exception.ExecuteException:
java.io.FileNotFoundException: File does not exist:
hdfs://localhost:8020/usr/local/var/log/hadoop/hadoop-didi/mapred/staging/didi2113494170/.staging/job_local2113494170_0001/libjars/hive-hcatalog-core-2.3.1.jar
at
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:180)
at
org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:70)
at
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:165)
... 4 more
Caused by: java.io.FileNotFoundException: File does not exist:
hdfs://localhost:8020/usr/local/var/log/hadoop/hadoop-didi/mapred/staging/didi2113494170/.staging/job_local2113494170_0001/libjars/hive-hcatalog-core-2.3.1.jar
at
org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:1072)
at
org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:1064)
at
org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)

I searched a lot of  posts, but can not solute the exception.
Please help.



--
Sent from: http://apache-kylin.74782.x6.nabble.com/

[jira] [Created] (KYLIN-3818) After Cube disabled, merge cube job still running

2019-02-19 Thread Na Zhai (JIRA)

Na Zhai created KYLIN-3818:
--

 Summary: After Cube disabled, merge cube job still running
 Key: KYLIN-3818
 URL: https://issues.apache.org/jira/browse/KYLIN-3818
 Project: Kylin
  Issue Type: Bug
Affects Versions: v2.6.0
Reporter: Na Zhai
Assignee: Na Zhai


There is a merge cube job with an error status, then I discard the job, then I 
disable the cube, but the merge cube job still running.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Hbase table is always empty when build with spark

[jira] [Created] (KYLIN-3819) kylin.sh run in the foreground

Re: Re: [Discuss] Moving toward Apache Kylin 3.0

Re: Kylin go to hdfs to find jar file

Re: Kylin go to hdfs to find jar file

Build cube exception: java.io.FileNotFoundException

[jira] [Created] (KYLIN-3818) After Cube disabled, merge cube job still running

7 matches

Site Navigation

Mail list logo

Footer information