[jira] [Created] (KYLIN-4100) Add overall job number statistics in monitor page

2019-07-18 Thread Liu Shaohui (JIRA)
Liu Shaohui created KYLIN-4100:
--

 Summary: Add overall job number statistics in monitor page
 Key: KYLIN-4100
 URL: https://issues.apache.org/jira/browse/KYLIN-4100
 Project: Kylin
  Issue Type: Improvement
Reporter: Liu Shaohui
Assignee: Liu Shaohui


Currently it's hard to get pending and running job number in mointor page, we 
can only continue to click more until the end.

It's better to have an overall job number statistics in monitor page.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (KYLIN-4099) Using no blocking unpersist in spark cubing job

2019-07-18 Thread Liu Shaohui (JIRA)
Liu Shaohui created KYLIN-4099:
--

 Summary: Using no blocking unpersist in spark cubing job 
 Key: KYLIN-4099
 URL: https://issues.apache.org/jira/browse/KYLIN-4099
 Project: Kylin
  Issue Type: Improvement
Reporter: Liu Shaohui
Assignee: Liu Shaohui


By default, the unpersist operation of RDD in spark is blocking which may cost 
a lot time and

some times it may failed for some spark executors lost. 

We can set blocking false to improve it.
{code:java}
sun.misc.Unsafe.park(Native Method)
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:215)
java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedNanos(AbstractQueuedSynchronizer.java:1037)
java.util.concurrent.locks.AbstractQueuedSynchronizer.tryAcquireSharedNanos(AbstractQueuedSynchronizer.java:1328)
scala.concurrent.impl.Promise$DefaultPromise.tryAwait(Promise.scala:208)
scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:218)
scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:223)
scala.concurrent.Await$$anonfun$result$1.apply(package.scala:190)
scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)
scala.concurrent.Await$.result(package.scala:190)
org.apache.spark.rpc.RpcTimeout.awaitResult(RpcTimeout.scala:81)
org.apache.spark.storage.BlockManagerMaster.removeRdd(BlockManagerMaster.scala:127)
org.apache.spark.SparkContext.unpersistRDD(SparkContext.scala:1709)
org.apache.spark.rdd.RDD.unpersist(RDD.scala:216)
org.apache.spark.api.java.JavaPairRDD.unpersist(JavaPairRDD.scala:73)
org.apache.kylin.engine.spark.SparkCubingByLayer.execute(SparkCubingByLayer.java:204)
org.apache.kylin.common.util.AbstractApplication.execute(AbstractApplication.java:37)
org.apache.kylin.common.util.SparkEntry.main(SparkEntry.java:44)
sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
java.lang.reflect.Method.invoke(Method.java:498)
org.apache.spark.deploy.yarn.ApplicationMaster$$anon$2.run(ApplicationMaster.scala:653){code}



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (KYLIN-4098) Add cube auto merge api

2019-07-18 Thread Liu Shaohui (JIRA)
Liu Shaohui created KYLIN-4098:
--

 Summary: Add cube auto merge api
 Key: KYLIN-4098
 URL: https://issues.apache.org/jira/browse/KYLIN-4098
 Project: Kylin
  Issue Type: Improvement
Reporter: Liu Shaohui
Assignee: Liu Shaohui


Currently the auto merging of cube is triggered by the event of new segment is 
ready automatically. When the cluster restart, there may be too many merging 
job.

It's better to have a rest api to trigger the merging and make it more 
controllable.

 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


Re: 问题求助

2019-07-18 Thread Billy Liu
Similar to this article: https://zhuanlan.zhihu.com/p/28765127

With Warm regards

Billy Liu


张峥  于2019年7月18日周四 下午2:42写道:

> Kylin 团队,你们好
>
>
>
> 我的团队在使用中遇到一个问题,求教。
>
> 问题举例:
>
>目前KYLIN只支持在model中定义的度量值,而不能对这个度量值做任何运算。例如learn_kylin demo中的PRICE
> ,哪怕我在tableau中新增一个度量值 利润 = PRICE * 0.2,也会报错。
>
>基于这个现状,我的思路是,1、把所有的度量都事先通过SQL计算再放入KYLIN建模加速,或者2使用数据提取到tableau。
>
>方法一SQL工作量巨大,且浪费了tableau的分析能力。
>
>方法二,数据提取后,所有的度量值均变为了null。
>
>
>
> 请问有什么好的解决方法吗?
>
> 谢谢。
>
>
>
>
>
>
>
>
>
> ---
> 张峥 Frank Zhang
> 上海医时信息科技有限公司
> Ethicall Information Technology Co.,ltd
> 上海市闵行区万源路2163号新桥商务中心B座501室, 201103
> Room 501,Building B,Xinqiao Business Center,No.2163 Wanyuan Road, Minhang
> District, Shanghai
> Cell & WeChat:18652024136
> E-mail:zha...@ethicall.cn 
> www.ethicall.cn
>
> [image: signature_1707173497]
>


[jira] [Created] (KYLIN-4097) Throw exception when too many dict slice eviction in AppendTrieDictionary

2019-07-18 Thread Liu Shaohui (JIRA)
Liu Shaohui created KYLIN-4097:
--

 Summary: Throw exception when too many dict slice eviction in 
AppendTrieDictionary
 Key: KYLIN-4097
 URL: https://issues.apache.org/jira/browse/KYLIN-4097
 Project: Kylin
  Issue Type: Improvement
Reporter: Liu Shaohui
Assignee: Liu Shaohui


When the global dict is too large than the spark executor memory, there will be 
too many dict slice evictions and loads in AppendTrieDictionary, and  the build 
job will be very slow.

It's better to throw an exception in advance in this case.

 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (KYLIN-4095) Add RESOURCE_PATH_PREFIX option in ResourceTool

2019-07-18 Thread Liu Shaohui (JIRA)
Liu Shaohui created KYLIN-4095:
--

 Summary: Add RESOURCE_PATH_PREFIX option in ResourceTool
 Key: KYLIN-4095
 URL: https://issues.apache.org/jira/browse/KYLIN-4095
 Project: Kylin
  Issue Type: Improvement
Reporter: Liu Shaohui


ResourceTool is very useful to fix the metadata with overlap segments.

But downloading and uploading entire metadata is too heavy.

It's better to have a RESOURCE_PATH_PREFIX option for downloading and uploading 
cmds.

 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


问题求助

2019-07-18 Thread 张峥
Kylin 团队,你们好

 

我的团队在使用中遇到一个问题,求教。

问题举例:

   目前KYLIN只支持在model中定义的度量值,而不能对这个度量值做任何运算。例如learn_kylin 
demo中的PRICE,哪怕我在tableau中新增一个度量值 利润 = PRICE * 0.2,也会报错。

   基于这个现状,我的思路是,1、把所有的度量都事先通过SQL计算再放入KYLIN建模加速,或者2使用数据提取到tableau。

   方法一SQL工作量巨大,且浪费了tableau的分析能力。

   方法二,数据提取后,所有的度量值均变为了null。

 

请问有什么好的解决方法吗?

谢谢。

 

 

 

 

---
张峥 Frank Zhang
上海医时信息科技有限公司 
Ethicall Information Technology Co.,ltd
上海市闵行区万源路2163号新桥商务中心B座501室, 201103
Room 501,Building B,Xinqiao Business Center,No.2163 Wanyuan Road, Minhang 
District, Shanghai
Cell & WeChat:18652024136
E-mail:zha...@ethicall.cn 
www.ethicall.cn




[jira] [Created] (KYLIN-4094) Add script to create system tables and cubes automatically

2019-07-18 Thread Liu Shaohui (JIRA)
Liu Shaohui created KYLIN-4094:
--

 Summary: Add script to create system tables and cubes automatically
 Key: KYLIN-4094
 URL: https://issues.apache.org/jira/browse/KYLIN-4094
 Project: Kylin
  Issue Type: Improvement
Reporter: Liu Shaohui
Assignee: Liu Shaohui


See: [http://kylin.apache.org/docs/tutorial/setup_systemcube.html]

It's a little complex to setup the the system cubes. We can add a scripts to 
make it easier.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (KYLIN-4093) Slow query pages should be open to all users of the project

2019-07-18 Thread Liu Shaohui (JIRA)
Liu Shaohui created KYLIN-4093:
--

 Summary: Slow query pages should be open to all users of the 
project
 Key: KYLIN-4093
 URL: https://issues.apache.org/jira/browse/KYLIN-4093
 Project: Kylin
  Issue Type: Improvement
Reporter: Liu Shaohui


Currently the show query page only can been seen for kylin admins. It's very 
useful for modlers and analysts for this project.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)