[jira] [Updated] (KYLIN-4552) Add a switch to enable/disable flink cubing step: convert to HFile

2020-06-09 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue updated KYLIN-4552:
--
Fix Version/s: v3.1.0

> Add a switch to enable/disable flink cubing step: convert to HFile
> --
>
> Key: KYLIN-4552
> URL: https://issues.apache.org/jira/browse/KYLIN-4552
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (KYLIN-4552) Add a switch to enable/disable flink cubing step: convert to HFile

2020-06-09 Thread Harvey Yue (Jira)
Harvey Yue created KYLIN-4552:
-

 Summary: Add a switch to enable/disable flink cubing step: convert 
to HFile
 Key: KYLIN-4552
 URL: https://issues.apache.org/jira/browse/KYLIN-4552
 Project: Kylin
  Issue Type: Sub-task
  Components: Flink Engine
Affects Versions: v3.1.1
Reporter: Harvey Yue
Assignee: Harvey Yue








[jira] [Updated] (KYLIN-4552) Add a switch to enable/disable flink cubing step: convert to HFile

2020-06-09 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue updated KYLIN-4552:
--
Affects Version/s: (was: v3.1.1)

> Add a switch to enable/disable flink cubing step: convert to HFile
> --
>
> Key: KYLIN-4552
> URL: https://issues.apache.org/jira/browse/KYLIN-4552
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
>






[jira] [Resolved] (KYLIN-4482) Too many logging segment info with CubeBuildJob step

2020-06-06 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue resolved KYLIN-4482.
---
Resolution: Fixed

> Too many logging segment info with CubeBuildJob step
> 
>
> Key: KYLIN-4482
> URL: https://issues.apache.org/jira/browse/KYLIN-4482
> Project: Kylin
>  Issue Type: Bug
>  Components: Spark Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Resolved] (KYLIN-4483) Avoid to build global dictionaries with empty ColumnDesc collection

2020-06-06 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue resolved KYLIN-4483.
---
Resolution: Fixed

> Avoid to build global dictionaries with empty ColumnDesc collection
> ---
>
> Key: KYLIN-4483
> URL: https://issues.apache.org/jira/browse/KYLIN-4483
> Project: Kylin
>  Issue Type: Bug
>  Components: Spark Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Resolved] (KYLIN-4467) Support TopN by kylin on Parquet

2020-06-06 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue resolved KYLIN-4467.
---
Resolution: Fixed

> Support TopN by kylin on Parquet
> 
>
> Key: KYLIN-4467
> URL: https://issues.apache.org/jira/browse/KYLIN-4467
> Project: Kylin
>  Issue Type: Improvement
>  Components: Query Engine
>Reporter: wangrupeng
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Comment Edited] (KYLIN-4501) Support alluxio to adapt more storage like s3, oss ...

2020-05-22 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113896#comment-17113896
 ] 

Harvey Yue edited comment on KYLIN-4501 at 5/22/20, 9:56 AM:
-

*Kylin and Alluxio related configuration changes:*

alluxio-site.properties

# Fixes the failure to delete cuboid temp directories (alluxio.exception.DirectoryNotEmptyException)
alluxio.user.file.delete.unchecked=true

kylin.properties

kylin.env.hdfs-working-dir=alluxio://192.168.xxx.xxx:19998/kylin
kylin.storage.url=alluxio
kylin.engine.spark-conf.spark.eventLog.dir=alluxio://192.168.xxx.xxx:19998/kylin/spark-history
kylin.engine.spark-conf.spark.history.fs.logDirectory=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

*Spark jars:*

Copy ALLUXIO_HOME/client/alluxio-2.2.0-client.jar to SPARK_HOME/jars.
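
The four kylin.properties entries above all derive from a single Alluxio base URI, so one way to keep them consistent is to generate them. The following is only a sketch: the function names and the placeholder host are assumptions made for illustration and are not part of Kylin or Alluxio.

```python
def alluxio_kylin_overrides(master_host, port=19998, root="/kylin"):
    """Build the four kylin.properties entries from one Alluxio base URI."""
    base = "alluxio://%s:%d%s" % (master_host, port, root)
    history = base + "/spark-history"
    return {
        "kylin.env.hdfs-working-dir": base,
        "kylin.storage.url": "alluxio",
        "kylin.engine.spark-conf.spark.eventLog.dir": history,
        "kylin.engine.spark-conf.spark.history.fs.logDirectory": history,
    }


def to_properties(entries):
    """Render a dict as key=value lines, one per entry, in insertion order."""
    return "\n".join("%s=%s" % (key, value) for key, value in entries.items())


if __name__ == "__main__":
    # "alluxio-master.example" is a placeholder host, not a value from the comment.
    print(to_properties(alluxio_kylin_overrides("alluxio-master.example")))
```

The generated lines would still have to be merged into kylin.properties by hand or by a deployment tool.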

 


was (Author: harveyyue):
Kylin and Alluxio related configuration changes:

alluxio-site.properties

// fix delete cuboid temp directories issue: 
alluxio.exception.DirectoryNotEmptyException

alluxio.user.file.delete.unchecked=true

 

kylin.properties

kylin.env.hdfs-working-dir=*alluxio://192.168.xxx.xxx:19998/kylin*

kylin.storage.url=*alluxio*

kylin.engine.spark-conf.spark.eventLog.dir=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

kylin.engine.spark-conf.spark.history.fs.logDirectory=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

 

 

 

> Support alluxio to adapt more storage like s3, oss ...
> --
>
> Key: KYLIN-4501
> URL: https://issues.apache.org/jira/browse/KYLIN-4501
> Project: Kylin
>  Issue Type: New Feature
>  Components: Spark Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Comment Edited] (KYLIN-4501) Support alluxio to adapt more storage like s3, oss ...

2020-05-22 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113896#comment-17113896
 ] 

Harvey Yue edited comment on KYLIN-4501 at 5/22/20, 9:52 AM:
-

Kylin and Alluxio related configuration changes:

alluxio-site.properties

// fix delete cuboid temp directories issue: 
alluxio.exception.DirectoryNotEmptyException

alluxio.user.file.delete.unchecked=true

 

kylin.properties

kylin.env.hdfs-working-dir=*alluxio://192.168.xxx.xxx:19998/kylin*

kylin.storage.url=*alluxio*

kylin.engine.spark-conf.spark.eventLog.dir=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

kylin.engine.spark-conf.spark.history.fs.logDirectory=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

 

 

 


was (Author: harveyyue):
Kylin and Alluxio related configuration changes:

alluxio-site.properties

## fix delete cuboid temp directories issue: 
alluxio.exception.DirectoryNotEmptyException

 

alluxio.user.file.delete.unchecked=true

 

kylin.properties

kylin.env.hdfs-working-dir=*alluxio://192.168.xxx.xxx:19998/kylin*

kylin.storage.url=*alluxio*

kylin.engine.spark-conf.spark.eventLog.dir=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

kylin.engine.spark-conf.spark.history.fs.logDirectory=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

 

 

 

> Support alluxio to adapt more storage like s3, oss ...
> --
>
> Key: KYLIN-4501
> URL: https://issues.apache.org/jira/browse/KYLIN-4501
> Project: Kylin
>  Issue Type: New Feature
>  Components: Spark Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Comment Edited] (KYLIN-4501) Support alluxio to adapt more storage like s3, oss ...

2020-05-22 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113896#comment-17113896
 ] 

Harvey Yue edited comment on KYLIN-4501 at 5/22/20, 9:52 AM:
-

Kylin and Alluxio related configuration changes:

alluxio-site.properties

## fix delete cuboid temp directories issue: 
alluxio.exception.DirectoryNotEmptyException

 

alluxio.user.file.delete.unchecked=true

 

kylin.properties

kylin.env.hdfs-working-dir=*alluxio://192.168.xxx.xxx:19998/kylin*

kylin.storage.url=*alluxio*

kylin.engine.spark-conf.spark.eventLog.dir=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

kylin.engine.spark-conf.spark.history.fs.logDirectory=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

 

 

 


was (Author: harveyyue):
Configurations:

kylin.properties
 # kylin.env.hdfs-working-dir=alluxio://192.168.xxx.xxx:19998/kylin
 # kylin.storage.url=alluxio
 # 
kylin.engine.spark-conf.spark.eventLog.dir=alluxio://192.168.xxx.xxx:19998/kylin/spark-history
 # 
kylin.engine.spark-conf.spark.history.fs.logDirectory=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

> Support alluxio to adapt more storage like s3, oss ...
> --
>
> Key: KYLIN-4501
> URL: https://issues.apache.org/jira/browse/KYLIN-4501
> Project: Kylin
>  Issue Type: New Feature
>  Components: Spark Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Commented] (KYLIN-4501) Support alluxio to adapt more storage like s3, oss ...

2020-05-22 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113896#comment-17113896
 ] 

Harvey Yue commented on KYLIN-4501:
---

Configurations:

kylin.properties
 # kylin.env.hdfs-working-dir=alluxio://192.168.xxx.xxx:19998/kylin
 # kylin.storage.url=alluxio
 # 
kylin.engine.spark-conf.spark.eventLog.dir=alluxio://192.168.xxx.xxx:19998/kylin/spark-history
 # 
kylin.engine.spark-conf.spark.history.fs.logDirectory=alluxio://192.168.xxx.xxx:19998/kylin/spark-history

> Support alluxio to adapt more storage like s3, oss ...
> --
>
> Key: KYLIN-4501
> URL: https://issues.apache.org/jira/browse/KYLIN-4501
> Project: Kylin
>  Issue Type: New Feature
>  Components: Spark Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Updated] (KYLIN-4502) Add alluxio dependence jar

2020-05-19 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue updated KYLIN-4502:
--
Component/s: Spark Engine

> Add alluxio dependence jar
> --
>
> Key: KYLIN-4502
> URL: https://issues.apache.org/jira/browse/KYLIN-4502
> Project: Kylin
>  Issue Type: New Feature
>  Components: Spark Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Created] (KYLIN-4502) Add alluxio dependence jar

2020-05-19 Thread Harvey Yue (Jira)
Harvey Yue created KYLIN-4502:
-

 Summary: Add alluxio dependence jar
 Key: KYLIN-4502
 URL: https://issues.apache.org/jira/browse/KYLIN-4502
 Project: Kylin
  Issue Type: New Feature
Reporter: Harvey Yue
Assignee: Harvey Yue
 Fix For: v4.0.0-beta








[jira] [Created] (KYLIN-4501) Support alluxio to adapt more storage like s3, oss ...

2020-05-19 Thread Harvey Yue (Jira)
Harvey Yue created KYLIN-4501:
-

 Summary: Support alluxio to adapt more storage like s3, oss ...
 Key: KYLIN-4501
 URL: https://issues.apache.org/jira/browse/KYLIN-4501
 Project: Kylin
  Issue Type: New Feature
  Components: Spark Engine
Reporter: Harvey Yue
Assignee: Harvey Yue
 Fix For: v4.0.0-beta








[jira] [Updated] (KYLIN-3849) Flink cubing step : convert to HFile

2020-05-14 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue updated KYLIN-3849:
--
Attachment: flink_cube_build.jpg

> Flink cubing step : convert to HFile
> 
>
> Key: KYLIN-3849
> URL: https://issues.apache.org/jira/browse/KYLIN-3849
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
> Attachments: flink_cube_build.jpg, hfile.jpg
>
>






[jira] [Updated] (KYLIN-3849) Flink cubing step : convert to HFile

2020-05-14 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue updated KYLIN-3849:
--
Attachment: hfile.jpg

> Flink cubing step : convert to HFile
> 
>
> Key: KYLIN-3849
> URL: https://issues.apache.org/jira/browse/KYLIN-3849
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
> Attachments: flink_cube_build.jpg, hfile.jpg
>
>






[jira] [Commented] (KYLIN-3849) Flink cubing step : convert to HFile

2020-05-14 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17107017#comment-17107017
 ] 

Harvey Yue commented on KYLIN-3849:
---

Hi ShaoFeng,

With 30 million+ (3000w+) records, MR needs about 10+ minutes, but Flink 1.10 needs 
25 minutes. I have tried adjusting the Flink TaskExecutor parameters for task heap and 
Flink managed memory (which could help the sorting operator), but I may still be 
missing something. Please put this PR on hold; I will do more tuning work.

See XinTong Song's talk on the TaskExecutor memory model:

[https://www.bilibili.com/s/video/BV1At4y1U7vH]

Please also refer to the attached snapshots of the Flink HFile step.

> Flink cubing step : convert to HFile
> 
>
> Key: KYLIN-3849
> URL: https://issues.apache.org/jira/browse/KYLIN-3849
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
>






[jira] [Created] (KYLIN-4483) Avoid to build global dictionaries with empty ColumnDesc collection

2020-05-08 Thread Harvey Yue (Jira)
Harvey Yue created KYLIN-4483:
-

 Summary: Avoid to build global dictionaries with empty ColumnDesc 
collection
 Key: KYLIN-4483
 URL: https://issues.apache.org/jira/browse/KYLIN-4483
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Reporter: Harvey Yue
Assignee: Harvey Yue
 Fix For: v4.0.0-beta








[jira] [Commented] (KYLIN-4482) Too many logging segment info with CubeBuildJob step

2020-05-08 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103034#comment-17103034
 ] 

Harvey Yue commented on KYLIN-4482:
---

Adding pr #184

> Too many logging segment info with CubeBuildJob step
> 
>
> Key: KYLIN-4482
> URL: https://issues.apache.org/jira/browse/KYLIN-4482
> Project: Kylin
>  Issue Type: Bug
>  Components: Spark Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>






[jira] [Created] (KYLIN-4482) Too many logging segment info with CubeBuildJob step

2020-05-08 Thread Harvey Yue (Jira)
Harvey Yue created KYLIN-4482:
-

 Summary: Too many logging segment info with CubeBuildJob step
 Key: KYLIN-4482
 URL: https://issues.apache.org/jira/browse/KYLIN-4482
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Reporter: Harvey Yue
Assignee: Harvey Yue
 Fix For: v4.0.0-beta








[jira] [Commented] (KYLIN-4455) Query will fail when set calcite.debug=true

2020-04-28 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094974#comment-17094974
 ] 

Harvey Yue commented on KYLIN-4455:
---

In OLAPProjectRel, we add the Kylin measure columns to AbstractRelNode#rowType and 
the corresponding RexNodes to OLAPProjectRel#rewriteProjects instead of Project#exps. 
Because Project#exps is a final property, we cannot reassign it to include the new 
RexNodes, so printing the plan fails with the exception:

Caused by: java.lang.IndexOutOfBoundsException: index (3) must be less than size (3)
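
The mismatch can be illustrated with a toy model. This is not Calcite or Kylin code. It assumes that the plan printer walks the row-type fields and looks up the expression at the same index, and all column names are hypothetical.

```python
def explain_projection(field_names, exps):
    """Pair each row-type field name with the expression at the same index."""
    return ["%s=[%s]" % (name, exps[i]) for i, name in enumerate(field_names)]


def try_explain(field_names, exps):
    """Return the explained terms, or a failure message if the lookup overruns."""
    try:
        return explain_projection(field_names, exps)
    except IndexError as err:
        return "failed: %s" % err


# Hypothetical names: three projected columns plus one rewritten measure column.
row_type_fields = ["COL_A", "COL_B", "COL_C", "MEASURE_X"]
# A tuple stands in for the final Project#exps, which cannot be extended.
exps = ("$0", "$1", "$2")

if __name__ == "__main__":
    # Four fields but only three expressions: the lookup at index 3 fails.
    print(try_explain(row_type_fields, exps))
    # With matching lengths the same walk succeeds.
    print(try_explain(row_type_fields[:3], exps))
```

The row type has one more entry than the expression list, so the lookup at index 3 overruns a list of size 3, matching the numbers in the exception above.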

> Query will fail when set calcite.debug=true
> ---
>
> Key: KYLIN-4455
> URL: https://issues.apache.org/jira/browse/KYLIN-4455
> Project: Kylin
>  Issue Type: Bug
>  Components: Storage - Parquet
>Reporter: xuekaiqi
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>
> Query will fail when calcite.debug=true is set





[jira] [Assigned] (KYLIN-4455) Query will fail when set calcite.debug=true

2020-04-28 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue reassigned KYLIN-4455:
-

Assignee: Harvey Yue

> Query will fail when set calcite.debug=true
> ---
>
> Key: KYLIN-4455
> URL: https://issues.apache.org/jira/browse/KYLIN-4455
> Project: Kylin
>  Issue Type: Bug
>  Components: Storage - Parquet
>Reporter: xuekaiqi
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v4.0.0-beta
>
>
> Query will fail when calcite.debug=true is set





[jira] [Reopened] (KYLIN-4467) Support TopN by kylin on Parquet

2020-04-27 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue reopened KYLIN-4467:
---
  Assignee: Harvey Yue  (was: wangrupeng)

Reopening this issue: querying a TopN measure with SQL throws an 
ArrayIndexOutOfBoundsException:

    at org.apache.kylin.engine.spark2.NExecAndComp.sql(NExecAndComp.java:425)
    at org.apache.kylin.engine.spark2.NExecAndComp.sql(NExecAndComp.java:425)
    at org.apache.kylin.engine.spark2.NExecAndComp.sql(NExecAndComp.java:408)
    at org.apache.kylin.engine.spark2.BuildAndQueryEmptySegmentsTest.testEmptySegments(BuildAndQueryEmptySegmentsTest.java:57)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
    at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
    at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
    at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
    at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
    at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
    at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
    at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
    at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
    at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
    at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
    at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
    at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
    at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
    at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
    at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
    at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
    at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(JUnit4IdeaTestRunner.java:68)
    at com.intellij.rt.junit.IdeaTestRunner$Repeater.startRunnerWithArgs(IdeaTestRunner.java:33)
    at com.intellij.rt.junit.JUnitStarter.prepareStreamsAndStart(JUnitStarter.java:230)
    at com.intellij.rt.junit.JUnitStarter.main(JUnitStarter.java:58)
Caused by: java.sql.SQLException: exception while executing query: java.lang.ArrayIndexOutOfBoundsException: 12
    at org.apache.calcite.avatica.Helper.createException(Helper.java:56)
    at org.apache.calcite.avatica.Helper.createException(Helper.java:41)
    at org.apache.calcite.avatica.AvaticaConnection.executeQueryInternal(AvaticaConnection.java:577)
    at org.apache.calcite.avatica.AvaticaPreparedStatement.executeQuery(AvaticaPreparedStatement.java:137)
    at org.apache.kylin.engine.spark2.NExecAndComp.queryCube(NExecAndComp.java:455)
    at org.apache.kylin.engine.spark2.NExecAndComp.queryCubeAndSkipCompute(NExecAndComp.java:431)
    at org.apache.kylin.engine.spark2.NExecAndComp.sql(NExecAndComp.java:418)
    ... 27 more
Caused by: java.lang.RuntimeException: java.lang.ArrayIndexOutOfBoundsException: 12
    at org.apache.kylin.query.exec.SparkExec.collectToEnumerable(SparkExec.java:46)
    at Baz.bind(Unknown Source)
    at org.apache.calcite.jdbc.CalcitePrepare$CalciteSignature.enumerable(CalcitePrepare.java:365)
    at org.apache.calcite.jdbc.CalciteConnectionImpl.enumerable(CalciteConnectionImpl.java:301)
    at org.apache.calcite.jdbc.CalciteMetaImpl._createIterable(CalciteMetaImpl.java:559)
    at org.apache.calcite.jdbc.CalciteMetaImpl.createIterable(CalciteMetaImpl.java:550)
    at org.apache.calcite.avatica.AvaticaResultSet.execute(AvaticaResultSet.java:182)
    at org.apache.calcite.jdbc.CalciteResultSet.execute(CalciteResultSet.java:67)
    at org.apache.calcite.jdbc.CalciteResultSet.execute(CalciteResultSet.java:44)
    at org.apache.calcite.avatica.AvaticaConnection.executeQueryInternal(AvaticaConnection.java:573)
    ... 31 more
Caused by: java.lang.ArrayIndexOutOfBoundsException: 12
    at org.apache.kylin.query.runtime.RuntimeHelper$$anonfun$gtSchemaToCalciteSchema$2.apply(RuntimeHelper.scala:118)
    at org.apache.kylin.query.runtime.RuntimeHelper$$anonfun$gtSchemaToCalciteSchema$2.apply(RuntimeHelper.scala:97)
    at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
    at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
    at scala.collection.Iterator$class.foreach(Iterator.scala:893)
    at scala.collection.AbstractIterator.foreach(Iterator.scala:1336)
    at scala.collection.IterableLike$class.foreach(IterableLike.scala:72)
    at scala.collection.AbstractIterable.foreach(Iterable.scala:54)
    at

[jira] [Resolved] (KYLIN-4402) Missing data size in web UI with flink cubing

2020-03-10 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue resolved KYLIN-4402.
---
Resolution: Fixed

> Missing data size in web UI with flink cubing
> -
>
> Key: KYLIN-4402
> URL: https://issues.apache.org/jira/browse/KYLIN-4402
> Project: Kylin
>  Issue Type: Sub-task
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0
>
> Attachments: image-2020-03-04-10-58-45-832.png
>
>
> 1. Missing data size label in web UI
> 2. Cuboid file not configured to compress, reduced the replication numbers





[jira] [Assigned] (KYLIN-3849) Flink cubing step : convert to HFile

2020-03-10 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue reassigned KYLIN-3849:
-

Assignee: Harvey Yue  (was: vinoyang)

> Flink cubing step : convert to HFile
> 
>
> Key: KYLIN-3849
> URL: https://issues.apache.org/jira/browse/KYLIN-3849
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
>






[jira] [Commented] (KYLIN-3904) Support more dependency jars in FlinkExecutable for FlinkCubeHFile

2020-03-10 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-3904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055654#comment-17055654
 ] 

Harvey Yue commented on KYLIN-3904:
---

Add a "-C file://%s" parameter to the flink run command for each dependency jar, for example:

flink run -m yarn-cluster  -ytm 4G -yjm 2G -yD taskmanager.memory.preallocate 
false -ys 1 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-server-2.1.0-cdh6.2.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-shaded-protobuf-2.1.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-metrics-2.1.0-cdh6.2.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-metrics-api-2.1.0-cdh6.2.0.jar
 -C file:///opt/kylin/tomcat/webapps/kylin/WEB-INF/lib/metrics-core-2.2.0.jar 
-C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-protocol-2.1.0-cdh6.2.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-protocol-shaded-2.1.0-cdh6.2.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-mapreduce-2.1.0-cdh6.2.0.jar
 -C file:///opt/spark-2.4.4-bin-hadoop2.7/jars/htrace-core-3.1.0-incubating.jar 
-C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-common-2.1.0-cdh6.2.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-shaded-miscellaneous-2.1.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-hadoop-compat-2.1.0-cdh6.2.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-hadoop2-compat-2.1.0-cdh6.2.0.jar
 -C 
file:///opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/jars/hbase-client-2.1.0-cdh6.2.0.jar
 -c org.apache.kylin.common.util.FlinkEntry -p 8 
/opt/kylin/lib/kylin-job-3.0.1.jar -className 
org.apache.kylin.storage.hbase.steps.FlinkCubeHFile -output 
hdfs://ng-dc-master:8020/kylin/kylin_metadata/kylin-51cc6449-15dd-b81f-940f-23a9cd95bd2e/mileage_cube/hfile
 -partitions 
hdfs://ng-dc-master:8020/kylin/kylin_metadata/kylin-51cc6449-15dd-b81f-940f-23a9cd95bd2e/mileage_cube/rowkey_stats/part-r-0_hfile
 -input 
hdfs://ng-dc-master:8020/kylin/kylin_metadata/kylin-51cc6449-15dd-b81f-940f-23a9cd95bd2e/mileage_cube/cuboid/level_6_cuboid
 -enableObjectReuse false -segmentId 66eb63d4-b2dd-1d37-62a6-9abc6f6e1e01 
-metaUrl 
kylin_metadata@hdfs,path=hdfs://ng-dc-master:8020/kylin/kylin_metadata/kylin-51cc6449-15dd-b81f-940f-23a9cd95bd2e/mileage_cube/metadata
 -cubename mileage_cube -hbaseConfPath 
hdfs://ng-dc-master:8020/kylin/kylin_metadata/kylin-51cc6449-15dd-b81f-940f-23a9cd95bd2e/hbase-conf.xml
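
The repeated "-C file://%s" pairs in the command above can be produced from a list of jar paths. The sketch below shows that expansion in Python. The function name is hypothetical, and it is an illustration of the formatting only, not the code in FlinkExecutable.

```python
def classpath_args(jar_paths):
    """Expand local jar paths into repeated '-C file://<path>' argument pairs."""
    args = []
    for path in jar_paths:
        # An absolute path such as /opt/x.jar yields file:///opt/x.jar.
        args.extend(["-C", "file://%s" % path])
    return args


if __name__ == "__main__":
    jars = [
        "/opt/kylin/tomcat/webapps/kylin/WEB-INF/lib/metrics-core-2.2.0.jar",
        "/opt/spark-2.4.4-bin-hadoop2.7/jars/htrace-core-3.1.0-incubating.jar",
    ]
    print(" ".join(classpath_args(jars)))
```

The two paths in the usage example are taken from the command above. An empty list of jars simply yields no extra arguments.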

> Support more dependency jars in FlinkExecutable for FlinkCubeHFile
> --
>
> Key: KYLIN-3904
> URL: https://issues.apache.org/jira/browse/KYLIN-3904
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
>






[jira] [Assigned] (KYLIN-3904) Support more dependency jars in FlinkExecutable for FlinkCubeHFile

2020-03-10 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue reassigned KYLIN-3904:
-

Assignee: Harvey Yue  (was: vinoyang)

> Support more dependency jars in FlinkExecutable for FlinkCubeHFile
> --
>
> Key: KYLIN-3904
> URL: https://issues.apache.org/jira/browse/KYLIN-3904
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
>






[jira] [Commented] (KYLIN-3847) Flink cubing step : fact distinct

2020-03-10 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055617#comment-17055617
 ] 

Harvey Yue commented on KYLIN-3847:
---

The following property needs to be set to enable the Flink fact distinct step:

kylin.engine.flink-fact-distinct=true

> Flink cubing step : fact distinct
> -
>
> Key: KYLIN-3847
> URL: https://issues.apache.org/jira/browse/KYLIN-3847
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
>






[jira] [Assigned] (KYLIN-3847) Flink cubing step : fact distinct

2020-03-10 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-3847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue reassigned KYLIN-3847:
-

Assignee: Harvey Yue  (was: vinoyang)

> Flink cubing step : fact distinct
> -
>
> Key: KYLIN-3847
> URL: https://issues.apache.org/jira/browse/KYLIN-3847
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
>






[jira] [Commented] (KYLIN-4402) Missing data size in web UI with flink cubing

2020-03-03 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17050881#comment-17050881
 ] 

Harvey Yue commented on KYLIN-4402:
---

Regarding point 1: the default log4j/logback configuration file does not print 
information to the console, so the following setting in 
$FLINK_HOME/conf/log4j-cli.properties needs to be changed:

log4j.rootLogger=INFO, file => log4j.rootLogger=INFO, file, *console*
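
The rootLogger change above could also be applied with a small script. This is a sketch only: the function name is hypothetical, and it assumes that an appender named console is already defined elsewhere in the properties file.

```python
def add_console_appender(properties_text):
    """Append 'console' to the log4j.rootLogger appender list if it is missing."""
    out = []
    for line in properties_text.splitlines():
        key, sep, value = line.partition("=")
        if sep and key.strip() == "log4j.rootLogger":
            parts = [part.strip() for part in value.split(",")]
            if "console" not in parts:
                parts.append("console")
            line = "log4j.rootLogger=" + ", ".join(parts)
        out.append(line)
    return "\n".join(out)


if __name__ == "__main__":
    print(add_console_appender("log4j.rootLogger=INFO, file"))
```

Running the function twice leaves the line unchanged the second time, so it is safe to reapply.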

> Missing data size in web UI with flink cubing
> -
>
> Key: KYLIN-4402
> URL: https://issues.apache.org/jira/browse/KYLIN-4402
> Project: Kylin
>  Issue Type: Sub-task
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0
>
> Attachments: image-2020-03-04-10-58-45-832.png
>
>
> 1. Missing data size label in web UI
> 2. Cuboid file not configured to compress, reduced the replication numbers





[jira] [Assigned] (KYLIN-4392) Bump Flink version from 1.9.2 to 1.10.0

2020-03-03 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue reassigned KYLIN-4392:
-

Assignee: Harvey Yue  (was: vinoyang)

> Bump Flink version from 1.9.2 to 1.10.0
> ---
>
> Key: KYLIN-4392
> URL: https://issues.apache.org/jira/browse/KYLIN-4392
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
>
> Flink recently released v1.10.0, a major version that contains some 
> great features and big changes. We can tentatively upgrade to v1.10.0 after 
> first upgrading to v1.9.2. If we find any fatal issues, we can fall back 
> to v1.9.2.





[jira] [Created] (KYLIN-4402) Missing data size in web UI with flink cubing

2020-03-03 Thread Harvey Yue (Jira)
Harvey Yue created KYLIN-4402:
-

 Summary: Missing data size in web UI with flink cubing
 Key: KYLIN-4402
 URL: https://issues.apache.org/jira/browse/KYLIN-4402
 Project: Kylin
  Issue Type: Sub-task
Reporter: Harvey Yue
Assignee: Harvey Yue
 Fix For: v3.1.0
 Attachments: image-2020-03-04-10-58-45-832.png

1. Missing data size label in web UI
2. Cuboid file not configured to compress, reduced the replication numbers





[jira] [Resolved] (KYLIN-4387) Flink cubing merge step failed

2020-03-03 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue resolved KYLIN-4387.
---
Resolution: Fixed

> Flink cubing merge step failed
> --
>
> Key: KYLIN-4387
> URL: https://issues.apache.org/jira/browse/KYLIN-4387
> Project: Kylin
>  Issue Type: Bug
>  Components: Flink Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0
>
>
> When the job runs this step, it hits the following issue. I suspect that two 
> class loaders load the class *InstantiatorStrategy* when Flink executes the 
> reduceGroup operator.
> Workaround: use ByteArray instead of Text in the reduceGroup operator.
> The root cause is that the Kylin binary includes the Flink core jar; we need 
> to change the scope of the Flink-related dependencies to provided.
>  
> The program finished with the following exception:
> org.apache.flink.client.program.ProgramInvocationException: The main method 
> caused an error: error execute 
> org.apache.kylin.engine.flink.FlinkCubingMerge. Root cause: Job failed. 
> (JobID: 7a40a92561cfa972e3c5b975db993fce)
>  at 
> org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:593)
>  at 
> org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:438)
>  at 
> org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:274)
>  at 
> org.apache.flink.client.cli.CliFrontend.executeProgram(CliFrontend.java:746)
>  at 
> org.apache.flink.client.cli.CliFrontend.runProgram(CliFrontend.java:273)
>  at org.apache.flink.client.cli.CliFrontend.run(CliFrontend.java:205)
>  at 
> org.apache.flink.client.cli.CliFrontend.parseParameters(CliFrontend.java:1010)
>  at 
> org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1083)
>  at java.security.AccessController.doPrivileged(Native Method)
>  at javax.security.auth.Subject.doAs(Subject.java:422)
>  at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1836)
>  at 
> org.apache.flink.runtime.security.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
>  at 
> org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1083)
>  Caused by: java.lang.RuntimeException: error execute 
> org.apache.kylin.engine.flink.FlinkCubingMerge. Root cause: Job failed. 
> (JobID: 7a40a92561cfa972e3c5b975db993fce)
>  at 
> org.apache.kylin.common.util.AbstractApplication.execute(AbstractApplication.java:42)
>  at org.apache.kylin.common.util.FlinkEntry.main(FlinkEntry.java:46)
>  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>  at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>  at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at java.lang.reflect.Method.invoke(Method.java:498)
>  at 
> org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:576)
>  ... 12 more
>  Caused by: org.apache.flink.client.program.ProgramInvocationException: Job 
> failed. (JobID: 7a40a92561cfa972e3c5b975db993fce)
>  at 
> org.apache.flink.client.program.rest.RestClusterClient.submitJob(RestClusterClient.java:262)
>  at 
> org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:338)
>  at 
> org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:326)
>  at 
> org.apache.flink.client.program.ContextEnvironment.execute(ContextEnvironment.java:62)
>  at 
> org.apache.kylin.engine.flink.FlinkCubingMerge.execute(FlinkCubingMerge.java:235)
>  at 
> org.apache.kylin.common.util.AbstractApplication.execute(AbstractApplication.java:37)
>  ... 18 more
>  Caused by: org.apache.flink.runtime.client.JobExecutionException: Job 
> execution failed.
>  at 
> org.apache.flink.runtime.jobmaster.JobResult.toJobExecutionResult(JobResult.java:146)
>  at 
> org.apache.flink.client.program.rest.RestClusterClient.submitJob(RestClusterClient.java:259)
>  ... 23 more
>  Caused by: java.lang.LinkageError: loader constraint violation: when 
> resolving method 
> "com.esotericsoftware.kryo.Kryo$DefaultInstantiatorStrategy.setFallbackInstantiatorStrategy(*Lorg/objenesis/strategy/InstantiatorStrategy*;)V"
>  the class loader (instance of org/apache/flink/util/*ChildFirstClassLoader*) 
> of the current class, 
> org/apache/flink/api/java/typeutils/runtime/WritableComparator, and the class 
> loader (instance of sun/misc/Launcher$*AppClassLoader*) for the method's 
> defining class, com/esotericsoftware/kryo/Kryo$DefaultInstantiatorStrategy, 
> have 

[jira] [Commented] (KYLIN-4391) Bump Flink version from 1.9.0 to 1.9.2

2020-03-02 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17049966#comment-17049966
 ] 

Harvey Yue commented on KYLIN-4391:
---

After discussing with Vino, I will take over this task and verify cube 
building with Flink 1.9.2 and 1.10.0.

> Bump Flink version from 1.9.0 to 1.9.2
> --
>
> Key: KYLIN-4391
> URL: https://issues.apache.org/jira/browse/KYLIN-4391
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: vinoyang
>Priority: Major
>
> Flink 1.9.2 is a stable version in the 1.9 series. We should upgrade the 
> Flink engine to this version.





[jira] [Assigned] (KYLIN-4391) Bump Flink version from 1.9.0 to 1.9.2

2020-03-02 Thread Harvey Yue (Jira)


 [ 
https://issues.apache.org/jira/browse/KYLIN-4391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harvey Yue reassigned KYLIN-4391:
-

Assignee: Harvey Yue  (was: vinoyang)

> Bump Flink version from 1.9.0 to 1.9.2
> --
>
> Key: KYLIN-4391
> URL: https://issues.apache.org/jira/browse/KYLIN-4391
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: vinoyang
>Assignee: Harvey Yue
>Priority: Major
>
> Flink 1.9.2 is a stable version in the 1.9 series. We should upgrade the 
> Flink engine to this version.





[jira] [Commented] (KYLIN-4387) Flink cubing merge step failed

2020-03-01 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17048782#comment-17048782
 ] 

Harvey Yue commented on KYLIN-4387:
---

Please ignore pull request #1125; its branch contained an extra commit.

> Flink cubing merge step failed
> --
>
> Key: KYLIN-4387
> URL: https://issues.apache.org/jira/browse/KYLIN-4387
> Project: Kylin
>  Issue Type: Bug
>  Components: Flink Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0

[jira] [Commented] (KYLIN-4389) Change Flink jar dependence from compile to provided

2020-03-01 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17048775#comment-17048775
 ] 

Harvey Yue commented on KYLIN-4389:
---

Please ignore pull request #1128; its branch contained two extra commits.

> Change Flink jar dependence from compile to provided 
> -
>
> Key: KYLIN-4389
> URL: https://issues.apache.org/jira/browse/KYLIN-4389
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0
>
>
> Currently, building the engine-flink module bundles the Flink core jars into 
> the Kylin package, which inflates the package by 70 MB, so I suggest changing 
> the Flink dependency scope to provided.
> One more thing: I suspect this fat jar (kylin-job-xxx.jar) causes issue 
> KYLIN-4387.





[jira] [Commented] (KYLIN-4389) Change Flink jar dependence from compile to provided

2020-03-01 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17048530#comment-17048530
 ] 

Harvey Yue commented on KYLIN-4389:
---

I have verified these changes in my CDH 6.2 environment. Part of the Kylin 
process status output is attached:

/opt/spark/jars/jackson-module-scala_2.11-2.6.7.1.jar:/opt/spark/jars/jackson-module-paranamer-2.7.9.jar:/opt/spark/jars/jackson-module-jaxb-annotations-2.6.7.jar:/opt/spark/jars/parquet-avro-1.10.1.jar
 
-Dkylin.flink.dependency=/opt/flink/lib/flink-table_2.11-1.9.0.jar:/opt/flink/lib/flink-dist_2.11-1.9.0.jar:/opt/flink/lib/flink-table-blink_2.11-1.9.0.jar:/opt/flink/lib/flink-metrics-prometheus_2.11-1.9.0.jar
 -Dkylin.hadoop.conf.dir=/etc/hadoop/conf 
-Dkylin.server.host-address=ng-dc-node5:7070 -Dspring.profiles.active=testing 
org.apache.hadoop.util.RunJar /opt/kylin/bin/../tomcat/bin/bootstrap.jar 
org.apache.catalina.startup.Bootstrap start

> Change Flink jar dependence from compile to provided 
> -
>
> Key: KYLIN-4389
> URL: https://issues.apache.org/jira/browse/KYLIN-4389
> Project: Kylin
>  Issue Type: Sub-task
>  Components: Flink Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0
>
>
> Currently, building the engine-flink module bundles the Flink core jars into 
> the Kylin package, which inflates the package by 70 MB, so I suggest changing 
> the Flink dependency scope to provided.
> One more thing: I suspect this fat jar (kylin-job-xxx.jar) causes issue 
> KYLIN-4387.





[jira] [Created] (KYLIN-4389) Change Flink jar dependence from compile to provided

2020-02-27 Thread Harvey Yue (Jira)
Harvey Yue created KYLIN-4389:
-

 Summary: Change Flink jar dependence from compile to provided 
 Key: KYLIN-4389
 URL: https://issues.apache.org/jira/browse/KYLIN-4389
 Project: Kylin
  Issue Type: Sub-task
  Components: Flink Engine
Reporter: Harvey Yue
Assignee: Harvey Yue
 Fix For: v3.1.0


Currently, building the engine-flink module bundles the Flink core jars into 
the Kylin package, which inflates the package by 70 MB, so I suggest changing 
the Flink dependency scope to provided.

One more thing: I suspect this fat jar (kylin-job-xxx.jar) causes issue 
KYLIN-4387.
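
One way to check which jar actually supplies a class at runtime is to print its class loader and code source. The following is a minimal sketch using only standard JDK calls; the class name WhereLoaded and its describe method are hypothetical helpers, not part of Kylin or Flink.

```java
import java.security.CodeSource;

// Hypothetical diagnostic helper; not part of Kylin or Flink.
class WhereLoaded {

    // Describes which loader defined a class and which jar or directory it came from.
    static String describe(Class<?> c) {
        ClassLoader loader = c.getClassLoader(); // null for bootstrap classes
        CodeSource source = c.getProtectionDomain().getCodeSource(); // may be null
        String location = (source == null || source.getLocation() == null)
                ? "<no code source>"
                : source.getLocation().toString();
        String loaderName = (loader == null) ? "<bootstrap>" : loader.getClass().getName();
        return c.getName() + " loaded by " + loaderName + " from " + location;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(describe(String.class));
        // Pass fully qualified class names as arguments, for example
        // com.esotericsoftware.kryo.Kryo when it is on the classpath.
        for (String name : args) {
            System.out.println(describe(Class.forName(name)));
        }
    }
}
```

Run with the same classpath as the failing process, the output would show whether a given class is served from the fat jar or from the jars under the Flink lib directory.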





[jira] [Comment Edited] (KYLIN-4387) Flink cubing merge step failed

2020-02-27 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046484#comment-17046484
 ] 

Harvey Yue edited comment on KYLIN-4387 at 2/27/20 11:00 AM:
-

My environment is CDH 6.2.0, and I merged the related engine-flink module code 
into the 3.0.x-hadoop3.1 branch to build the binary package.


was (Author: harveyyue):
My env is CDH 6.2.0

> Flink cubing merge step failed
> --
>
> Key: KYLIN-4387
> URL: https://issues.apache.org/jira/browse/KYLIN-4387
> Project: Kylin
>  Issue Type: Bug
>  Components: Flink Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0

[jira] [Commented] (KYLIN-4387) Flink cubing merge step failed

2020-02-27 Thread Harvey Yue (Jira)


[ 
https://issues.apache.org/jira/browse/KYLIN-4387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046484#comment-17046484
 ] 

Harvey Yue commented on KYLIN-4387:
---

My env is CDH 6.2.0

> Flink cubing merge step failed
> --
>
> Key: KYLIN-4387
> URL: https://issues.apache.org/jira/browse/KYLIN-4387
> Project: Kylin
>  Issue Type: Bug
>  Components: Flink Engine
>Reporter: Harvey Yue
>Assignee: Harvey Yue
>Priority: Major
> Fix For: v3.1.0

[jira] [Created] (KYLIN-4387) Flink cubing merge step failed

2020-02-26 Thread Harvey Yue (Jira)
Harvey Yue created KYLIN-4387:
-

 Summary: Flink cubing merge step failed
 Key: KYLIN-4387
 URL: https://issues.apache.org/jira/browse/KYLIN-4387
 Project: Kylin
  Issue Type: Bug
  Components: Flink Engine
Reporter: Harvey Yue


When the job runs this step, it hits the following issue. I suspect that two 
class loaders load the class *InstantiatorStrategy* when Flink executes the 
reduceGroup operator.

Workaround: use ByteArray instead of Text in the reduceGroup operator.

The root cause is that the Kylin binary includes the Flink core jar; we need 
to change the scope of the Flink-related dependencies to provided.
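
As an illustration of the suspected cause, here is a minimal Java sketch (not Kylin or Flink code) of how one class can end up as two distinct Class objects when a child-first loader defines its own copy instead of delegating to its parent. The names TwoLoadersDemo, Strategy and ChildFirstLoader are hypothetical; Strategy merely stands in for a shared type such as InstantiatorStrategy, and the loader is a simplified assumption about child-first behaviour, not Flink's ChildFirstClassLoader.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical demo class; none of these names come from Kylin or Flink.
class TwoLoadersDemo {

    // Stand-in for a shared type such as org.objenesis.strategy.InstantiatorStrategy.
    public static class Strategy {}

    // Child-first for exactly one class name: defines its own copy of that
    // class instead of delegating to the parent loader.
    static final class ChildFirstLoader extends ClassLoader {
        private final String childFirstName;

        ChildFirstLoader(ClassLoader parent, String childFirstName) {
            super(parent);
            this.childFirstName = childFirstName;
        }

        @Override
        protected Class<?> loadClass(String name, boolean resolve) throws ClassNotFoundException {
            if (!name.equals(childFirstName)) {
                return super.loadClass(name, resolve); // normal parent-first delegation
            }
            synchronized (getClassLoadingLock(name)) {
                Class<?> c = findLoadedClass(name);
                if (c == null) {
                    byte[] bytes = readClassBytes(name);
                    c = defineClass(name, bytes, 0, bytes.length);
                }
                if (resolve) {
                    resolveClass(c);
                }
                return c;
            }
        }

        // Reads the compiled class file through the parent loader's resources.
        private byte[] readClassBytes(String name) throws ClassNotFoundException {
            String resource = name.replace('.', '/') + ".class";
            try (InputStream in = getParent().getResourceAsStream(resource)) {
                if (in == null) {
                    throw new ClassNotFoundException(name);
                }
                ByteArrayOutputStream out = new ByteArrayOutputStream();
                byte[] buf = new byte[4096];
                int n;
                while ((n = in.read(buf)) != -1) {
                    out.write(buf, 0, n);
                }
                return out.toByteArray();
            } catch (IOException e) {
                throw new ClassNotFoundException(name, e);
            }
        }
    }

    // Returns true only if both loaders yield the very same Class object.
    static boolean sameClassObject() throws ClassNotFoundException {
        ClassLoader app = TwoLoadersDemo.class.getClassLoader();
        String name = Strategy.class.getName();
        Class<?> fromApp = app.loadClass(name);
        Class<?> fromChild = new ChildFirstLoader(app, name).loadClass(name);
        return fromApp == fromChild;
    }

    public static void main(String[] args) throws Exception {
        System.out.println("same Class object: " + sameClassObject());
    }
}
```

When a type that exists as two Class objects like this appears in a method signature that is resolved across both loaders, the JVM reports a loader constraint violation, which is the kind of LinkageError shown in the trace for this issue.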

 

The program finished with the following exception:

org.apache.flink.client.program.ProgramInvocationException: The main method 
caused an error: error execute org.apache.kylin.engine.flink.FlinkCubingMerge. 
Root cause: Job failed. (JobID: 7a40a92561cfa972e3c5b975db993fce)
 at 
org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:593)
 at 
org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:438)
 at 
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:274)
 at 
org.apache.flink.client.cli.CliFrontend.executeProgram(CliFrontend.java:746)
 at 
org.apache.flink.client.cli.CliFrontend.runProgram(CliFrontend.java:273)
 at org.apache.flink.client.cli.CliFrontend.run(CliFrontend.java:205)
 at 
org.apache.flink.client.cli.CliFrontend.parseParameters(CliFrontend.java:1010)
 at 
org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1083)
 at java.security.AccessController.doPrivileged(Native Method)
 at javax.security.auth.Subject.doAs(Subject.java:422)
 at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1836)
 at 
org.apache.flink.runtime.security.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
 at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1083)
 Caused by: java.lang.RuntimeException: error execute 
org.apache.kylin.engine.flink.FlinkCubingMerge. Root cause: Job failed. (JobID: 
7a40a92561cfa972e3c5b975db993fce)
 at 
org.apache.kylin.common.util.AbstractApplication.execute(AbstractApplication.java:42)
 at org.apache.kylin.common.util.FlinkEntry.main(FlinkEntry.java:46)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
 at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:498)
 at 
org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:576)
 ... 12 more
 Caused by: org.apache.flink.client.program.ProgramInvocationException: Job 
failed. (JobID: 7a40a92561cfa972e3c5b975db993fce)
 at 
org.apache.flink.client.program.rest.RestClusterClient.submitJob(RestClusterClient.java:262)
 at 
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:338)
 at 
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:326)
 at 
org.apache.flink.client.program.ContextEnvironment.execute(ContextEnvironment.java:62)
 at 
org.apache.kylin.engine.flink.FlinkCubingMerge.execute(FlinkCubingMerge.java:235)
 at 
org.apache.kylin.common.util.AbstractApplication.execute(AbstractApplication.java:37)
 ... 18 more
 Caused by: org.apache.flink.runtime.client.JobExecutionException: Job 
execution failed.
 at 
org.apache.flink.runtime.jobmaster.JobResult.toJobExecutionResult(JobResult.java:146)
 at 
org.apache.flink.client.program.rest.RestClusterClient.submitJob(RestClusterClient.java:259)
 ... 23 more
 Caused by: java.lang.LinkageError: loader constraint violation: when resolving 
method 
"com.esotericsoftware.kryo.Kryo$DefaultInstantiatorStrategy.setFallbackInstantiatorStrategy(*Lorg/objenesis/strategy/InstantiatorStrategy*;)V"
 the class loader (instance of org/apache/flink/util/*ChildFirstClassLoader*) 
of the current class, 
org/apache/flink/api/java/typeutils/runtime/WritableComparator, and the class 
loader (instance of sun/misc/Launcher$*AppClassLoader*) for the method's 
defining class, com/esotericsoftware/kryo/Kryo$DefaultInstantiatorStrategy, 
have different Class objects for the type 
org/objenesis/strategy/InstantiatorStrategy used in the signature
 at 
org.apache.flink.api.java.typeutils.runtime.WritableComparator.checkKryoInitialized(WritableComparator.java:174)
 at 
org.apache.flink.api.java.typeutils.runtime.WritableComparator.setReference(WritableComparator.java:67)
 at