[jira] [Created] (KYLIN-3407) java.io.FileNotFoundException: File does not exist: hdfs://localhost:9000/**/hive/lib/hive-catalog-core not found

2018-06-12 Thread Rahul Midha (JIRA)
Rahul Midha created KYLIN-3407:
--

 Summary: java.io.FileNotFoundException: File does not exist: 
hdfs://localhost:9000/**/hive/lib/hive-catalog-core not found
 Key: KYLIN-3407
 URL: https://issues.apache.org/jira/browse/KYLIN-3407
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Reporter: Rahul Midha


While creating a cube in Kylin, I get the error 

"java.io.FileNotFoundException: File does not exist: 
hdfs://localhost:9000/**/hive/lib/hive-catalog-core.jar not found" even though 
the file exists at that path. When I remove that file from the path, the same 
error is raised for another jar file.

My environment is Hadoop 2.7.3, Hive 2.3.3, HBase 1.1.1, Kylin 2.3.1.


--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


add a new dimension to the cube

2018-06-12 Thread ??
I have a cube with two months of built data. Now I want to add a new 
dimension to the cube. Do I have to purge the segments that are already 
built? If not, what should I do instead?

[jira] [Created] (KYLIN-3406) When the /execute_output/ metadata file sinking to HDFS is deleted, the monitoring page does not display any tasks.

2018-06-12 Thread yujie.sun (JIRA)
yujie.sun created KYLIN-3406:


 Summary: When the /execute_output/ metadata file sinking to HDFS 
is deleted, the monitoring page does not display any tasks.
 Key: KYLIN-3406
 URL: https://issues.apache.org/jira/browse/KYLIN-3406
 Project: Kylin
  Issue Type: Bug
  Components: Metadata
Affects Versions: v2.3.1
Reporter: yujie.sun
 Fix For: v2.4.0


*Issue description:*
When a single HBase metadata entry exceeds a certain size, it is offloaded to 
HDFS for storage. If files under execute_output/ are then deleted from HDFS, 
the monitoring page no longer displays any tasks.

*Reproduction steps:*

1. Use HBase as the metadata store.

2. In the HBase configuration, set "hbase.client.keyvalue.maxsize" to a small 
value, such as 524288.

3. Create a new build task. After it finishes, find the /execute_output HBase 
metadata path on HDFS, 
/{working_dir}/{metadata_url}/resources/execute_output/..., and delete one of 
the files.

4. At this point, the monitoring page does not display any tasks.
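
The overflow behaviour described in this report can be pictured with a minimal 
sketch. It is not Kylin's metadata store: the class `OverflowStore`, the 
in-memory maps standing in for HBase, and the local directory standing in for 
HDFS are all assumptions made for illustration. It shows how deleting an 
overflow file leaves a dangling pointer, and one possible way (again an 
assumption, not the fix adopted for KYLIN-3406) for a reader to tolerate it.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch, not Kylin code: values larger than maxInlineSize are
// written to a file (HDFS stand-in) and only a pointer is kept in memory
// (HBase stand-in).
final class OverflowStore {
    private final Map<String, byte[]> inline = new HashMap<>();
    private final Map<String, Path> overflow = new HashMap<>();
    private final Path overflowDir;
    private final int maxInlineSize;

    OverflowStore(Path overflowDir, int maxInlineSize) {
        this.overflowDir = overflowDir;
        this.maxInlineSize = maxInlineSize;
    }

    void put(String key, byte[] value) throws IOException {
        if (value.length <= maxInlineSize) {
            inline.put(key, value);
            overflow.remove(key);
        } else {
            // Flatten the key into a single file name inside overflowDir.
            Path file = overflowDir.resolve(key.replace('/', '_'));
            Files.write(file, value);
            overflow.put(key, file);
            inline.remove(key);
        }
    }

    // Returns an empty Optional instead of throwing when the overflow file
    // has been deleted, so one missing file does not hide the other entries.
    Optional<byte[]> get(String key) throws IOException {
        byte[] small = inline.get(key);
        if (small != null) {
            return Optional.of(small);
        }
        Path file = overflow.get(key);
        if (file == null || !Files.exists(file)) {
            return Optional.empty();
        }
        return Optional.of(Files.readAllBytes(file));
    }

    boolean isOverflowed(String key) {
        return overflow.containsKey(key);
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("overflow-demo");
        OverflowStore store = new OverflowStore(dir, 8);
        store.put("/execute_output/job-1", "ok".getBytes(StandardCharsets.UTF_8));
        store.put("/execute_output/job-2",
                "a much longer output".getBytes(StandardCharsets.UTF_8));

        // Simulate the manual deletion from step 3.
        Files.delete(dir.resolve("_execute_output_job-2"));

        System.out.println(store.get("/execute_output/job-1").isPresent());
        System.out.println(store.get("/execute_output/job-2").isPresent());
    }
}
```

In this sketch the small entry stays readable after the deletion, while the 
overflowed entry is reported as absent; a listing built on `get` can then 
skip the missing entry instead of failing as a whole.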





[jira] [Created] (KYLIN-3405) kylin mandatory dimension

2018-06-12 Thread Lemont (JIRA)
Lemont created KYLIN-3405:
-

 Summary: kylin mandatory dimension
 Key: KYLIN-3405
 URL: https://issues.apache.org/jira/browse/KYLIN-3405
 Project: Kylin
  Issue Type: Bug
Reporter: Lemont


I created a simple cube on Kylin 2.3.1. The table has three dimensions (pt, 
biz_time and periods_time) and one measure (online_rider).
My problem is that when I set biz_time as a mandatory dimension, the 
following SQL fails to execute.
The SQL is:
select floor((1524931200 - biz_time)/(30*86400)), count(DISTINCT online_rider) onlineRiders
  from rider_view where pt = '20180530' and periods_time in(0) and biz_time > 1524931200
  group by floor((1524931200 - biz_time)*1.0/(30*86400))
The error is:
null while executing sql.
But when I remove either the condition periods_time in(0) or the floor 
function, the SQL executes successfully.
I don't know how the two are related.





[GitHub] shaofengshi closed pull request #154: KYLIN-3388 remove create redistribute flat hive table step if not set…

2018-06-12 Thread GitBox
shaofengshi closed pull request #154: KYLIN-3388 remove create redistribute 
flat hive table step if not set…
URL: https://github.com/apache/kylin/pull/154
 
 
   

This is a PR merged from a forked repository.
As GitHub hides the original diff on merge, it is displayed below for
the sake of provenance:

As this is a foreign pull request (from a fork), the diff is supplied
below (as it won't show otherwise due to GitHub magic):

diff --git a/core-job/src/main/java/org/apache/kylin/job/JoinedFlatTable.java b/core-job/src/main/java/org/apache/kylin/job/JoinedFlatTable.java
index 57e1c40f60..a6c6daad71 100644
--- a/core-job/src/main/java/org/apache/kylin/job/JoinedFlatTable.java
+++ b/core-job/src/main/java/org/apache/kylin/job/JoinedFlatTable.java
@@ -189,11 +189,7 @@ public static void appendJoinStatement(IJoinedFlatTableDesc flatDesc, StringBuil
     }
 
     private static void appendDistributeStatement(StringBuilder sql, TblColRef redistCol) {
-        if (redistCol != null) {
-            sql.append(" DISTRIBUTE BY ").append(colName(redistCol, true)).append(";\n");
-        } else {
-            sql.append(" DISTRIBUTE BY RAND()").append(";\n");
-        }
+        sql.append(" DISTRIBUTE BY ").append(colName(redistCol, true)).append(";\n");
     }
 
     private static void appendClusterStatement(StringBuilder sql, TblColRef clusterCol) {
diff --git a/source-hive/src/main/java/org/apache/kylin/source/hive/HiveMRInput.java b/source-hive/src/main/java/org/apache/kylin/source/hive/HiveMRInput.java
index 0e791ebe86..8653500b98 100644
--- a/source-hive/src/main/java/org/apache/kylin/source/hive/HiveMRInput.java
+++ b/source-hive/src/main/java/org/apache/kylin/source/hive/HiveMRInput.java
@@ -23,7 +23,6 @@
 import java.util.List;
 import java.util.Set;
 
-import com.google.common.collect.Lists;
 import org.apache.commons.lang.StringUtils;
 import org.apache.hadoop.fs.FileSystem;
 import org.apache.hadoop.fs.Path;
@@ -59,6 +58,7 @@
 import org.slf4j.Logger;
 import org.slf4j.LoggerFactory;
 
+import com.google.common.collect.Lists;
 import com.google.common.collect.Sets;
 
 public class HiveMRInput implements IMRInput {
@@ -153,7 +153,9 @@ public void addStepPhase1_CreateFlatTable(DefaultChainedExecutable jobFlow) {
 
             // then count and redistribute
             if (cubeConfig.isHiveRedistributeEnabled()) {
-                jobFlow.addTask(createRedistributeFlatHiveTableStep(hiveInitStatements, cubeName));
+                if (flatDesc.getClusterBy() != null || flatDesc.getDistributedBy() != null) {
+                    jobFlow.addTask(createRedistributeFlatHiveTableStep(hiveInitStatements, cubeName));
+                }
             }
 
             // special for hive
@@ -449,8 +451,7 @@ private String cleanUpIntermediateFlatTable(KylinConfig config) throws IOExcepti
             }
             config.getCliCommandExecutor().execute(hiveCmdBuilder.build());
             rmdirOnHDFS(getExternalDataPaths());
-            output.append(
-                    "Path " + getExternalDataPaths() + " is deleted. \n");
+            output.append("Path " + getExternalDataPaths() + " is deleted. \n");
 
             return output.toString();
         }
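
The HiveMRInput hunk in this diff comes down to one guard: the redistribute 
step is added only when redistribution is enabled and the flat table 
descriptor names a cluster-by or distribute-by column. A minimal standalone 
sketch of that condition follows. `RedistributeGuard`, its parameter names, 
and the use of plain strings for columns are hypothetical stand-ins for 
illustration, not Kylin classes.

```java
// Hypothetical stand-in for the condition added to
// addStepPhase1_CreateFlatTable in the diff; not Kylin code.
final class RedistributeGuard {
    private RedistributeGuard() {
    }

    // Columns are plain strings here; the diff shows Kylin passing TblColRef.
    static boolean shouldAddRedistributeStep(boolean redistributeEnabled,
                                             String clusterByColumn,
                                             String distributedByColumn) {
        if (!redistributeEnabled) {
            return false;
        }
        // Mirrors: getClusterBy() != null || getDistributedBy() != null
        return clusterByColumn != null || distributedByColumn != null;
    }

    public static void main(String[] args) {
        // Enabled, but no column configured: the step is skipped.
        System.out.println(shouldAddRedistributeStep(true, null, null));
        // Enabled with a distribute-by column: the step is added.
        System.out.println(shouldAddRedistributeStep(true, null, "SELLER_ID"));
        // Disabled in configuration: the step is skipped regardless of columns.
        System.out.println(shouldAddRedistributeStep(false, "PART_DT", null));
    }
}
```

The column names in `main` are sample values only.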


 


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Created] (KYLIN-3404) IllegalArgumentException throw during cube optimization

2018-06-12 Thread Chuqian Yu (JIRA)
Chuqian Yu created KYLIN-3404:
-

 Summary: IllegalArgumentException throw during cube optimization  
 Key: KYLIN-3404
 URL: https://issues.apache.org/jira/browse/KYLIN-3404
 Project: Kylin
  Issue Type: Bug
Reporter: Chuqian Yu


{code:java}
2018-06-12 20:53:37,062 ERROR [Scheduler 2088456043 Job bea409fd-bf7d-49d6-a52a-7fed0919cf96-376] execution.AbstractExecutable:100 : 1th retries for onExecuteFinished fails due to {}
java.lang.IllegalArgumentException
    at org.apache.kylin.cube.CubeUpdate.setCubeInstance(CubeUpdate.java:50)
    at org.apache.kylin.cube.CubeUpdate.<init>(CubeUpdate.java:41)
    at org.apache.kylin.job.execution.CheckpointExecutable.onExecuteFinished(CheckpointExecutable.java:88)
    at org.apache.kylin.job.execution.AbstractExecutable.onExecuteFinishedWithRetry(AbstractExecutable.java:98)
    at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:175)
    at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:300)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)
2018-06-12 20:53:37,063 ERROR [pool-9-thread-7] threadpool.DefaultScheduler:302 : ExecuteException job:bea409fd-bf7d-49d6-a52a-7fed0919cf96
org.apache.kylin.job.exception.ExecuteException: java.lang.IllegalArgumentException
    at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:179)
    at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:300)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.IllegalArgumentException
    at org.apache.kylin.cube.CubeUpdate.setCubeInstance(CubeUpdate.java:50)
    at org.apache.kylin.cube.CubeUpdate.<init>(CubeUpdate.java:41)
    at org.apache.kylin.job.execution.CheckpointExecutable.onExecuteFinished(CheckpointExecutable.java:88)
    at org.apache.kylin.job.execution.AbstractExecutable.onExecuteFinishedWithRetry(AbstractExecutable.java:98)
    at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:175)
    ... 4 more
{code}





[GitHub] coveralls commented on issue #154: KYLIN-3388 remove create redistribute flat hive table step if not set…

2018-06-12 Thread GitBox
coveralls commented on issue #154: KYLIN-3388 remove create redistribute flat 
hive table step if not set…
URL: https://github.com/apache/kylin/pull/154#issuecomment-396561500
 
 
   ## Pull Request Test Coverage Report for [Build 3226](https://coveralls.io/builds/17443154)
   
   * **0** of **4**   **(0.0%)**  changed or added relevant lines in **2** files are covered.
   * **1** unchanged line in **1** file lost coverage.
   * Overall coverage increased (+**0.002%**) to **24.181%**
   
   ---
   
   |  Changes Missing Coverage | Covered Lines | Changed/Added Lines | % |
   | :-- | -- | -- | --: |
   | [core-job/src/main/java/org/apache/kylin/job/JoinedFlatTable.java](https://coveralls.io/builds/17443154/source?filename=core-job%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fjob%2FJoinedFlatTable.java#L192) | 0 | 1 | 0.0% |
   | [source-hive/src/main/java/org/apache/kylin/source/hive/HiveMRInput.java](https://coveralls.io/builds/17443154/source?filename=source-hive%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fsource%2Fhive%2FHiveMRInput.java#L156) | 0 | 3 | 0.0% |
   
   
   |  Files with Coverage Reduction | New Missed Lines | % |
   | :-- | -- | --: |
   | [core-cube/src/main/java/org/apache/kylin/cube/inmemcubing/MemDiskStore.java](https://coveralls.io/builds/17443154/source?filename=core-cube%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fkylin%2Fcube%2Finmemcubing%2FMemDiskStore.java#L553) | 1 | 78.12% |
   
   
   |  Totals | [![Coverage Status](https://coveralls.io/builds/17443154/badge)](https://coveralls.io/builds/17443154) |
   | :-- | --: |
   | Change from base [Build 3225](https://coveralls.io/builds/17438083): |  0.002% |
   | Covered Lines: | 14800 |
   | Relevant Lines: | 61205 |
   
   ---
   #   - [Coveralls](https://coveralls.io)
   




[GitHub] codecov-io commented on issue #154: KYLIN-3388 remove create redistribute flat hive table step if not set…

2018-06-12 Thread GitBox
codecov-io commented on issue #154: KYLIN-3388 remove create redistribute flat 
hive table step if not set…
URL: https://github.com/apache/kylin/pull/154#issuecomment-396535440
 
 
   # [Codecov](https://codecov.io/gh/apache/kylin/pull/154?src=pr=h1) Report
   > Merging [#154](https://codecov.io/gh/apache/kylin/pull/154?src=pr=desc) 
into 
[master](https://codecov.io/gh/apache/kylin/commit/fd7cc177f78e4b037d57e7a28b749c747485040e?src=pr=desc)
 will **increase** coverage by `<.01%`.
   > The diff coverage is `0%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/kylin/pull/154/graphs/tree.svg?width=650=150=pr=JawVgbgsVo)](https://codecov.io/gh/apache/kylin/pull/154?src=pr=tree)
   
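   ```diff
   @@             Coverage Diff              @@
   ##             master     #154      +/-   ##
   ============================================
   + Coverage     22.15%   22.15%   +<.01%     
     Complexity     4021     4021              
   ============================================
     Files          1012     1012              
     Lines         61207    61205       -2     
     Branches       8768     8769       +1     
   ============================================
   + Hits          13558    13559       +1     
   + Misses        46410    46407       -3     
     Partials       1239     1239              
   ```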
   
   
   | [Impacted Files](https://codecov.io/gh/apache/kylin/pull/154?src=pr=tree) | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | [...ain/java/org/apache/kylin/job/JoinedFlatTable.java](https://codecov.io/gh/apache/kylin/pull/154/diff?src=pr=tree#diff-Y29yZS1qb2Ivc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2t5bGluL2pvYi9Kb2luZWRGbGF0VGFibGUuamF2YQ==) | `0% <0%> (ø)` | `0 <0> (ø)` | :arrow_down: |
   | [...java/org/apache/kylin/source/hive/HiveMRInput.java](https://codecov.io/gh/apache/kylin/pull/154/diff?src=pr=tree#diff-c291cmNlLWhpdmUvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2t5bGluL3NvdXJjZS9oaXZlL0hpdmVNUklucHV0LmphdmE=) | `10.34% <0%> (ø)` | `1 <0> (ø)` | :arrow_down: |
   | [...a/org/apache/kylin/dict/Number2BytesConverter.java](https://codecov.io/gh/apache/kylin/pull/154/diff?src=pr=tree#diff-Y29yZS1kaWN0aW9uYXJ5L3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9reWxpbi9kaWN0L051bWJlcjJCeXRlc0NvbnZlcnRlci5qYXZh) | `81.74% <0%> (-0.8%)` | `17% <0%> (-1%)` | |
   | [...rg/apache/kylin/cube/inmemcubing/MemDiskStore.java](https://codecov.io/gh/apache/kylin/pull/154/diff?src=pr=tree#diff-Y29yZS1jdWJlL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9reWxpbi9jdWJlL2lubWVtY3ViaW5nL01lbURpc2tTdG9yZS5qYXZh) | `69.6% <0%> (-0.61%)` | `7% <0%> (ø)` | |
   | [...g/apache/kylin/source/datagen/ColumnGenerator.java](https://codecov.io/gh/apache/kylin/pull/154/diff?src=pr=tree#diff-Y29yZS1tZXRhZGF0YS9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUva3lsaW4vc291cmNlL2RhdGFnZW4vQ29sdW1uR2VuZXJhdG9yLmphdmE=) | `72.29% <0%> (+1.35%)` | `8% <0%> (ø)` | :arrow_down: |
   | [...he/kylin/dict/lookup/cache/RocksDBLookupTable.java](https://codecov.io/gh/apache/kylin/pull/154/diff?src=pr=tree#diff-Y29yZS1kaWN0aW9uYXJ5L3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9reWxpbi9kaWN0L2xvb2t1cC9jYWNoZS9Sb2Nrc0RCTG9va3VwVGFibGUuamF2YQ==) | `78.37% <0%> (+5.4%)` | `6% <0%> (+1%)` | :arrow_up: |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/kylin/pull/154?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/kylin/pull/154?src=pr=footer). Last 
update 
[fd7cc17...5c83e21](https://codecov.io/gh/apache/kylin/pull/154?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   




[GitHub] Wayne1c opened a new pull request #154: KYLIN-3388 remove create redistribute flat hive table step if not set…

2018-06-12 Thread GitBox
Wayne1c opened a new pull request #154: KYLIN-3388 remove create redistribute 
flat hive table step if not set…
URL: https://github.com/apache/kylin/pull/154
 
 
   … distribute column




[GitHub] asfgit commented on issue #154: KYLIN-3388 remove create redistribute flat hive table step if not set…

2018-06-12 Thread GitBox
asfgit commented on issue #154: KYLIN-3388 remove create redistribute flat hive 
table step if not set…
URL: https://github.com/apache/kylin/pull/154#issuecomment-396530155
 
 
   Can one of the admins verify this patch?




[jira] [Created] (KYLIN-3403) Querying sample cube with filter "KYLIN_CAL_DT.WEEK_BEG_DT >= CAST('2001-09-09' AS DATE)" returns unexpected empty result set

2018-06-12 Thread Yifan Zhang (JIRA)
Yifan Zhang created KYLIN-3403:
--

 Summary: Querying sample cube with filter 
"KYLIN_CAL_DT.WEEK_BEG_DT >= CAST('2001-09-09' AS DATE)" returns unexpected 
empty result set 
 Key: KYLIN-3403
 URL: https://issues.apache.org/jira/browse/KYLIN-3403
 Project: Kylin
  Issue Type: Bug
  Components: Query Engine
Reporter: Yifan Zhang
Assignee: Yifan Zhang


2018-05-16 13:30:17,824 DEBUG [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
cachesync.CachedCrudAssist : Loaded 3 ColumnACL(s) out of 3 resource
2018-05-16 13:30:17,840 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
routing.QueryRouter : Find candidates by table DEFAULT.KYLIN_SALES and 
project=learn_kylin : 
INVERTED_INDEX[name=kylin_sales_cube],CUBE[name=kylin_sales_cube]
2018-05-16 13:30:17,841 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
routing.QueryRouter : Applying rule: class 
org.apache.kylin.query.routing.rules.RemoveBlackoutRealizationsRule, 
realizations before: 
[INVERTED_INDEX[name=kylin_sales_cube],CUBE[name=kylin_sales_cube]], 
realizations after: 
[INVERTED_INDEX[name=kylin_sales_cube],CUBE[name=kylin_sales_cube]]
2018-05-16 13:30:17,843 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
routing.QueryRouter : Applying rule: class 
org.apache.kylin.query.routing.rules.RemoveUncapableRealizationsRule, 
realizations before: 
[INVERTED_INDEX[name=kylin_sales_cube],CUBE[name=kylin_sales_cube]], 
realizations after: 
[INVERTED_INDEX[name=kylin_sales_cube],CUBE[name=kylin_sales_cube]]
2018-05-16 13:30:17,844 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
rules.RealizationSortRule : INVERTED_INDEX[name=kylin_sales_cube] priority 1 
cost 8360. CUBE[name=kylin_sales_cube] priority 1 cost 836.
2018-05-16 13:30:17,844 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
routing.QueryRouter : Applying rule: class 
org.apache.kylin.query.routing.rules.RealizationSortRule, realizations before: 
[INVERTED_INDEX[name=kylin_sales_cube],CUBE[name=kylin_sales_cube]], 
realizations after: 
[CUBE[name=kylin_sales_cube],INVERTED_INDEX[name=kylin_sales_cube]]
2018-05-16 13:30:17,844 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
routing.QueryRouter : The realizations remaining: 
[CUBE[name=kylin_sales_cube],INVERTED_INDEX[name=kylin_sales_cube]],and the 
final chosen one for current olap context 0 is CUBE[name=kylin_sales_cube]
2018-05-16 13:30:18,042 DEBUG [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
enumerator.OLAPEnumerator : query storage...
2018-05-16 13:30:18,082 DEBUG [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.GTCubeStorageQueryBase : Need storage aggregation
2018-05-16 13:30:18,083 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.GTCubeStorageQueryBase : exactAggregation is false because need 
storage aggregation
2018-05-16 13:30:18,091 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
lookup.SnapshotManager : Loading snapshotTable from 
/table_snapshot/DEFAULT.KYLIN_CAL_DT/2d3d125a-30f7-438f-bede-7212c5666f4a.snapshot,
 with loadData: true
2018-05-16 13:30:18,197 DEBUG [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
lookup.SnapshotManager : Loaded snapshot at 
/table_snapshot/DEFAULT.KYLIN_CAL_DT/2d3d125a-30f7-438f-bede-7212c5666f4a.snapshot
2018-05-16 13:30:18,426 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.GTCubeStorageQueryBase : Filter column set for query: []
2018-05-16 13:30:18,426 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.GTCubeStorageQueryBase : Filter mask is: 0
2018-05-16 13:30:18,426 DEBUG [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.GTCubeStorageQueryBase : storageLimitLevel set to NO_LIMIT because 
derived column require post aggregation: [DEFAULT.KYLIN_SALES.PART_DT]
2018-05-16 13:30:18,426 DEBUG [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.GTCubeStorageQueryBase : storageLimitLevel set to NO_LIMIT because the 
query has order by
2018-05-16 13:30:18,427 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.GTCubeStorageQueryBase : Cuboid identified: cube=kylin_sales_cube, 
cuboidId=16384, groupsD=[], filterD=[], limitPushdown=2147483647, 
limitLevel=NO_LIMIT, storageAggr=true
2018-05-16 13:30:18,429 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.CubeSegmentScanner : Init CubeSegmentScanner for segment 
2012010100_20180403152544
2018-05-16 13:30:18,476 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
gtrecord.ScannerWorker : Segment 
kylin_sales_cube[2012010100_20180403152544] will be skipped
2018-05-16 13:30:18,477 DEBUG [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
enumerator.OLAPEnumerator : return TupleIterator...
2018-05-16 13:30:18,481 INFO [Query aa20e5fd-8b86-4359-a87a-2a2e7b8c100a-162] 
service.QueryService : Processed rows for each storageContext: 0
2018-05-16 13:30:18,481 INFO [Query 

[KYLIN-3388] Hive data may become inconsistent after redistribution

2018-06-12 Thread ShaoFeng Shi
Hello Kylin users,

Recently Yanghong Zhong from the eBay team reported that the source data may
become inconsistent after the "Redistribute flat hive table" step. This is
caused by a Hive bug in the "distribute by rand()" statement, which Kylin
relies on to distribute the data more evenly. For more information, please
check:

https://issues.apache.org/jira/browse/KYLIN-3388

Until a hot-fix is released, we recommend that you disable the redistribution
feature to ensure data accuracy, by setting:

kylin.source.hive.redistribute-flat-table=false


in conf/kylin.properties. A restart is needed for the change to take effect.
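
To double-check the setting before restarting, something like the following 
could be used. It is only a sketch under stated assumptions: the class name 
`RedistributeFlagCheck` is hypothetical, and treating a missing key as 
"enabled" is an assumption about Kylin's default, not something stated in 
this message. It reads the file with the standard `java.util.Properties` API.

```java
import java.io.IOException;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Properties;

// Hypothetical helper, not part of Kylin: reports whether the redistribution
// feature is still enabled according to a properties file.
final class RedistributeFlagCheck {
    static final String KEY = "kylin.source.hive.redistribute-flat-table";

    private RedistributeFlagCheck() {
    }

    // Assumption: when the key is absent the feature is treated as enabled.
    static boolean isRedistributeEnabled(Reader config) throws IOException {
        Properties props = new Properties();
        props.load(config);
        return Boolean.parseBoolean(props.getProperty(KEY, "true").trim());
    }

    public static void main(String[] args) throws IOException {
        // Path from the message above; pass another path as the first argument.
        String path = args.length > 0 ? args[0] : "conf/kylin.properties";
        try (Reader reader = Files.newBufferedReader(Paths.get(path), StandardCharsets.UTF_8)) {
            System.out.println(KEY + " enabled: " + isRedistributeEnabled(reader));
        }
    }
}
```

This only inspects the file on disk; it does not tell you whether a running 
Kylin instance has already picked up the change.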

Thanks for the attention.

-- 
Best regards,

Shaofeng Shi 史少锋