[jira] [Created] (KYLIN-1196) Prepare release v1.2

2015-12-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1196:
---

 Summary: Prepare release v1.2
 Key: KYLIN-1196
 URL: https://issues.apache.org/jira/browse/KYLIN-1196
 Project: Kylin
  Issue Type: Bug
  Components: General
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1206) Support "UNION" in query

2015-12-07 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1206:
---

 Summary: Support "UNION" in query
 Key: KYLIN-1206
 URL: https://issues.apache.org/jira/browse/KYLIN-1206
 Project: Kylin
  Issue Type: Wish
  Components: Query Engine
Affects Versions: v1.1.1, v0.7.1, v0.7.2
Reporter: Shaofeng SHI
Assignee: liyang
 Fix For: Backlog


Kylin doesn't support UNION in SQL query now, this should be implemented.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1219) Kylin support SSO with Spring SAML

2015-12-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1219:
---

 Summary: Kylin support SSO with Spring SAML
 Key: KYLIN-1219
 URL: https://issues.apache.org/jira/browse/KYLIN-1219
 Project: Kylin
  Issue Type: New Feature
  Components: REST Service
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.1


Besides LDAP, SSO is widely adopted by enterprises for user authentication; As 
Kylin rest service is implemented with Spring security framework, it is natural 
to implement SSO with SAML 2.0



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1221) Allow user to kill Hadoop job in the job list page

2015-12-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1221:
---

 Summary: Allow user to kill Hadoop job in the job list page
 Key: KYLIN-1221
 URL: https://issues.apache.org/jira/browse/KYLIN-1221
 Project: Kylin
  Issue Type: Wish
  Components: Job Engine
Affects Versions: v0.7.1, v0.7.2, v1.0, v1.1
Reporter: Shaofeng SHI
Assignee: ZhouQianhao


Sometimes user expects to abondon the current running hadoop job, he can choose 
to kill the related job directly from the Kylin job page, instead of manually 
run yarn or mapred CLI to kill it.

Besides, when user discards a Kylin job, the underlying running hadoop job 
should also be killed automatically, to release the computing resources in time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1229) CubeMigrationCLI should not need set KYLIN_HOME

2015-12-13 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1229:
---

 Summary: CubeMigrationCLI should not need set KYLIN_HOME
 Key: KYLIN-1229
 URL: https://issues.apache.org/jira/browse/KYLIN-1229
 Project: Kylin
  Issue Type: Bug
Affects Versions: v1.0, v1.1, v1.2
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
Priority: Minor
 Fix For: v1.3


When release a cube from one Kylin server to another Kylin server, if didn't 
set KYLIN_HOME environment variable, it will throw exception:

{code}
hbase -Dlog4j.configuration=/conf/log4j.properties  
org.apache.hadoop.util.RunJar lib/kylin-job-1.2-SNAPSHOT.jar 
org.apache.kylin.job.tools.CubeMigrationCLI   kylin-qa:80  kylin-prod:80 
CUBE_NAME PROJ_NAME true true false

L4J [2015-12-14 
00:44:53,578][ERROR][org.apache.kylin.common.persistence.ResourceStore] - 
Create new store instance failed
java.lang.reflect.InvocationTargetException
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at 
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
at 
org.apache.kylin.common.persistence.ResourceStore.getStore(ResourceStore.java:76)
at 
org.apache.kylin.job.tools.CubeMigrationCLI.moveCube(CubeMigrationCLI.java:94)
at 
org.apache.kylin.job.tools.CubeMigrationCLI.moveCube(CubeMigrationCLI.java:138)
at 
org.apache.kylin.job.tools.CubeMigrationCLI.main(CubeMigrationCLI.java:82)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
Caused by: java.lang.RuntimeException: Didn't find KYLIN_CONF or KYLIN_HOME, 
please set one of them
at 
org.apache.kylin.common.KylinConfig.getKylinProperties(KylinConfig.java:600)
at 
org.apache.kylin.common.KylinConfig.getKylinPropertiesAsInputSteam(KylinConfig.java:608)
at 
org.apache.kylin.common.KylinConfig.loadKylinConfig(KylinConfig.java:249)
at 
org.apache.kylin.common.KylinConfig.getInstanceFromEnv(KylinConfig.java:145)
at 
org.apache.kylin.common.util.HadoopUtil.getCurrentHBaseConfiguration(HadoopUtil.java:56)
at 
org.apache.kylin.common.persistence.HBaseConnection.get(HBaseConnection.java:68)
at 
org.apache.kylin.common.persistence.HBaseResourceStore.getConnection(HBaseResourceStore.java:81)
at 
org.apache.kylin.common.persistence.HBaseResourceStore.createHTableIfNeeded(HBaseResourceStore.java:100)
at 
org.apache.kylin.common.persistence.HBaseResourceStore.(HBaseResourceStore.java:95)
... 13 more
Exception in thread "main" java.lang.IllegalArgumentException: Failed to find 
metadata store by url: kylin_metadata_qa@hbase
at 
org.apache.kylin.common.persistence.ResourceStore.getStore(ResourceStore.java:91)
at 
org.apache.kylin.job.tools.CubeMigrationCLI.moveCube(CubeMigrationCLI.java:94)
at 
org.apache.kylin.job.tools.CubeMigrationCLI.moveCube(CubeMigrationCLI.java:138)
at 
org.apache.kylin.job.tools.CubeMigrationCLI.main(CubeMigrationCLI.java:82)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
{code}

The problem is between HBaseResourceStore.getConnection() and 
HBaseConnection.get(url), which depends on an implicit KylinConfig, insteading 
using the given KylinConfig.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1230) When CubeMigrationCLI copied ACL from one env to another, it may not work

2015-12-14 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1230:
---

 Summary: When CubeMigrationCLI copied ACL from one env to another, 
it may not work
 Key: KYLIN-1230
 URL: https://issues.apache.org/jira/browse/KYLIN-1230
 Project: Kylin
  Issue Type: Bug
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Found an issue when using CubeMigrationCLI to migrate a cube from QA to 
production if selecting “copyACL” as true (this is a newly added feature). The 
root cause is the project in production has a different uuid as it in QA. The 
cube’s ACL refers to its parent by uuid. Then if a cube’s ACL is copied from 
QA, it couldn't find the parent, then an error is thrown.

To fix the issue, it should copy the project (if it doesn't exist in the target 
env), or give up copy ACL when detecting the project uuid is different (and 
remind user to manually update ACL).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1250) Got NPE when discarding a job

2015-12-23 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1250:
---

 Summary: Got NPE when discarding a job
 Key: KYLIN-1250
 URL: https://issues.apache.org/jira/browse/KYLIN-1250
 Project: Kylin
  Issue Type: Bug
  Components: General
Affects Versions: v1.2
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v1.3


User reported that he couldn't rebuild a cube, Kylin reports "The cube 
 has running job () please discard it and try again." 
Then he tried to discard the job, but got an NullPointerException;

{code}
Caused by: java.lang.NullPointerException
at org.apache.kylin.rest.service.JobService.cancelJob(JobService.java:321)
at 
org.apache.kylin.rest.service.JobService$$FastClassByCGLIB$$83a44b2a.invoke()
at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:204)
at 
org.springframework.aop.framework.Cglib2AopProxy$CglibMethodInvocation.invokeJoinpoint(Cglib2AopProxy.java:689)
at 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:150)
at 
org.springframework.security.access.intercept.aopalliance.MethodSecurityInterceptor.invoke(MethodSecurityInterceptor.java:64)
at 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInv
{code}

User said "We haven’t done the purge action. It’s possible that the bug was 
triggered when we have made the changes to filtering while the cube was still 
in the build process?"



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1252) Use org.apache.httpcomponents.httpclient to replace commons-httpclient

2015-12-24 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1252:
---

 Summary: Use org.apache.httpcomponents.httpclient to replace 
commons-httpclient
 Key: KYLIN-1252
 URL: https://issues.apache.org/jira/browse/KYLIN-1252
 Project: Kylin
  Issue Type: Improvement
  Components: General
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
Priority: Minor
 Fix For: v1.3


After merge the patch of KYLIN-1014, there are two http client library in 
kylin's pom.xml: org.apache.httpcomponents.httpclient and commons-httpclient;

As the development of commons-httpclient is ended, we should replace it with 
the httpcomponents.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1278) Query engine optimization: skip the condition checking on partition column if a segment's time is totaly included in the query range

2016-01-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1278:
---

 Summary: Query engine optimization: skip the condition checking on 
partition column if a segment's time is totaly included in the query range
 Key: KYLIN-1278
 URL: https://issues.apache.org/jira/browse/KYLIN-1278
 Project: Kylin
  Issue Type: Improvement
  Components: Query Engine
Reporter: Shaofeng SHI
Assignee: liyang
 Fix For: Backlog


See an example: a cube (which is partitioned by date column "CAL_DT") has three 
segments:

S1, S2, S3;

Now a query like "select  from fact_table where CAL_DT > '-mm-dd' group 
by dimA, dimB..." comes, and the "CAL_DT" is only used for filtering, not in 
group by; 

By checking the startTime/endTime, Kylin found S2 and S3 need be scanned, and 
S3 is totally match the condition "CAL_DT > -mm-dd". Then when scan S3, 
Kylin can ignore the condition on "CAL_DT", selecting a smaller cuboid, this 
should gain better performance. 

 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1281) Add "endDate" for cube descriptor

2016-01-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1281:
---

 Summary: Add "endDate" for cube descriptor
 Key: KYLIN-1281
 URL: https://issues.apache.org/jira/browse/KYLIN-1281
 Project: Kylin
  Issue Type: Improvement
  Components: Metadata
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.0


For some legacy cube, we expect set an "endDate" for it and don't allow 
building beyond that date. So we can create new cube from that date.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1292) Create a doc for all properties in conf/kylin.properties

2016-01-06 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1292:
---

 Summary: Create a doc for all properties in conf/kylin.properties
 Key: KYLIN-1292
 URL: https://issues.apache.org/jira/browse/KYLIN-1292
 Project: Kylin
  Issue Type: Improvement
  Components: Documentation
Reporter: Shaofeng SHI


Need a central place to introduce the variables in kylin.properties, list the 
information about:

name, value options, default value, impact, since version, etc. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1311) Stream cubing auto assignment and load balance

2016-01-12 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1311:
---

 Summary: Stream cubing auto assignment and load balance
 Key: KYLIN-1311
 URL: https://issues.apache.org/jira/browse/KYLIN-1311
 Project: Kylin
  Issue Type: Sub-task
  Components: Job Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.1


This is a sub task of KYLIN-1117, need assign the stream cubing job to cluster 
for load balance and fail over.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1322) Wrong package name for TimedJsonStreamParser

2016-01-15 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1322:
---

 Summary: Wrong package name for TimedJsonStreamParser
 Key: KYLIN-1322
 URL: https://issues.apache.org/jira/browse/KYLIN-1322
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


Get this error when trigger a streaming job with 2.x-staging:

2016-01-15 09:28:58,567 ERROR [main StreamingCLI:97]: error start streaming
java.lang.RuntimeException: failed to create instance of StreamingParser
at 
org.apache.kylin.source.kafka.KafkaStreamingInput.getBatchWithTimeWindow(KafkaStreamingInput.java:101)
at 
org.apache.kylin.engine.streaming.OneOffStreamingBuilder$1.run(OneOffStreamingBuilder.java:72)
at 
org.apache.kylin.engine.streaming.cli.StreamingCLI.startOneOffCubeStreaming(StreamingCLI.java:104)
at 
org.apache.kylin.engine.streaming.cli.StreamingCLI.main(StreamingCLI.java:91)
Caused by: java.lang.ClassNotFoundException: 
org.apache.kylin.streaming.TimedJsonStreamParser
at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:190)
at 
org.apache.kylin.source.kafka.StreamingParser.getStreamingParser(StreamingParser.java:82)
at 
org.apache.kylin.source.kafka.KafkaStreamingInput.getBatchWithTimeWindow(KafkaStreamingInput.java:74)
... 3 more


And then I found the class has been refactored to package 
org.apache.kylin.source.kafka, while the generated JSON is still using the old 
package.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1328) DataModel cache should be flushed as other types

2016-01-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1328:
---

 Summary: DataModel cache should be flushed as other types
 Key: KYLIN-1328
 URL: https://issues.apache.org/jira/browse/KYLIN-1328
 Project: Kylin
  Issue Type: Bug
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.0


When remove a data model, Kylin will trigger an cache event whose type is 
"DATA_MODEL", the CacheService.removeCache method is invoked, and then an 
"UnsupportedOperationException" is thrown, this is not expected.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1351) Support common RDBMS as data source in Kylin

2016-01-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1351:
---

 Summary: Support common RDBMS as data source in Kylin
 Key: KYLIN-1351
 URL: https://issues.apache.org/jira/browse/KYLIN-1351
 Project: Kylin
  Issue Type: New Feature
Reporter: Shaofeng SHI


>From v2.0, Kylin's plug-in architecture makes it possible to have multiple 
>data sources, cube engines and storages. Some users ever aksed that whether 
>Kylin support source data feeded from RDBMS like Oracle, MySQL, now it is 
>possible to do that. Some tools like Apache Sqoop can easily export data from 
>RDBMS to HDFS, that would help Kylin get the data and then build that into 
>cubes. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1354) Couldn't edit a cube if it has no "partition date" set

2016-01-21 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1354:
---

 Summary: Couldn't edit a cube if it has no "partition date" set
 Key: KYLIN-1354
 URL: https://issues.apache.org/jira/browse/KYLIN-1354
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.2
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


Create a cube which has no partition date column,  save.

Edit this cube, in the "Refresh setting" step, the "Partition Date Column" 
wasn't set (this is correct), the "Start Date" is "1969-12-31" (this is 
confusing), the button "next" is disabled without error/warning message. After 
select a partition column it becomes enabled, while this incorrect as this cube 
isn't partitioned.

 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1355) Add script to build a java-hotfix package, instead of a full package

2016-01-21 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1355:
---

 Summary: Add script to build a java-hotfix package, instead of a 
full package
 Key: KYLIN-1355
 URL: https://issues.apache.org/jira/browse/KYLIN-1355
 Project: Kylin
  Issue Type: Improvement
  Components: Environment 
Reporter: Shaofeng SHI
Assignee: hongbin ma


Sometimes we need deploy only java code change, but he ./scripts/package.sh 
will make a full binary package which cost extra time on building the web 
resources. It would be more efficient if we have a way to package only jar 
files into a tar ball. Then in the server side, we only replace the jars to 
take effective.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1367) Use by-layer cubing algorithm if there is memory hungry measure

2016-01-25 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1367:
---

 Summary: Use by-layer cubing algorithm if there is memory hungry 
measure
 Key: KYLIN-1367
 URL: https://issues.apache.org/jira/browse/KYLIN-1367
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1375) Update tutorials for Kylin v2.0

2016-01-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1375:
---

 Summary: Update tutorials for Kylin v2.0
 Key: KYLIN-1375
 URL: https://issues.apache.org/jira/browse/KYLIN-1375
 Project: Kylin
  Issue Type: Task
  Components: Documentation
Affects Versions: v2.0
Reporter: Shaofeng SHI


V2.0 has many UI changes, the tutorials need be updated.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1377) TopN measure should support more expressions

2016-01-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1377:
---

 Summary: TopN measure should support more expressions
 Key: KYLIN-1377
 URL: https://issues.apache.org/jira/browse/KYLIN-1377
 Project: Kylin
  Issue Type: New Feature
Reporter: Shaofeng SHI
 Fix For: v2.1


TopN should support not only SUM, but also MAX, MIN as the expression.

A possible case is, find out the sellers which sold the top expensive items:

select seller_id, max(price) from sals_records where region = 'US' and year = 
'2015' order by max(price) desc limit 100;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1378) Add UI for TopN measure

2016-01-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1378:
---

 Summary: Add UI for TopN measure
 Key: KYLIN-1378
 URL: https://issues.apache.org/jira/browse/KYLIN-1378
 Project: Kylin
  Issue Type: Sub-task
  Components: Web 
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


Need the user interface for user to define the TopN. User need to selecting: 1) 
the literal column; 2) the metrics column; 3) the expression (default SUM); 4) 
the soring order (default Desc).

A sample is:

{
"name" : "TOP_SELLER",
"function" : {
  "expression" : "TOP_N",
  "parameter" : {
"name": "counter",
"type" : "column",
"value" : "PRICE",
"next_parameter" : {
  "name": "literal",
  "type" : "column",
  "value" : "SELLER_ID",
  "next_parameter" : {
"name": "expression",
"type" : "",
"value" : "SUM",
"next_parameter" : {
  "name": "order",
  "type" : "",
  "value" : "DESC"
}
}
  },
  "returntype" : "topn(100)"
},




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1382) CubeMigrationCLI reports error when migrate cube

2016-01-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1382:
---

 Summary: CubeMigrationCLI reports error when migrate cube
 Key: KYLIN-1382
 URL: https://issues.apache.org/jira/browse/KYLIN-1382
 Project: Kylin
  Issue Type: Bug
  Components: Client - CLI
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.0


With 2.0-rc code, when migrate a cube, it reports error:

{code}
2016-01-29 00:20:34,451 INFO  [main CubeMigrationCLI:282]: Operation index :14
2016-01-29 00:20:34,451 INFO  [main CubeMigrationCLI:303]: Executing operation: 
COPY_ACL:5c658fb6-e2cf-470d-af35-2b88b80ec694, 
fb366d08-d084-4b3e-84f3-d9a3c64e7031, project,
2016-01-29 00:20:34,535 ERROR [main CubeMigrationCLI:286]: error met
java.lang.IllegalArgumentException: to use hbase storage, pls set 
'kylin.storage.url=hbase' in kylin.properties
at 
org.apache.kylin.storage.hbase.HBaseConnection.newHBaseConfiguration(HBaseConnection.java:88)
at 
org.apache.kylin.storage.hbase.HBaseConnection.get(HBaseConnection.java:130)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.doOpt(CubeMigrationCLI.java:425)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.doOpts(CubeMigrationCLI.java:283)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.moveCube(CubeMigrationCLI.java:142)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.moveCube(CubeMigrationCLI.java:150)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.main(CubeMigrationCLI.java:91)
2016-01-29 00:20:34,536 INFO  [main CubeMigrationCLI:287]: Try undoing previous 
changes
2016-01-29 00:20:34,536 INFO  [main CubeMigrationCLI:467]: Undo operation: 
COPY_ACL:5c658fb6-e2cf-470d-af35-2b88b80ec694, 
fb366d08-d084-4b3e-84f3-d9a3c64e7031, Tracking,
2016-01-29 00:20:34,568 ERROR [main CubeMigrationCLI:293]: error met
java.lang.IllegalArgumentException: to use hbase storage, pls set 
'kylin.storage.url=hbase' in kylin.properties
at 
org.apache.kylin.storage.hbase.HBaseConnection.newHBaseConfiguration(HBaseConnection.java:88)
at 
org.apache.kylin.storage.hbase.HBaseConnection.get(HBaseConnection.java:130)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.doOpt(CubeMigrationCLI.java:425)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.doOpts(CubeMigrationCLI.java:283)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.moveCube(CubeMigrationCLI.java:142)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.moveCube(CubeMigrationCLI.java:150)
at 
org.apache.kylin.storage.hbase.util.CubeMigrationCLI.main(CubeMigrationCLI.java:91)
2016-01-29 00:20:34,568 INFO  [main CubeMigrationCLI:294]: Continue undoing...
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1383) Evaluate "deploy.env" in kylin.properties

2016-01-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1383:
---

 Summary: Evaluate "deploy.env" in kylin.properties
 Key: KYLIN-1383
 URL: https://issues.apache.org/jira/browse/KYLIN-1383
 Project: Kylin
  Issue Type: Improvement
  Components: General
Reporter: Shaofeng SHI


In kylin.properties, there is a config "deploy.env", which has three options: 
DEV|QA|PROD, but there is no explaination on what behaviors/impaction of each. 
Need re-evaluate this property and decide whether keep it there.
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1385) Enhance FactDistinctColumnsJob to use multiple reducers

2016-01-31 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1385:
---

 Summary: Enhance FactDistinctColumnsJob to use multiple reducers
 Key: KYLIN-1385
 URL: https://issues.apache.org/jira/browse/KYLIN-1385
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Affects Versions: v1.2, v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.1


So far the FactDistinctColumnsJob is using 1 reducer to merge the distinct 
values for all dimension columns on fact table. When there are 1 or multiple 
ultra high cardinality columns, it couldn't gain concurrency; It should 
dispatch the workload to multiple reducers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1387) Streaming cubing doesn't generate cuboids files on HDFS, cause cube merge failure

2016-01-31 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1387:
---

 Summary: Streaming cubing doesn't generate cuboids files on HDFS, 
cause cube merge failure
 Key: KYLIN-1387
 URL: https://issues.apache.org/jira/browse/KYLIN-1387
 Project: Kylin
  Issue Type: Bug
  Components: streaming
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.1


In an early version of Kylin streaming, the HBaseMROutput2.java will use HBase 
Scan API to do the merge. Later Yang refactored that into 
HBaseMROutput2Transition.java, while keep using the cuboids files on HDFS to do 
the merge. But the StreamingBatchBuilder didn't generate the cuboids, which 
causes the merge failed to run.

Solution: as a workaround, the StreamingBatchBuilder should leave cuboids files 
when building micro segments.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1392) Support fixedLength encoding for TopN measure

2016-02-01 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1392:
---

 Summary: Support fixedLength encoding for TopN measure
 Key: KYLIN-1392
 URL: https://issues.apache.org/jira/browse/KYLIN-1392
 Project: Kylin
  Issue Type: Sub-task
  Components: Job Engine, Metadata, Query Engine
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Dong Li


Currently TopN measure uses only dictionary encoding for the literal column; It 
should be made as configurable, so user can select to use fixedLength to encode 
that.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1393) Streaming: add a parameter for "max-gap-to-fill" at cube level

2016-02-02 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1393:
---

 Summary: Streaming: add a parameter for "max-gap-to-fill" at cube 
level
 Key: KYLIN-1393
 URL: https://issues.apache.org/jira/browse/KYLIN-1393
 Project: Kylin
  Issue Type: Improvement
  Components: streaming
Affects Versions: v2.0
Reporter: Shaofeng SHI
 Fix For: v2.1


When there is a gap (or say hole) be found in a streaming cube, kylin will try 
to build a segment for that gap at one time; If the gap is big, it may not be 
able to finish the building in-memory. So kylin should provide a configurable 
max-gap-to-fill at the cube level, user can customize that by evaluating the 
speed that data comes. Then a big gap will be splitted as several small gaps 
which can be built in sequence.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1407) Update the UI page for calculate table cardinality

2016-02-05 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1407:
---

 Summary: Update the UI page for calculate table cardinality
 Key: KYLIN-1407
 URL: https://issues.apache.org/jira/browse/KYLIN-1407
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.0, v1.1, v1.2
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


There are a couple issues in current "Calculate Cardinality" page:

1. the "Delimiter" and "Format" is not needed since kylin change to use 
hcatalog to read hive table; they should be removed from UI

2. the page title is inaccurate

3. should allow user to select (or enter) a partition information, so only need 
scan specific partition to get the rough cardinality info.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1412) Widget width is too small to select

2016-02-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1412:
---

 Summary: Widget width is too small to select
 Key: KYLIN-1412
 URL: https://issues.apache.org/jira/browse/KYLIN-1412
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1413) Row key column's sequence is wrong after saving the cube

2016-02-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1413:
---

 Summary: Row key column's sequence is wrong after saving the cube
 Key: KYLIN-1413
 URL: https://issues.apache.org/jira/browse/KYLIN-1413
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v2.1
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


With the latest build on 2.x-staging, I create a cube, and adjusted the rowkey 
sequence with my preference, while after saving the cube, I found the sequence 
was wrong. Even re-edit the cube, still got the issue.

This need be double checked against the 2.0-rc to ensure it doesn't exist in 
2.0.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1414) Couldn't drag and drop rowkey, js error is thrown in browser console

2016-02-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1414:
---

 Summary: Couldn't drag and drop rowkey, js error is thrown in 
browser console
 Key: KYLIN-1414
 URL: https://issues.apache.org/jira/browse/KYLIN-1414
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v2.0


Create a new cube with the web wizard, in the "Advanced setting" step, try to 
drag and drop a column in the "Rowkeys" section, but found it couldn't be d&d; 
In the browser console (Chrome), there is JS error:

{code}
  TypeError: element.sortable is not a function
at link (http://kylin-qa/kylin/js/scripts.min.0.js:32925:19)
at nodeLinkFn (http://kylin-qa/kylin/js/scripts.min.0.js:6780:13)
at compositeLinkFn (http://kylin-qa/kylin/js/scripts.min.0.js:6167:13)
at compositeLinkFn (http://kylin-qa/kylin/js/scripts.min.0.js:6170:13)
at publicLinkFn (http://kylin-qa/kylin/js/scripts.min.0.js:6063:30)
at boundTranscludeFn (http://kylin-qa/kylin/js/scripts.min.0.js:6187:21)
at controllersBoundTransclude 
(http://kylin-qa/kylin/js/scripts.min.0.js:6801:18)
at Object.ngIfWatchAction [as fn] 
(http://kylin-qa/kylin/js/scripts.min.0.js:19799:15)
at Scope.$digest (http://kylin-qa/kylin/js/scripts.min.0.js:12702:29)
at Scope.$apply (http://kylin-qa/kylin/js/scripts.min.0.js:12967:24)

{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1415) Cube parallel merge

2016-02-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1415:
---

 Summary: Cube parallel merge
 Key: KYLIN-1415
 URL: https://issues.apache.org/jira/browse/KYLIN-1415
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine, Metadata, streaming
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Investigate to remove the restriction on cube merge: only 1 merge job is 
allowed for a cube at one time. This is important especially for streaming 
cubing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1417) TimedJsonStreamParser is case sensitive for message's property name

2016-02-14 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1417:
---

 Summary: TimedJsonStreamParser is case sensitive for message's 
property name
 Key: KYLIN-1417
 URL: https://issues.apache.org/jira/browse/KYLIN-1417
 Project: Kylin
  Issue Type: Bug
  Components: streaming
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.0


TimedJsonStreamParser is case sensitive for message's property name, this is 
wrong; It should be case insensitive.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1420) Query returns empty result on partition column's boundary condition

2016-02-14 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1420:
---

 Summary: Query returns empty result on partition column's boundary 
condition
 Key: KYLIN-1420
 URL: https://issues.apache.org/jira/browse/KYLIN-1420
 Project: Kylin
  Issue Type: Bug
  Components: Query Engine
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.0


This issue is reported by an user. The cube has a "month_end_dt" column as the 
partition column. In the first segment, it built the data from 2015-01-01 to 
2016-01-01; The second segment it built the data after 2016-01-01.

The queries which doesn't have filtering condition works as expected, such as:
select month_end_dt,count(*) from fact_table group by month_end_dt order by 
month_end_dt;

The result set includes the records for each month;

But when filter on the month_end_date='2015-12-31' it is returning 0 records:
select month_end_dt, count(*) from fact_table where month_end_dt='2015-12-31' 
group by month_end_dt 

After doing some investigation, identify it is a bug in v2 query engine: In 
CubeSegmentScanner.getSegmentStartAndEnd(), it uses -1 as 'roundingFlag' for 
cubeSeg.getDateRangeEnd(), which may get a smaller value than the cubeSegment's 
end date; While in GTScanRangePlanner.newScanRange(), it still treat the value 
as a closed condition, that caused the query on boundary condition returns 
empty directly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1421) Cube "source record" is always zero for streaming

2016-02-15 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1421:
---

 Summary: Cube "source record" is always zero for streaming
 Key: KYLIN-1421
 URL: https://issues.apache.org/jira/browse/KYLIN-1421
 Project: Kylin
  Issue Type: Bug
  Components: streaming
Affects Versions: v2.0
Reporter: Shaofeng SHI


The streaming cube engine doesn't collect the source record number, which cause 
the total record number for the cube is always be zero on UI.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1423) HBase size precision issue

2016-02-16 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1423:
---

 Summary: HBase size precision issue
 Key: KYLIN-1423
 URL: https://issues.apache.org/jira/browse/KYLIN-1423
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


With v2.0, in a cube's "HBase" tab, it shows:

{code}
HTable: KYLIN_KDKGMXLZRJ

Region Count: 2
Size: 260.0 MB
Start Time: 2016-02-16 16:40:00
End Time: 2016-02-16 20:40:00
HTable: KYLIN_FR2EIUHZ28

Region Count: 2
Size: 0 bytes
Start Time: 2016-02-16 20:40:00
End Time: 2016-02-17 00:40:00
{code}

Two issues about the "Size" display:

1) the size has 5 precision is not accurate at all because the nubmer after .  
are all 0. Just "260 MB" is good.
2) the second table shows "0 bytes" seems like an empty table, while it has 
many records; if the size is small or failed to get the size, show "N/A" would 
be better than show "0 bytes"



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1424) Should support multiple selection in picking up dimension/measure column step in data model wizard

2016-02-16 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1424:
---

 Summary: Should support multiple selection in picking up 
dimension/measure column step in data model wizard
 Key: KYLIN-1424
 URL: https://issues.apache.org/jira/browse/KYLIN-1424
 Project: Kylin
  Issue Type: Improvement
  Components: Web 
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


I like the UI of "Auto Generator" in "Dimesion" step of the cube wizard, which 
allows me to select multiple colums at once; hope the UI in data model wizard 
can follow the same style to give better user experience.

Besides, maybe the "Auto Generator" is not the best name, it can be another 
name like "Add in batch".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1429) First segment couldn't be detected/shown, only after second is built it appears

2016-02-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1429:
---

 Summary: First segment couldn't be detected/shown, only after 
second is built it appears
 Key: KYLIN-1429
 URL: https://issues.apache.org/jira/browse/KYLIN-1429
 Project: Kylin
  Issue Type: Bug
  Components: streaming
Affects Versions: v2.0
Reporter: Shaofeng SHI


Create a new streaming cube, start the micro batch build, after the first build 
finish, on web UI it still shows no segment; After finish the second micro 
batch, on web it will show the two segments at once; 

The problem only exists in the first segment.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1430) Not add "STREAMING_" prefix when import a streaming table

2016-02-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1430:
---

 Summary: Not add "STREAMING_" prefix when import a streaming table
 Key: KYLIN-1430
 URL: https://issues.apache.org/jira/browse/KYLIN-1430
 Project: Kylin
  Issue Type: Bug
  Components: Metadata, Web 
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v2.0


Now when add a streaming table, when enter a table name, Kylin will 
automatically add a prefix on saving; This might not be expected, as the query 
may expect to use the original table name, user need keep control on it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1431) Define stream config at table level, instead of on cube level

2016-02-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1431:
---

 Summary: Define stream config at table level, instead of on cube 
level
 Key: KYLIN-1431
 URL: https://issues.apache.org/jira/browse/KYLIN-1431
 Project: Kylin
  Issue Type: Improvement
  Components: Metadata, streaming, Web 
Affects Versions: v2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


In 2.0 streaming, user need enter the kafka information when create the cube, 
like the topic, the broker list, etc; while these info should be independent 
with cube, and can be reused across cubes which share the same table.

The expected design is, define kafka config when adding the table.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1456) Shouldn't use "1970-01-01" as the default end date

2016-02-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1456:
---

 Summary: Shouldn't use "1970-01-01" as the default end date
 Key: KYLIN-1456
 URL: https://issues.apache.org/jira/browse/KYLIN-1456
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.3
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.3
 Attachments: Screen Shot 2016-02-28 at 9.13.44 AM.png

Create a new cube, set 2016-01-01 as start date; Build the first segment, kylin 
shows the dialogue asking for selecting the end date, while the default end 
date is 1970-01-01. Please see the attachement.

Expected behavior: leave blank for the end date.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1477) TopN measure should support ascending order

2016-03-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1477:
---

 Summary: TopN measure should support ascending order
 Key: KYLIN-1477
 URL: https://issues.apache.org/jira/browse/KYLIN-1477
 Project: Kylin
  Issue Type: Wish
Affects Versions: v2.0
Reporter: Shaofeng SHI
 Fix For: Backlog


So far the TopN measure only support in descending order; it should support 
ascending order, for the case that need to find the lowest metrics.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1478) TopN measure should support non-dictionary encoding for ultra high cardinality

2016-03-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1478:
---

 Summary: TopN measure should support non-dictionary encoding for 
ultra high cardinality
 Key: KYLIN-1478
 URL: https://issues.apache.org/jira/browse/KYLIN-1478
 Project: Kylin
  Issue Type: Wish
Affects Versions: v2.0
Reporter: Shaofeng SHI
 Fix For: Future


TopN measure uses dictionary to encode the literal column, that may not work 
when the cardinality is ultra high. Need support other encoding like 
fixedLength.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1499) Couldn't save query, error in backend

2016-03-15 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1499:
---

 Summary: Couldn't save query, error in backend
 Key: KYLIN-1499
 URL: https://issues.apache.org/jira/browse/KYLIN-1499
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


Run kylin in sandbox, spring profile "testing"; after run a SQL query, click 
"Save" to save the query, there is no response in UI, in kylin.log there is 
error:

{code}
2016-03-16 14:11:52,071 ERROR [http-bio-7070-exec-9] 
controller.BasicController:44 :
org.apache.hadoop.hbase.TableNotFoundException: kylin_metadata_user
at 
org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.locateRegionInMeta(ConnectionManager.java:1352)
at 
org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.locateRegion(ConnectionManager.java:1167)
at 
org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.locateRegion(ConnectionManager.java:1150)
at 
org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.locateRegion(ConnectionManager.java:1107)
at 
org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.getRegionLocation(ConnectionManager.java:941)
at 
org.apache.hadoop.hbase.client.RegionServerCallable.prepare(RegionServerCallable.java:78)
at 
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:125)
at org.apache.hadoop.hbase.client.HTable.get(HTable.java:854)
at 
org.apache.kylin.rest.service.QueryService.getQueries(QueryService.java:188)
at 
org.apache.kylin.rest.service.QueryService.saveQuery(QueryService.java:126)
at 
org.apache.kylin.rest.service.QueryService$$FastClassByCGLIB$$4957273f.invoke()
at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:204)
at 
org.springframework.aop.framework.Cglib2AopProxy$DynamicAdvisedInterceptor.intercept(Cglib2AopProxy.java:618)
at 
org.apache.kylin.rest.service.QueryService$$EnhancerByCGLIB$$fbf31bec.saveQuery()
at 
org.apache.kylin.rest.controller.QueryController.saveQuery(QueryController.java:111)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)

{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1507) Couldn't find hive dependency jar on some platform like CDH

2016-03-19 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1507:
---

 Summary: Couldn't find hive dependency jar on some platform like 
CDH
 Key: KYLIN-1507
 URL: https://issues.apache.org/jira/browse/KYLIN-1507
 Project: Kylin
  Issue Type: Bug
  Components: General
Affects Versions: v1.5.0
Reporter: Shaofeng SHI


Reported by user ianzeng  in u...@kylin.apache.org mailing list:

I has installed kylin 1.5 on redhead 6.3. I try build sample cube. But 
got error msg as follow:

2016-03-18 18:18:43,084 WARN [main] org.apache.hadoop.conf.Configuration: 
job.xml:an attempt to override final parameter: hadoop.ssl.server.conf;  
Ignoring.
2016-03-18 18:18:43,093 WARN [main] org.apache.hadoop.conf.Configuration: 
job.xml:an attempt to override final parameter: 
mapreduce.job.end-notification.max.attempts;  Ignoring.
2016-03-18 18:18:43,509 INFO [main] 
org.apache.hadoop.conf.Configuration.deprecation: session.id is deprecated. 
Instead, use dfs.metrics.session-id
2016-03-18 18:18:43,921 INFO [main] 
org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter: File Output 
Committer Algorithm version is 1
2016-03-18 18:18:43,933 INFO [main] org.apache.hadoop.mapred.Task:  Using 
ResourceCalculatorProcessTree : [ ]
2016-03-18 18:18:44,120 WARN [main] org.apache.hadoop.mapred.YarnChild: 
Exception running child : java.lang.RuntimeException: 
java.lang.ClassNotFoundException: Class 
org.apache.hive.hcatalog.mapreduce.HCatInputFormat not found
at 
org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2047)
at 
org.apache.hadoop.mapreduce.task.JobContextImpl.getInputFormatClass(JobContextImpl.java:184)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:746)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1642)
at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
Caused by: java.lang.ClassNotFoundException: Class 
org.apache.hive.hcatalog.mapreduce.HCatInputFormat not found
at 
org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:1953)
at 
org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2045)
... 8 more

And 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1521) Couldn't refresh a cube segment whose start time is before 1970-01-01

2016-03-22 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1521:
---

 Summary: Couldn't refresh a cube segment whose start time is 
before 1970-01-01
 Key: KYLIN-1521
 URL: https://issues.apache.org/jira/browse/KYLIN-1521
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.3.0, v1.5.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.3.1, v1.5.1


When refresh a segment whose start date is before 1970-01-01, the "Submit" 
button is disabled, user couldn't proceed.

The root cause is, in refresh page it checks whether the start time is > 0, if 
not the button is disabled; This check should be removed to support earlier 
dates. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1524) Get "java.lang.Double cannot be cast to java.lang.Long" error when Top-N metris data type is BigInt

2016-03-22 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1524:
---

 Summary: Get "java.lang.Double cannot be cast to java.lang.Long" 
error when Top-N metris data type is BigInt
 Key: KYLIN-1524
 URL: https://issues.apache.org/jira/browse/KYLIN-1524
 Project: Kylin
  Issue Type: Bug
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
 Fix For: v1.5.1






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1528) Create a branch for v1.5 with HBase 1.1 API

2016-03-23 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1528:
---

 Summary: Create a branch for v1.5 with HBase 1.1 API
 Key: KYLIN-1528
 URL: https://issues.apache.org/jira/browse/KYLIN-1528
 Project: Kylin
  Issue Type: Improvement
  Components: Storage - HBase
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Create a new branch for Kylin v1.5 compile with HBase v1.1 API.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1537) Failed to create flat hive table, when name is too long

2016-03-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1537:
---

 Summary: Failed to create flat hive table, when name is too long
 Key: KYLIN-1537
 URL: https://issues.apache.org/jira/browse/KYLIN-1537
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v1.5.1


Run Kylin regression test on HDP 2.4, Hive version 1.2.1.2.4; A cube couldn't 
build, but others can; Got error in the first step:

{code}
OS command error exit with 12 -- hive -e "USE default;
DROP TABLE IF EXISTS 
kylin_intermediate_test_kylin_cube_with_slr_desc_1970010100_2015010100;

CREATE EXTERNAL TABLE IF NOT EXISTS 
kylin_intermediate_test_kylin_cube_with_slr_desc_1970010100_2015010100
(
DEFAULT_TEST_KYLIN_FACT_CAL_DT date
,DEFAULT_TEST_KYLIN_FACT_LEAF_CATEG_ID bigint
,DEFAULT_TEST_KYLIN_FACT_LSTG_SITE_ID int
,DEFAULT_TEST_CATEGORY_GROUPINGS_META_CATEG_NAME string
,DEFAULT_TEST_CATEGORY_GROUPINGS_CATEG_LVL2_NAME string
,DEFAULT_TEST_CATEGORY_GROUPINGS_CATEG_LVL3_NAME string
,DEFAULT_TEST_KYLIN_FACT_LSTG_FORMAT_NAME string
,DEFAULT_TEST_KYLIN_FACT_SLR_SEGMENT_CD smallint
,DEFAULT_TEST_KYLIN_FACT_SELLER_ID bigint
,DEFAULT_TEST_KYLIN_FACT_PRICE decimal(19,4)
,DEFAULT_TEST_KYLIN_FACT_ITEM_COUNT int
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\177'
STORED AS SEQUENCEFILE
LOCATION 
'/kylin/kylin_metadata/kylin-a28a48e4-83a2-4b71-b06d-9a281540507d/kylin_intermediate_test_kylin_cube_with_slr_desc_1970010100_2015010100';

SET dfs.replication=2;
SET hive.exec.compress.output=true;
SET hive.auto.convert.join.noconditionaltask=true;
SET hive.auto.convert.join.noconditionaltask.size=3;
SET hive.merge.mapfiles=true;
SET hive.merge.mapredfiles=true;
SET hive.merge.size.per.task=6400;
INSERT OVERWRITE TABLE 
kylin_intermediate_test_kylin_cube_with_slr_desc_1970010100_2015010100 
SELECT
TEST_KYLIN_FACT.CAL_DT
,TEST_KYLIN_FACT.LEAF_CATEG_ID
,TEST_KYLIN_FACT.LSTG_SITE_ID
,TEST_CATEGORY_GROUPINGS.META_CATEG_NAME
,TEST_CATEGORY_GROUPINGS.CATEG_LVL2_NAME
,TEST_CATEGORY_GROUPINGS.CATEG_LVL3_NAME
,TEST_KYLIN_FACT.LSTG_FORMAT_NAME
,TEST_KYLIN_FACT.SLR_SEGMENT_CD
,TEST_KYLIN_FACT.SELLER_ID
,TEST_KYLIN_FACT.PRICE
,TEST_KYLIN_FACT.ITEM_COUNT
FROM DEFAULT.TEST_KYLIN_FACT as TEST_KYLIN_FACT 
INNER JOIN EDW.TEST_CAL_DT as TEST_CAL_DT
ON TEST_KYLIN_FACT.CAL_DT = TEST_CAL_DT.CAL_DT
INNER JOIN DEFAULT.TEST_CATEGORY_GROUPINGS as TEST_CATEGORY_GROUPINGS
ON TEST_KYLIN_FACT.LEAF_CATEG_ID = TEST_CATEGORY_GROUPINGS.LEAF_CATEG_ID AND 
TEST_KYLIN_FACT.LSTG_SITE_ID = TEST_CATEGORY_GROUPINGS.SITE_ID
INNER JOIN EDW.TEST_SITES as TEST_SITES
ON TEST_KYLIN_FACT.LSTG_SITE_ID = TEST_SITES.SITE_ID
INNER JOIN EDW.TEST_SELLER_TYPE_DIM as TEST_SELLER_TYPE_DIM
ON TEST_KYLIN_FACT.SLR_SEGMENT_CD = TEST_SELLER_TYPE_DIM.SELLER_TYPE_CD
WHERE (TEST_KYLIN_FACT.CAL_DT < '2015-01-01')
;

"
WARNING: Use "yarn jar" to launch YARN applications.

Logging initialized using configuration in 
file:/etc/hive/2.4.0.0-169/0/hive-log4j.properties
OK
Time taken: 1.812 seconds
OK
Time taken: 5.622 seconds
FAILED: Hive Internal Error: 
org.codehaus.jettison.json.JSONException(JSONArray[0] is not a JSONObject.)
org.codehaus.jettison.json.JSONException: JSONArray[0] is not a JSONObject.
at 
org.codehaus.jettison.json.JSONArray.getJSONObject(JSONArray.java:258)
at 
org.apache.atlas.hive.bridge.HiveMetaStoreBridge.getEntityReferenceFromDSL(HiveMetaStoreBridge.java:193)
at 
org.apache.atlas.hive.bridge.HiveMetaStoreBridge.getTableReference(HiveMetaStoreBridge.java:221)
at 
org.apache.atlas.hive.bridge.HiveMetaStoreBridge.registerTable(HiveMetaStoreBridge.java:276)
at 
org.apache.atlas.hive.hook.HiveHook.handleCreateTable(HiveHook.java:271)
at org.apache.atlas.hive.hook.HiveHook.fireAndForget(HiveHook.java:205)
at org.apache.atlas.hive.hook.HiveHook.run(HiveHook.java:172)
at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1585)
at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1254)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1118)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1108)
at 
org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:216)
at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:168)
at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:379)
at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:314)
at 
org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:711)
at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:684)
at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:624)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodA

[jira] [Created] (KYLIN-1538) DoubleDeltaSerializer case obvious error after deserialize and serialize

2016-03-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1538:
---

 Summary: DoubleDeltaSerializer case obvious error after 
deserialize and serialize 
 Key: KYLIN-1538
 URL: https://issues.apache.org/jira/browse/KYLIN-1538
 Project: Kylin
  Issue Type: Bug
  Components: General
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Assignee: liyang
 Fix For: v1.5.1






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1540) REST API for deleting segment

2016-03-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1540:
---

 Summary: REST API for deleting segment
 Key: KYLIN-1540
 URL: https://issues.apache.org/jira/browse/KYLIN-1540
 Project: Kylin
  Issue Type: New Feature
  Components: REST Service
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v1.5.1


To allow user have more flexibility in managing the cube segments, Kylin need a 
REST API to delete segment. But the deletion can only happen on head or tail 
segment, one segment one time; 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1551) Should check and ensure TopN measure has two parameters specified

2016-03-30 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1551:
---

 Summary: Should check and ensure TopN measure has two parameters 
specified
 Key: KYLIN-1551
 URL: https://issues.apache.org/jira/browse/KYLIN-1551
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.1


User from mailing list reported an error when using TopN:
{code}
when I use TOP-N, exception occured

java.lang.IndexOutOfBoundsException: Index: 1, Size: 1
at java.util.ArrayList.rangeCheck(ArrayList.java:635)
at java.util.ArrayList.get(ArrayList.java:411)
at 
org.apache.kylin.measure.topn.TopNMeasureType.getColumnsNeedDictionary(TopNMeasureType.java:170)
at 
org.apache.kylin.cube.model.CubeDesc.getAllColumnsNeedDictionary(CubeDesc.java:975)
at 
org.apache.kylin.cube.CubeManager.getAllDictColumnsOnFact(CubeManager.java:904)
at 
org.apache.kylin.engine.mr.steps.FactDistinctColumnsJob.run(FactDistinctColumnsJob.java:80)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
at 
org.apache.kylin.engine.mr.common.MapReduceExecutable.doWork(MapReduceExecutable.java:118)
at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:114)
at 
org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:114)
at 
org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:124)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)

{code}

The reason should be: no second parameter be specified for TopN. I can 
reproduce it on v1.5.0; The cube GUI should check and report to user when find 
such a case.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1553) Cannot find rowkey column "COL_NAME" in cube CubeDesc

2016-03-31 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1553:
---

 Summary: Cannot find rowkey column "COL_NAME" in cube CubeDesc
 Key: KYLIN-1553
 URL: https://issues.apache.org/jira/browse/KYLIN-1553
 Project: Kylin
  Issue Type: Bug
  Components: Metadata, Web 
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.1


Create a cube, select a lookup table's PK as dimension column, save the cube 
Kylin reports error "Cannot find rowkey column SITE_ID in cube CubeDesc 
[name=my_cube]"

Check the JSON of the cube definition, the PK wasn't replaced with the FK on 
fact table (expected behavior). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1557) Enhance the check on aggregation group dimension number

2016-04-04 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1557:
---

 Summary: Enhance the check on aggregation group dimension number
 Key: KYLIN-1557
 URL: https://issues.apache.org/jira/browse/KYLIN-1557
 Project: Kylin
  Issue Type: Improvement
  Components: Metadata
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v1.5.1


If the dimension number in an aggregation group exceeds the number of 
"kylin.cube.aggrgroup.max.size" (default 12), Kylin will throw an error saying 
"Aggregation group  has too many dimensions"; User has to remove some 
dimensions to proceed.

This check adds up the number of mandantory dimensions, hierarchy, joint and 
normal dimensions; This is too strong. The check should be optimized, all 
mandantory + joint dimensions should be treated as 1; hierarchy dimensions 
should be treated as N; nomal dimensions be 2^N; Control total combination 
numbers at a certain level.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1560) Make BatchCubingJobBuilder2 easier to add additional step

2016-04-05 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1560:
---

 Summary: Make BatchCubingJobBuilder2 easier to add additional step
 Key: KYLIN-1560
 URL: https://issues.apache.org/jira/browse/KYLIN-1560
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
Priority: Minor


Add one or two empty methods in BatchCubingJobBuilder2 to make subclasses can 
easily insert steps before/after cubing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1566) Add and use a separate kylin_job_conf.xml for in-mem cubing

2016-04-06 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1566:
---

 Summary: Add and use a separate kylin_job_conf.xml for in-mem 
cubing
 Key: KYLIN-1566
 URL: https://issues.apache.org/jira/browse/KYLIN-1566
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Assignee: Dong Li


>From v1.5.0, kylin has two cubing algorithm: by-layer and by-split(in-mem).

The in-mem cubing require more memory resource for the mapper; so far it is 
configured by parameter "kylin.job.cubing.inmem.mrjob_conf_override" with 
default value "mapreduce.map.java.opts=-Xmx2700m;  
mapreduce.map.memory.mb=3072;  mapreduce.task.io.sort.mb=200"; This is a little 
confusing as user can also configure such parameters in kylin_job_conf.xml

A better way is adding a new config file for in-mem cubing job, moving the 
in-mem specific mapreduce parameters there.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1567) Front-end work for KYLIN-1557

2016-04-06 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1567:
---

 Summary: Front-end work for KYLIN-1557
 Key: KYLIN-1567
 URL: https://issues.apache.org/jira/browse/KYLIN-1567
 Project: Kylin
  Issue Type: Improvement
  Components: Web 
Affects Versions: v1.5.1
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.2


KYLIN-1557 will add check on the aggregation group combinations; If an agg 
group's combination number exceeds a threshold, it will report error. So the 
cube desc generated by JS should better be good by default; A simple 
implementation would be: if user selects <= 12 dimensions, it is okay to put 
them all in one group's "include" field; If > 12, keep 11 in the "include" 
field, and move all left to the "joint" list; 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1580) Use 1 byte instead of 8 bytes as column indicator in fact distinct MR job

2016-04-12 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1580:
---

 Summary: Use 1 byte instead of 8 bytes as column indicator in fact 
distinct MR job
 Key: KYLIN-1580
 URL: https://issues.apache.org/jira/browse/KYLIN-1580
 Project: Kylin
  Issue Type: Improvement
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v1.5.2


One small enhancement in fact distinct job; today it uses 8 bytes in the head 
of the key to indicates the column index as well as cuboid id; actually it can 
be optimized to 1 byte for column index, and next 8 for cuboid id if the first 
is a special value; this will reduce the network IO between mappers and 
reducers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1581) NPE in Job engine when execute MR job

2016-04-13 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1581:
---

 Summary: NPE in Job engine when execute MR job
 Key: KYLIN-1581
 URL: https://issues.apache.org/jira/browse/KYLIN-1581
 Project: Kylin
  Issue Type: Bug
Affects Versions: v1.5.1, v1.5.0
Reporter: Shaofeng SHI


An NPE throws when executing a job step:

{code}
org.apache.kylin.job.exception.ExecuteException: java.lang.NullPointerException
at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:124)
at 
org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:124)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.NullPointerException
at 
org.apache.kylin.engine.mr.common.MapReduceExecutable.onExecuteStart(MapReduceExecutable.java:77)
at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:105)
at 
org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
at 
org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:114)
... 4 more

{code}

It looks like that, this step had been executed before, maybe after a restart 
it wants to check the job's status from hadoop, but the recorded job id wasn't 
found in hadoop, caused this NPE. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1582) Add a tutorial for building a streaming cube step by step

2016-04-13 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1582:
---

 Summary: Add a tutorial for building a streaming cube step by step
 Key: KYLIN-1582
 URL: https://issues.apache.org/jira/browse/KYLIN-1582
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1623) Make the hll precision for data samping configurable

2016-04-26 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1623:
---

 Summary: Make the hll precision for data samping configurable
 Key: KYLIN-1623
 URL: https://issues.apache.org/jira/browse/KYLIN-1623
 Project: Kylin
  Issue Type: New Feature
  Components: Job Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


now kylin uses hll(14) for sampling; it should be configurable at cube level 
for small or large cubes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1631) Web UI of TopN, "group by" column only need fact column

2016-04-26 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1631:
---

 Summary: Web UI of TopN, "group by" column only need fact column
 Key: KYLIN-1631
 URL: https://issues.apache.org/jira/browse/KYLIN-1631
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.2


As Top-N is for the long-tail issue, the "group by" column should be from fact 
table. So far the web ui shows all columns (fact + lookup), that is more than 
needed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1643) Kylin's hll counter count "NULL" as a value

2016-05-02 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1643:
---

 Summary: Kylin's hll counter count "NULL" as a value
 Key: KYLIN-1643
 URL: https://issues.apache.org/jira/browse/KYLIN-1643
 Project: Kylin
  Issue Type: Bug
  Components: General
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Reported by Xie Linjiong, he found Kylin's HyperLogLog count NULL as 1, but 
Hive count it as 0; Below is a sample:


A. source table:
hive> select * from temp.temp_kylin_null_test_20160429;
OK
10003   NULL
10002   NULL
10001   male

B. hive's query result:
hive> select count(distinct gender)],count(*) from 
temp.temp_kylin_null_test_20160429 where user_id in (10002,10003);

OK
0   2

C. Kylin's query result:

1  2




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1646) Column appeared twice if it was declared as both dimension and measure

2016-05-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1646:
---

 Summary: Column appeared twice if it was declared as both 
dimension and measure
 Key: KYLIN-1646
 URL: https://issues.apache.org/jira/browse/KYLIN-1646
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Attachments: Screen Shot 2016-05-03 at 9.50.50 AM.png

Create a data model, define a column as both dimension and measure (this is 
reasonable as the model may have several cubes). When create the cube measure, 
it shows the column name twice, please see the screen shot.

Expected behavior, the list option should not have duplication.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1647) Purge a cube, and then build again, the start date is not updated

2016-05-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1647:
---

 Summary: Purge a cube, and then build again, the start date is not 
updated
 Key: KYLIN-1647
 URL: https://issues.apache.org/jira/browse/KYLIN-1647
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.1
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


Have a cube which has been built; disable it and then click "Purge", the purge 
got succeed; then I click "Build" to rebuild it, the web UI shows a dialogue to 
enter the end date; but the start date is not the very begining of this cube, 
it is the end date of the segment that I just purged.

Although refresh the web page can flush the cache, expect it's behavior is 
correct without extra action.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1660) Streaming/kafka config not match with table name

2016-05-05 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1660:
---

 Summary: Streaming/kafka config not match with table name
 Key: KYLIN-1660
 URL: https://issues.apache.org/jira/browse/KYLIN-1660
 Project: Kylin
  Issue Type: Bug
  Components: streaming, Web 
Affects Versions: v1.5.1
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.2


Define a streaming table, in the "Table name", if enter a name without db 
schema, like "STREAMING_TBL", the save can get success; but when trigger the 
streaming cube build, an exception "Table DEFAULT.STREAMING_TBL is not a 
streaming table." will be thrown. In the metastore, I see the streaming conf 
and kafka conf are named with the origin table name ( "STREAMING_TBL" in this 
case), while the table desc, its name is " "DEFAULT.STREAMING_TBL"

Expected behavior: automatically append db schema for the table before saving 
the stream config.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1666) Should check project selection before add a stream table

2016-05-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1666:
---

 Summary: Should check project selection before add a stream table
 Key: KYLIN-1666
 URL: https://issues.apache.org/jira/browse/KYLIN-1666
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.1, v1.5.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


In Kylin web, if not select a "project", the page will display "Please select 
your project first", but I still be able to click "Add streaming table"; when 
save the table info, a NPE was thrown:

{code}
java.lang.NullPointerException
at 
java.util.concurrent.ConcurrentSkipListMap.comparable(ConcurrentSkipListMap.java:659)
at 
java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:821)
at 
java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1626)
at 
org.apache.kylin.common.restclient.SingleValueCache.get(SingleValueCache.java:85)
at 
org.apache.kylin.metadata.project.ProjectManager.getProject(ProjectManager.java:138)
at 
org.apache.kylin.metadata.project.ProjectManager.addTableDescToProject(ProjectManager.java:280)
at 
org.apache.kylin.rest.service.CubeService.syncTableToProject(CubeService.java:580)
at 
org.apache.kylin.rest.service.CubeService$$FastClassByCGLIB$$17a07c0e.invoke()
at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:204)
{code}

Suggestion: don't allow user to take create actions if no project is selected.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1667) Streaming table name should allow enter "DB.TABLE" format

2016-05-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1667:
---

 Summary: Streaming table name should allow enter "DB.TABLE" format
 Key: KYLIN-1667
 URL: https://issues.apache.org/jira/browse/KYLIN-1667
 Project: Kylin
  Issue Type: Bug
  Components: streaming, Web 
Affects Versions: v1.5.1, v1.5.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


Add a streaming table, in the "Table Name" field, if I enter a name 
"DEFAULT.TBL",  an error will be thrown in backend:

{code}
java.lang.IllegalStateException: Cannot find table 'null' in metadata manager
at 
org.apache.kylin.metadata.project.ProjectManager.addTableDescToProject(ProjectManager.java:284)
at 
org.apache.kylin.rest.service.CubeService.syncTableToProject(CubeService.java:580)
at 
org.apache.kylin.rest.service.CubeService$$FastClassByCGLIB$$17a07c0e.invoke()
at net.sf.cglib.proxy.MethodProxy.invoke(MethodProxy.java:204)

{code}

Kylin UI should support custom DB name when creating a streaming table. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1668) Rowkey column shouldn't allow delete and add

2016-05-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1668:
---

 Summary: Rowkey column shouldn't allow delete and add
 Key: KYLIN-1668
 URL: https://issues.apache.org/jira/browse/KYLIN-1668
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.1, v1.3.0, v1.5.0
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


Every column in rowkey are required, couldn't be deleted. And no need to add 
other columns. The "-" icon and "New Rowkey Column" button should be removed 
from the UI.

Besides, please add some text to let user know the column sequence is 
adjustable with drag&drop. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1669) Sunset the "Capacity" field from DataModel

2016-05-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1669:
---

 Summary: Sunset the "Capacity" field from DataModel
 Key: KYLIN-1669
 URL: https://issues.apache.org/jira/browse/KYLIN-1669
 Project: Kylin
  Issue Type: Improvement
  Components: Metadata, Web 
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Kylin used to use the "Capacity" on model as the hint to build the cube; While 
as only three options (SMALL, MEDIUM, LARGE), it couldn't give much 
flexibility, and may confuse user.

As today kylin allows cube level to overwrite the config in kylin.properties, 
many parameters can be specified on cube, that would give user much 
flexibility; So the "Capacity" concept is redundant, can sunset it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1677) Distribute source data by certain columns when creating flat table

2016-05-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1677:
---

 Summary: Distribute source data by certain columns when creating 
flat table
 Key: KYLIN-1677
 URL: https://issues.apache.org/jira/browse/KYLIN-1677
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Inspired by KYLIN-1656, Kylin can distribute the source data by certain columns 
when creating the flat hive table; Then the data assigned to a mapper will have 
more similarity, more aggregation can happen at mapper side, and then less 
shuffle and reduce is needed.

Columns can be used for the distribution includes: ultra high cardinality 
column, mandantory column, partition date/time column, etc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1686) Get "Access is denied" error on clicking "Admin" tab

2016-05-13 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1686:
---

 Summary: Get "Access is denied" error on clicking "Admin" tab
 Key: KYLIN-1686
 URL: https://issues.apache.org/jira/browse/KYLIN-1686
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Reporter: Shaofeng SHI
Assignee: Zhong,Jason


Enable LDAP authentication, login with an analyst account, when clicking the 
"Admin" tab, the page shows an exception:
{code}
org.springframework.security.access.AccessDeniedException: Access is denied
at 
org.springframework.security.access.vote.AffirmativeBased.decide(AffirmativeBased.java:83)
at 
org.springframework.security.access.intercept.AbstractSecurityInterceptor.beforeInvocation(AbstractSecurityInterceptor.java:206)
at 
org.springframework.security.access.intercept.aopalliance.MethodSecurityInterceptor.invoke(MethodSecurityInterceptor.java:60)
at 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:172)
at 
org.springframework.aop.framework.Cglib2AopProxy$DynamicAdvisedInterceptor.intercept(Cglib2AopProxy.java:622)
at 
org.apache.kylin.rest.service.DiagnosisService$$EnhancerByCGLIB$$6cb9c74a.getProjectBadQueryHistory()
at 
io.kyligence.kap.rest.controller.BadQueryController.getBadQuerySql(BadQueryController.java:60)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at 
org.springframework.web.method.support.InvocableHandlerMethod.invoke(InvocableHandlerMethod.java:213)
at 
org.springframework.web.method.support.InvocableHandlerMethod.invokeForRequest(InvocableHandlerMethod.java:126)
at 
org.springframework.web.servlet.mvc.method.annotation.ServletInvocableHandlerMethod.invokeAndHandle(ServletInvocableHandlerMethod.java:96)
at 
org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.invokeHandlerMethod(RequestMappingHandlerAdapter.java:617)
at 
org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.handleInternal(RequestMappingHandlerAdapter.java:578)
at 
org.springframework.web.servlet.mvc.method.AbstractHandlerMethodAdapter.handle(AbstractHandlerMethodAdapter.java:80)
at 
org.springframework.web.servlet.DispatcherServlet.doDispatch(DispatcherServlet.java:923)
at 
org.springframework.web.servlet.DispatcherServlet.doService(DispatcherServlet.java:852)
at 
org.springframework.web.servlet.FrameworkServlet.processRequest(FrameworkServlet.java:882)
at 
org.springframework.web.servlet.FrameworkServlet.doGet(FrameworkServlet.java:778)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:620)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:727)
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1687) Error to select cuboid

2016-05-13 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1687:
---

 Summary: Error to select cuboid
 Key: KYLIN-1687
 URL: https://issues.apache.org/jira/browse/KYLIN-1687
 Project: Kylin
  Issue Type: Bug
  Components: Metadata
Reporter: Shaofeng SHI
Assignee: hongbin ma


Reported by user lancelot chen  from mailing list:

{code}

Caused by: java.util.NoSuchElementException
at java.util.ArrayList$Itr.next(ArrayList.java:794)
at java.util.Collections.min(Collections.java:665)
at 
org.apache.kylin.cube.cuboid.Cuboid.translateToValidCuboid(Cuboid.java:201)
at 
org.apache.kylin.cube.cuboid.Cuboid.translateToValidCuboid(Cuboid.java:125)
at org.apache.kylin.cube.cuboid.Cuboid.findById(Cuboid.java:67)
at 
org.apache.kylin.storage.hbase.cube.v2.CubeStorageQuery.identifyCuboid(CubeStorageQuery.java:183)
at 
org.apache.kylin.storage.hbase.cube.v2.CubeStorageQuery.search(CubeStorageQuery.java:96)
at 
org.apache.kylin.query.enumerator.OLAPEnumerator.queryStorage(OLAPEnumerator.java:125)
at 
org.apache.kylin.query.enumerator.OLAPEnumerator.moveNext(OLAPEnumerator.java:71)
at Baz$1$1.moveNext(Unknown Source)
at 
org.apache.calcite.linq4j.EnumerableDefaults.groupBy_(EnumerableDefaults.java:819)
at 
org.apache.calcite.linq4j.EnumerableDefaults.groupBy(EnumerableDefaults.java:754)
at 
org.apache.calcite.linq4j.DefaultEnumerable.groupBy(DefaultEnumerable.java:302)
at Baz.bind(Unknown Source)
at 
org.apache.calcite.jdbc.CalcitePrepare$CalciteSignature.enumerable(CalcitePrepare.java:326)
at 
org.apache.calcite.jdbc.CalciteConnectionImpl.enumerable(CalciteConnectionImpl.java:281)
at 
org.apache.calcite.jdbc.CalciteMetaImpl._createIterable(CalciteMetaImpl.java:545)
at 
org.apache.calcite.jdbc.CalciteMetaImpl.createIterable(CalciteMetaImpl.java:536)
at 
org.apache.calcite.avatica.AvaticaResultSet.execute(AvaticaResultSet.java:187)
at 
org.apache.calcite.jdbc.CalciteResultSet.execute(CalciteResultSet.java:65)
at 
org.apache.calcite.jdbc.CalciteResultSet.execute(CalciteResultSet.java:44)
at 
org.apache.calcite.avatica.AvaticaConnection$1.execute(AvaticaConnection.java:566)
at 
org.apache.calcite.jdbc.CalciteMetaImpl.prepareAndExecute(CalciteMetaImpl.java:578)
at 
org.apache.calcite.avatica.AvaticaConnection.prepareAndExecuteInternal(AvaticaConnection.java:571)
at 
org.apache.calcite.avatica.AvaticaStatement.executeInternal(AvaticaStatement.java:135)
... 80 more

{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1717) Make job engine scheduler configurable

2016-05-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1717:
---

 Summary: Make job engine scheduler configurable
 Key: KYLIN-1717
 URL: https://issues.apache.org/jira/browse/KYLIN-1717
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Today the job engine scheduler is a simple implementation; need abstract it to 
decouple the implementation



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1747) Calculate all 0 (exception mandatory) cuboids

2016-05-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1747:
---

 Summary: Calculate all 0 (exception mandatory) cuboids
 Key: KYLIN-1747
 URL: https://issues.apache.org/jira/browse/KYLIN-1747
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine, Query Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v1.5.3


Kylin doesn't calculate all 0 combinations (exception mandatory dimensions) 
today; There wasn't specific consideration, but this may lead some query be 
slow.

Investigate to remove that design, and make sure all functions (including 
query, merge cuboid statistics, etc) won't be broken.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1763) Release v1.5.2.1

2016-06-04 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1763:
---

 Summary: Release v1.5.2.1
 Key: KYLIN-1763
 URL: https://issues.apache.org/jira/browse/KYLIN-1763
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1767) Update UI for TopN: specify encoding and multiple "group by"

2016-06-06 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1767:
---

 Summary: Update UI for TopN: specify encoding and multiple "group 
by"
 Key: KYLIN-1767
 URL: https://issues.apache.org/jira/browse/KYLIN-1767
 Project: Kylin
  Issue Type: Task
  Components: Web 
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.3


KYLIN-1478 and KYLIN-1693 need changes from the UI part. The sample json can be 
found in these two JIRAs;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1774) Update Kylin's tomcat version to 7.0.69

2016-06-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1774:
---

 Summary: Update Kylin's tomcat version to 7.0.69
 Key: KYLIN-1774
 URL: https://issues.apache.org/jira/browse/KYLIN-1774
 Project: Kylin
  Issue Type: Task
  Components: REST Service
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
Priority: Minor
 Fix For: v1.5.3






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1777) Streaming cube build shouldn't check working segment

2016-06-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1777:
---

 Summary: Streaming cube build shouldn't check working segment
 Key: KYLIN-1777
 URL: https://issues.apache.org/jira/browse/KYLIN-1777
 Project: Kylin
  Issue Type: Bug
  Components: streaming
Affects Versions: v1.5.3
Reporter: Shaofeng SHI
 Fix For: v1.5.3


A bug newly introduced with the changes of KYLIN-1744, reported by felixcui01 
in mailing list:

{code}
2016-06-07 17:55:51,024 ERROR [main StreamingCLI:106]: error start streaming 
java.lang.IllegalStateException: There is already a building segment! 
at 
org.apache.kylin.cube.CubeManager.checkNoBuildingSegment(CubeManager.java:567) 
at org.apache.kylin.cube.CubeManager.appendSegment(CubeManager.java:400) 
at 
org.apache.kylin.engine.streaming.cube.StreamingCubeBuilder.createBuildable(StreamingCubeBuilder.java:118)
 
at 
org.apache.kylin.engine.streaming.OneOffStreamingBuilder$1.run(OneOffStreamingBuilder.java:76)
 
at 
org.apache.kylin.engine.streaming.cli.StreamingCLI.startOneOffCubeStreaming(StreamingCLI.java:126)
 
at 
org.apache.kylin.engine.streaming.cli.StreamingCLI.main(StreamingCLI.java:100)

{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1780) Potential issue in SnapshotTable.equas()

2016-06-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1780:
---

 Summary: Potential issue in SnapshotTable.equas()
 Key: KYLIN-1780
 URL: https://issues.apache.org/jira/browse/KYLIN-1780
 Project: Kylin
  Issue Type: Bug
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


SnapshotTable.equas()



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1789) Couldn't use View as Lookup when join type is "inner"

2016-06-15 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1789:
---

 Summary: Couldn't use View as Lookup when join type is "inner"
 Key: KYLIN-1789
 URL: https://issues.apache.org/jira/browse/KYLIN-1789
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Affects Versions: v1.5.2, v1.5.2.1
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v1.5.3


Reported by Bhanu Mohanty in user mailing list:

I am using kylin-1.5.2.1

Added hive view as a look up table 

Getting  error at Build Dimension Dictionary
DEFAULT.kylin_intermediate_DEFAULT_*

If the join is "inner" 
It works when I changed the join to "left" 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1793) Job couldn't stop when hive commands got error with beeline

2016-06-16 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1793:
---

 Summary: Job couldn't stop when hive commands got error with 
beeline
 Key: KYLIN-1793
 URL: https://issues.apache.org/jira/browse/KYLIN-1793
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Affects Versions: v1.5.2, v1.5.1, v1.5.0
Reporter: Shaofeng SHI
Assignee: Dong Li


Configure Kylin to use beeline as the hive command line; submit a cube build 
job, the job moves to 100% with success, while I found there was error in the 
hive related steps, but the error wasn't captured by Kylin;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1795) bin/sample.sh doesn't work when configured hive client is beeline

2016-06-16 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1795:
---

 Summary: bin/sample.sh doesn't work when configured hive client is 
beeline
 Key: KYLIN-1795
 URL: https://issues.apache.org/jira/browse/KYLIN-1795
 Project: Kylin
  Issue Type: Bug
  Components: Client - CLI
Affects Versions: v1.5.0
Reporter: Shaofeng SHI
Priority: Minor


If configure to use beeline as hive client in kylin.properties, then run 
bin/sample.sh, it will fail to load data into hive, like:

0: jdbc:hive2://hiveserver:10> LOAD DATA LOCAL INPATH 'DEFAULT.KYLIN_SALES.csv' 
OVERWRITE INTO TABLE DEFAULT.KYLIN_SALES;
Error: Error while compiling statement: FAILED: SemanticException Line 1:23 
Invalid path ''DEFAULT.KYLIN_SALES.csv'': No files matching path 
file:/var/run/cloudera-scm-agent/process/1358-hive-HIVESERVER2/DEFAULT.KYLIN_SALES.csv
 (state=42000,code=4)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1797) bin/find-hive-dependency.sh may not get correct local classpath when configured hive client is beeline

2016-06-16 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1797:
---

 Summary: bin/find-hive-dependency.sh may not get correct local 
classpath when configured hive client is beeline
 Key: KYLIN-1797
 URL: https://issues.apache.org/jira/browse/KYLIN-1797
 Project: Kylin
  Issue Type: Bug
  Components: Client - CLI
Affects Versions: v1.3.0, v1.5.0
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1818) change kafka dependency to provided

2016-06-23 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1818:
---

 Summary: change kafka dependency to provided
 Key: KYLIN-1818
 URL: https://issues.apache.org/jira/browse/KYLIN-1818
 Project: Kylin
  Issue Type: Improvement
  Components: streaming
Affects Versions: v1.5.2
Reporter: Shaofeng SHI
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1831) Couldn't run "count (distinct pk)" on a lookup table

2016-06-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1831:
---

 Summary: Couldn't run "count (distinct pk)" on a lookup table
 Key: KYLIN-1831
 URL: https://issues.apache.org/jira/browse/KYLIN-1831
 Project: Kylin
  Issue Type: Bug
  Components: Query Engine
Affects Versions: v1.5.2
Reporter: Shaofeng SHI
Assignee: Dong Li


I have a SSB testing cube, in which ssb.customer is a lookup table, the cube 
has been built successfully (table snapshot exists); As KYLIN-1016 mentioned, 
count distinct should work even if there is no pre-defined measure; The SQL is: 
"select count (distinct c_custkey) from ssb.customer", the error log is:

{code}
Caused by: java.lang.IllegalStateException: Can't create EnumerableAggregate!
at 
org.apache.kylin.query.relnode.OLAPAggregateRel.implementEnumerable(OLAPAggregateRel.java:373)
at 
org.apache.kylin.query.relnode.OLAPRel$JavaImplementor.createEnumerable(OLAPRel.java:158)
at 
org.apache.kylin.query.relnode.OLAPRel$JavaImplementor.createEnumerable(OLAPRel.java:154)
at 
org.apache.kylin.query.relnode.OLAPToEnumerableConverter.implement(OLAPToEnumerableConverter.java:99)
at 
org.apache.calcite.adapter.enumerable.EnumerableRelImplementor.implementRoot(EnumerableRelImplementor.java:102)
at 
org.apache.calcite.adapter.enumerable.EnumerableInterpretable.toBindable(EnumerableInterpretable.java:92)
at 
org.apache.calcite.prepare.CalcitePrepareImpl$CalcitePreparingStmt.implement(CalcitePrepareImpl.java:1171)
at org.apache.calcite.prepare.Prepare.prepareSql(Prepare.java:297)
at org.apache.calcite.prepare.Prepare.prepareSql(Prepare.java:196)
at 
org.apache.calcite.prepare.CalcitePrepareImpl.prepare2_(CalcitePrepareImpl.java:721)
at 
org.apache.calcite.prepare.CalcitePrepareImpl.prepare_(CalcitePrepareImpl.java:588)
at 
org.apache.calcite.prepare.CalcitePrepareImpl.prepareSql(CalcitePrepareImpl.java:558)
at 
org.apache.calcite.jdbc.CalciteConnectionImpl.parseQuery(CalciteConnectionImpl.java:214)
at 
org.apache.calcite.jdbc.CalciteMetaImpl.prepareAndExecute(CalciteMetaImpl.java:573)
at 
org.apache.calcite.avatica.AvaticaConnection.prepareAndExecuteInternal(AvaticaConnection.java:571)
at 
org.apache.calcite.avatica.AvaticaStatement.executeInternal(AvaticaStatement.java:135)
... 80 more
Caused by: org.apache.calcite.rel.InvalidRelException: distinct aggregation not 
supported
at 
org.apache.calcite.adapter.enumerable.EnumerableAggregate.(EnumerableAggregate.java:71)
at 
org.apache.kylin.query.relnode.OLAPAggregateRel.implementEnumerable(OLAPAggregateRel.java:370)
... 95 more
{code}




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1838) Fix sample cube definition

2016-06-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1838:
---

 Summary: Fix sample cube definition
 Key: KYLIN-1838
 URL: https://issues.apache.org/jira/browse/KYLIN-1838
 Project: Kylin
  Issue Type: Bug
  Components: Documentation
Reporter: Shaofeng SHI
Priority: Minor
 Fix For: v1.5.3


Problem reported by Alberto Ramón  in user mailing 
list:

In example "learn_kylin", table name "DEFAULT.KYLIN_CATEGORY_GROUPINGS" appears 
2 times:



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1859) Use segment "uuid" instead of "name" to seek a segment across the system

2016-07-07 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1859:
---

 Summary: Use segment "uuid" instead of "name" to seek a segment 
across the system
 Key: KYLIN-1859
 URL: https://issues.apache.org/jira/browse/KYLIN-1859
 Project: Kylin
  Issue Type: Bug
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
Priority: Minor


The "uuid" is unique, while "name" isn't (name is composed by startTime, 
endTime). It's better to use "uuid" in the job chains.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1860) "Models" is very slow to show up

2016-07-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1860:
---

 Summary: "Models" is very slow to show up
 Key: KYLIN-1860
 URL: https://issues.apache.org/jira/browse/KYLIN-1860
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.2, v1.5.3
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.3


There is an "loading" icon, after about 1 minute it shows up, this is too slow; 
there must be something wrong. Pls see the screenshot



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1861) Hive may fail to create flat table with "GC overhead error"

2016-07-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1861:
---

 Summary: Hive may fail to create flat table with "GC overhead 
error"
 Key: KYLIN-1861
 URL: https://issues.apache.org/jira/browse/KYLIN-1861
 Project: Kylin
  Issue Type: Test
Reporter: Shaofeng SHI


in conf/kylin_hive_conf.xml, it enables 
"hive.auto.convert.join.noconditionaltask", which is the optimization about 
converting common join into mapjoin based on the input file size, and give a 
big value (3, 300MB) to 
"hive.auto.convert.join.noconditionaltask.size". So if the memory of the mapper 
isn't enough, it will cause the error.

To fix the error, user can reduce the  
"hive.auto.convert.join.noconditionaltask.size" in conf/kylin_hive_conf.xml, 
and then retry the job.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1862) "table not found" in "Build Dimension Dictionary" step

2016-07-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1862:
---

 Summary: "table not found" in "Build Dimension Dictionary" step
 Key: KYLIN-1862
 URL: https://issues.apache.org/jira/browse/KYLIN-1862
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Affects Versions: v1.5.3
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
 Fix For: v1.5.3


In a cube, I use a hive view as lookup table; The view name is 
"ssb.v_customer"; 

In the "Materialize Hive View in Lookup Tables" step, the log said:

Table default.kylin_intermediate_ssb_v_customer stats: [numFiles=2, 
numRows=180, totalSize=82661469, rawDataSize=169673761]

In the "Build Dimension Dictionary" step, it failed with message:

"DEFAULT.kylin_intermediate_DEFAULT_V_CUSTOMER table not found"



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1868) Page is blank with JS error

2016-07-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1868:
---

 Summary: Page is blank with JS error
 Key: KYLIN-1868
 URL: https://issues.apache.org/jira/browse/KYLIN-1868
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.3
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.3
 Attachments: Screen Shot 2016-07-11 at 11.08.21 AM.png

Try to create a new cube, in the "dimension" tab, the page is blank (see 
attachement) and there is JS error in console:

{code}
scripts.min.0.js:10159 TypeError: Cannot read property 'fact_table' of undefined
at Scope.$scope.initColumns 
(http://host:7070/kylin/js/scripts.min.0.js:42705:47)
at new  (http://host:7070/kylin/js/scripts.min.0.js:42770:16)

{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1871) Couldn't generate sample data for stream cube

2016-07-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1871:
---

 Summary: Couldn't generate sample data for stream cube
 Key: KYLIN-1871
 URL: https://issues.apache.org/jira/browse/KYLIN-1871
 Project: Kylin
  Issue Type: Bug
  Components: General
Affects Versions: v1.5.3
Reporter: Shaofeng SHI
Priority: Minor
 Fix For: v1.5.3


Trying to build sample streaming cube with the doc 
https://kylin.apache.org/docs15/tutorial/cube_streaming.html , but got error 
when generating the sample data:

 ./bin/kylin.sh org.apache.kylin.source.kafka.util.KafkaSampleProducer --topic 
kylin_demo --broker sandbox:6667 —delay 0

Exception in thread "main" java.lang.NoClassDefFoundError: 
kafka/producer/ProducerConfig
at 
org.apache.kylin.source.kafka.util.KafkaSampleProducer.main(KafkaSampleProducer.java:102)
Caused by: java.lang.ClassNotFoundException: kafka.producer.ProducerConfig
at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
... 1 more

it should be related with KYLIN-1818



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1902) 404 error on get /kylin/models

2016-07-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1902:
---

 Summary: 404 error on get /kylin/models
 Key: KYLIN-1902
 URL: https://issues.apache.org/jira/browse/KYLIN-1902
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.2
Reporter: Shaofeng SHI
Assignee: Zhong,Jason
 Fix For: v1.5.3


Request URL:http://localhost:7070/kylin/models
Request Method:GET
Status Code:404 Not Found
Remote Address:[::1]:7070

I think this issue has existed for some time; Although it doesn't impact on 
user experience, an 404 error in web console is confusing. @Jason Can we fix it 
in v1.5.3?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1911) NPE when extended column has NULL value

2016-07-21 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1911:
---

 Summary: NPE when extended column has NULL value
 Key: KYLIN-1911
 URL: https://issues.apache.org/jira/browse/KYLIN-1911
 Project: Kylin
  Issue Type: Bug
  Components: Query Engine
Affects Versions: v1.5.2, v1.5.3, v1.5.2.1
Reporter: Shaofeng SHI
Assignee: hongbin ma



{code}

Caused by: java.lang.NullPointerException
at java.lang.String.(String.java:505)
at 
org.apache.kylin.measure.extendedcolumn.ExtendedColumnMeasureType$2.reload(ExtendedColumnMeasureType.java:152)
at 
org.apache.kylin.storage.hbase.cube.v2.CubeTupleConverter.translateResult(CubeTupleConverter.java:175)
at 
org.apache.kylin.storage.hbase.cube.v2.SequentialCubeTupleIterator.hasNext(SequentialCubeTupleIterator.java:116)
at 
org.apache.kylin.query.enumerator.OLAPEnumerator.moveNext(OLAPEnumerator.java:74)
at Baz$1$1.moveNext(Unknown Source)
at 
org.apache.calcite.linq4j.EnumerableDefaults.groupBy_(EnumerableDefaults.java:819)
at 
org.apache.calcite.linq4j.EnumerableDefaults.groupBy(EnumerableDefaults.java:754)
at 
org.apache.calcite.linq4j.DefaultEnumerable.groupBy(DefaultEnumerable.java:302)
at Baz.bind(Unknown Source)

{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1917) TopN counter merge performance improvement

2016-07-22 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1917:
---

 Summary: TopN counter merge performance improvement
 Key: KYLIN-1917
 URL: https://issues.apache.org/jira/browse/KYLIN-1917
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Affects Versions: v1.5.2, v1.5.3
Reporter: Shaofeng SHI
 Fix For: Backlog


Observed the reduce phase of cube build is slow when there is TopN counter. 
There should be room for performance improvement.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1926) Loosen the constraint on FK-PK data type matching

2016-07-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1926:
---

 Summary: Loosen the constraint on FK-PK data type matching
 Key: KYLIN-1926
 URL: https://issues.apache.org/jira/browse/KYLIN-1926
 Project: Kylin
  Issue Type: Improvement
  Components: Metadata
Affects Versions: all
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI
Priority: Minor
 Fix For: v1.5.4


If lookup table's PK datatype isn't equal to fact table's FK datatype, Kylin 
will report error saying "Primary key are not consistent with Foreign key". 
This constraint is too strong. Should allow user to disable this check.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1929) Error to load slow query in "Monitor" page for non-admin user

2016-07-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1929:
---

 Summary: Error to load slow query in "Monitor" page for non-admin 
user
 Key: KYLIN-1929
 URL: https://issues.apache.org/jira/browse/KYLIN-1929
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v1.5.3
Reporter: Shaofeng SHI
Assignee: Dong Li
 Fix For: v1.5.4
 Attachments: Screen Shot 2016-07-29 at 4.47.11 PM.png

Login Kylin with a non-admin account, click to "Monitor" tab, an error dialog 
shows "Failed to load query" (see screenshot)

In browser console, a 403 error:
 GET http://host:7070/kylin/api/diag/project/sql 403 (Forbidden)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1933) Deploy in cluster mode, the "query" node report "scheduler has not been started" every second

2016-08-02 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1933:
---

 Summary: Deploy in cluster mode, the "query" node report 
"scheduler has not been started" every second 
 Key: KYLIN-1933
 URL: https://issues.apache.org/jira/browse/KYLIN-1933
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Affects Versions: v1.5.3
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (KYLIN-1978) kylin.sh compatible issue on Ubuntu

2016-08-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1978:
---

 Summary: kylin.sh compatible issue on Ubuntu
 Key: KYLIN-1978
 URL: https://issues.apache.org/jira/browse/KYLIN-1978
 Project: Kylin
  Issue Type: Bug
  Components: Environment 
Affects Versions: v1.5.3
Reporter: Shaofeng SHI
Assignee: hongbin ma


Reported by Marcelo(marcelo.n...@quantium.com.au) in the mailing list:

DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=14.04
DISTRIB_CODENAME=trusty
DISTRIB_DESCRIPTION="Ubuntu 14.04.4 LTS"
NAME="Ubuntu"
VERSION="14.04.4 LTS, Trusty Tahr"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu 14.04.4 LTS"
VERSION_ID="14.04"
HOME_URL="http://www.ubuntu.com/";
SUPPORT_URL="http://help.ubuntu.com/";
BUG_REPORT_URL="http://bugs.launchpad.net/ubuntu/";

mapr@qtausc-vpcsdev04:~/kylin/apache-kylin-1.5.3-HBase1.x-bin/bin$ ./kylin.sh
start
*KYLIN_HOME is set to /home/mapr/kylin/apache-kylin-1.5.3-HBase1.x-bin*
cat: invalid option -- '1'
Try 'cat --help' for more information.
-mkdir: Not enough arguments: expected 1 but got 0
Usage: hadoop fs [generic options] -mkdir [-p]  ...
failed to create , Please make sure the user has right to access


That is what is happening when I try to start kylin.
I traced the error and the first one come from get-properties.sh at these
line
for i in `cat ${KYLIN_HOME}/conf/kylin.properties | grep -w "^$1" | grep -v
'^#' | awk -F= '{ n = index($0,"="); print substr($0,n+1)}' | cut -c 1-`

and as you can see kylin home is set.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


  1   2   3   4   >