Re: [Draft][REPORT] Apache Kylin - May 2017

2017-05-03 Thread ShaoFeng Shi
+1 Looks good to me;

2017-05-04 13:25 GMT+08:00 Luke Han :

> Dear community,
>  I have drafted below board report for review, please help to check and
> let me know if there's any issue.
>  Feel free to reply here if there are more activities, conference,
> meetup and other events, also community development and others which should
> be included in this report.
>
>  Will submit this report to board later.
>
>  Thanks.
>
> Luke
>
>
> ## Description:
> Apache Kylin is an open source Distributed Analytics Engine designed
> to provide SQL interface and multi-dimensional analysis (OLAP) on
> Hadoop supporting extremely large datasets.
>
>
> ## Issues:
> - there are no issues requiring board attention at this time
>
> ## Activity:
> - Yang Li presented Apache Kylin 2.0 features
> at Strata Hadoop World San Jose on 2017-03-16
> - Luke Han presented Apache Kylin open source
> at OSCAR Beijing on 2017-04-19
> - Apache Kylin Meetup @Toutiao in Beijing
> hosted on 2017-04-29 with 200 attendees onsite
> and 200 online
> - Chaozhong Yang presented Apache Kylin use case
> in Toutiao at above Meetup
> at OSCAR Beijing on 2017-04-19
> - Dong Li presented Apache Kylin
> at OSC China Fujian on 2017-02-25
> - Dong Li presented Apache Kylin
> at OSC China Xiamen on 2017-02-26
> - Dong Li presented Apache Kylin
> at OSC China Wuhan on 2017-04-15
> - Dong Li presented Apache Kylin
> at OSC China Changsha on 2017-04-16
> - Hongbin Ma presented Apache Kylin
> at NJSDGlobal Nanjing on 2017-04-21
> - Chen Wang presented Apache Kylin
> at DBAPlus Meetup China Shanghai on 2017-04-08
> - Roger Shi presented Apache Kylin
> at SQL on Hadoop Meetup Shanghai on 2017-04-29
>
> ## PMC changes:
>
> - Currently 18 PMC members.
> - No new PMC members added in the last 3 months
> - Last PMC addition was Dong Li on Mon Apr 11 2016
>
> ## Committer base changes:
>
> - Currently 28 committers.
> - New commmitters:
> - Alberto Ramón was added as a committer on Thu Apr 27 2017
> - Zhixiong Chen was added as a committer on Thu Apr 27 2017
> - Roger Shi was added as a committer on Thu Apr 27 2017
>
> ## Releases:
>
> - 2.0.0 was released on Sat Apr 29 2017
>
> ## Mailing list activity:
>
> - dev@kylin.apache.org:
> - 334 subscribers (up 23 in the last 3 months):
> - 809 emails sent to list (900 in previous quarter)
>
> - iss...@kylin.apache.org:
> - 61 subscribers (up 9 in the last 3 months):
> - 923 emails sent to list (1525 in previous quarter)
>
> - u...@kylin.apache.org:
> - 270 subscribers (up 52 in the last 3 months):
> - 239 emails sent to list (320 in previous quarter)
>
> ## JIRA activity:
>
> - 164 JIRA tickets created in the last 3 months
> - 151 JIRA tickets closed/resolved in the last 3 months
>



-- 
Best regards,

Shaofeng Shi 史少锋


Re: How do I deploy kylin when I have a Hadoop environment!

2017-05-03 Thread Li Feng
Hi John,

Pls refer to this install guide: 
http://kylin.apache.org/cn/docs20/install/index.html

BR,
Lee.

在 17/5/4 12:48,“john-126” 写入:

Dear Dev:

   How do I deploy kylin when I have a Hadoop environment!Please give
me a plan and suggestion。

   

   My Hadoop environment:

   hadoop version:2.6.4

   Linux OS:centos 6.8

   JDK:1.8.131

 

--

Best regards!

John.xiong

 





[Draft][REPORT] Apache Kylin - May 2017

2017-05-03 Thread Luke Han
Dear community,
 I have drafted below board report for review, please help to check and
let me know if there's any issue.
 Feel free to reply here if there are more activities, conference,
meetup and other events, also community development and others which should
be included in this report.

 Will submit this report to board later.

 Thanks.

Luke


## Description:
Apache Kylin is an open source Distributed Analytics Engine designed
to provide SQL interface and multi-dimensional analysis (OLAP) on
Hadoop supporting extremely large datasets.


## Issues:
- there are no issues requiring board attention at this time

## Activity:
- Yang Li presented Apache Kylin 2.0 features
at Strata Hadoop World San Jose on 2017-03-16
- Luke Han presented Apache Kylin open source
at OSCAR Beijing on 2017-04-19
- Apache Kylin Meetup @Toutiao in Beijing
hosted on 2017-04-29 with 200 attendees onsite
and 200 online
- Chaozhong Yang presented Apache Kylin use case
in Toutiao at above Meetup
at OSCAR Beijing on 2017-04-19
- Dong Li presented Apache Kylin
at OSC China Fujian on 2017-02-25
- Dong Li presented Apache Kylin
at OSC China Xiamen on 2017-02-26
- Dong Li presented Apache Kylin
at OSC China Wuhan on 2017-04-15
- Dong Li presented Apache Kylin
at OSC China Changsha on 2017-04-16
- Hongbin Ma presented Apache Kylin
at NJSDGlobal Nanjing on 2017-04-21
- Chen Wang presented Apache Kylin
at DBAPlus Meetup China Shanghai on 2017-04-08
- Roger Shi presented Apache Kylin
at SQL on Hadoop Meetup Shanghai on 2017-04-29

## PMC changes:

- Currently 18 PMC members.
- No new PMC members added in the last 3 months
- Last PMC addition was Dong Li on Mon Apr 11 2016

## Committer base changes:

- Currently 28 committers.
- New commmitters:
- Alberto Ramón was added as a committer on Thu Apr 27 2017
- Zhixiong Chen was added as a committer on Thu Apr 27 2017
- Roger Shi was added as a committer on Thu Apr 27 2017

## Releases:

- 2.0.0 was released on Sat Apr 29 2017

## Mailing list activity:

- dev@kylin.apache.org:
- 334 subscribers (up 23 in the last 3 months):
- 809 emails sent to list (900 in previous quarter)

- iss...@kylin.apache.org:
- 61 subscribers (up 9 in the last 3 months):
- 923 emails sent to list (1525 in previous quarter)

- u...@kylin.apache.org:
- 270 subscribers (up 52 in the last 3 months):
- 239 emails sent to list (320 in previous quarter)

## JIRA activity:

- 164 JIRA tickets created in the last 3 months
- 151 JIRA tickets closed/resolved in the last 3 months


[jira] [Created] (KYLIN-2584) Always use embeded spark for spark-cubing

2017-05-03 Thread Dong Li (JIRA)
Dong Li created KYLIN-2584:
--

 Summary: Always use embeded spark for spark-cubing
 Key: KYLIN-2584
 URL: https://issues.apache.org/jira/browse/KYLIN-2584
 Project: Kylin
  Issue Type: Improvement
  Components: Spark Engine
Affects Versions: v2.0.0
Reporter: Dong Li
Assignee: Shaofeng SHI


We've embeded spark binaries in kylin package since 2.0.0. But still allowed to 
use other SPARK_HOME defined in environment.

If this SPARK version is not comtipable with kylin 2.0, for example, spark 2.0, 
which affects kylin classpath and block startup of kylin service.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


Re: Adding support for JDBC

2017-05-03 Thread Luke Han
Hi Luis,
 Why not landing data into Hive first from other RDBMs?

 There are some options will come soon to support other data source,
please check our JIRA for detail.

 Thanks.
Luke


Best Regards!
-

Luke Han

On Tue, May 2, 2017 at 12:08 PM, Luis Dominguez 
wrote:

> Any plans to add support for JDBC to expand source options? Currently, it
> only supports Hive.  However,  if I want to use a different data source,
>  having a JDBC option would open up options.
>
>
> -Regards,
>
> *Luis Dominguez*
> Data Warehouse Manager/Architect - Business Intelligence
> XO Group Inc. (NYSE: XOXO)
> *TheKnot.com *  |  TheNest.com
>   |  TheBump.com
> 
> P (512) 498.3317
>


kylin sso配置

2017-05-03 Thread zptang

我想问一下,如何配置 kylin基于cas实现sso



Adding support for JDBC

2017-05-03 Thread Luis Dominguez
Any plans to add support for JDBC to expand source options? Currently, it
only supports Hive.  However,  if I want to use a different data source,
 having a JDBC option would open up options.


-Regards,

*Luis Dominguez*
Data Warehouse Manager/Architect - Business Intelligence
XO Group Inc. (NYSE: XOXO)
*TheKnot.com *  |  TheNest.com
  |  TheBump.com

P (512) 498.3317


kylin sso

2017-05-03 Thread zptang
Hello
Boy!
I want to know How to configure kylin based on cas implementation sso? Could 
you help me?


How do I deploy kylin when I have a Hadoop environment!

2017-05-03 Thread john-126
Dear Dev:

   How do I deploy kylin when I have a Hadoop environment!Please give
me a plan and suggestion。

   

   My Hadoop environment:

   hadoop version:2.6.4

   Linux OS:centos 6.8

   JDK:1.8.131

 

--

Best regards!

John.xiong

 



where is the source code kylin-2.0.0-hbase1x

2017-05-03 Thread xl l
 hi,all:
In http://kylin.apache.org/download/
apache-kylin-2.0.0-bin-hbase098.tar.gz  source code in:
https://github.com/apache/kylin/  tag:* kylin-2.0.0-hbase0.98* 。
But  I can't find   tag :  *apache-kylin-2.0.0-bin-hbase1x * in github

so where  is the source code* apache-kylin-2.0.0-hbase1x*?

which hbase version is tag* kylin-2.0.0 * ?



-- 
* Best Wishes*


Re: 回复: Error while executing SQL "select count(*) as nums,courseid from optionaction group by courseid LIMIT 50000": null

2017-05-03 Thread ShaoFeng Shi
Cool! thanks for the update.

2017-05-02 16:19 GMT+08:00 35925...@qq.com <35925...@qq.com>:

> 问题已经找到
> 原因是我的hive中的javax.jdo.option.ConnectionURL 的连接默认字符集为latin1
> mysql中character_set_database 的值为latin1
>而其他值,例如character_set_client、character_set_connection、
> character_set_results、character_set_server均为utf8
>
>   将所有的字符集均修改为latin1,重新build cube,再查询,就没有原来的错误了,可以查到结果
>
>
>
> 35925...@qq.com
>
> 发件人: 35925138
> 发送时间: 2017-05-02 10:39
> 收件人: dev
> 主题: 回复:答复: 回复:答复: Error while executing SQL "select count(*) as
> nums,courseid from optionaction group by courseid LIMIT 5": null
> 一下是执行一条sql的全部日志,请帮忙分析,只要执行任意一条用到cube索引的sql,均会报错。普通的数据查询,例如select * from
> table ,没有问题。
>
> 2017-05-02 10:30:32,303 INFO  [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> service.QueryService:336 : Using project: optionaction
> 2017-05-02 10:30:32,303 INFO  [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> service.QueryService:337 : The original query:  select count(*),fdz from
> useraction group by fdz
> 2017-05-02 10:30:32,304 INFO  [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> service.QueryService:440 : The corrected query: select count(*),fdz from
> useraction group by fdz
> LIMIT 10
> 2017-05-02 10:30:32,312 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> sql.parser:546 : Reduced COUNT(*)
> 2017-05-02 10:30:32,313 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> sql.parser:546 : Reduced FDZ
> 2017-05-02 10:30:32,313 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> sql.parser:546 : Reduced FDZ
> 2017-05-02 10:30:32,314 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> sql.parser:546 : Reduced SELECT COUNT(*), `FDZ`
> FROM `USERACTION`
> GROUP BY `FDZ`
> 2017-05-02 10:30:32,318 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> calcite.sql2rel:552 : Plan after converting SqlNode to RelNode
> LogicalSort(fetch=[10])
>   LogicalProject(EXPR$0=[$1], FDZ=[$0])
> LogicalAggregate(group=[{0}], EXPR$0=[COUNT()])
>   LogicalProject(FDZ=[$1])
> OLAPTableScan(table=[[DEFAULT, USERACTION]], fields=[[0, 1, 2]])
>
> 2017-05-02 10:30:32,322 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:361 : For final plan, using rel#175:LogicalSort.NONE.[](
> input=HepRelVertex#174,fetch=10)
> 2017-05-02 10:30:32,322 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:361 : For final plan, using
> rel#173:LogicalProject.NONE.[](input=HepRelVertex#172,EXPR$0=$1,FDZ=$0)
> 2017-05-02 10:30:32,323 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:361 : For final plan, using
> rel#171:LogicalAggregate.NONE.[](input=HepRelVertex#170,
> group={0},EXPR$0=COUNT())
> 2017-05-02 10:30:32,323 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:361 : For final plan, using
> rel#169:LogicalProject.NONE.[](input=HepRelVertex#168,FDZ=$1)
> 2017-05-02 10:30:32,323 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:361 : For final plan, using
> rel#154:OLAPTableScan.OLAP.[](table=[DEFAULT, USERACTION],fields=[0, 1,
> 2])
> 2017-05-02 10:30:32,325 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:829 : PLANNER = org.apache.calcite.plan.
> volcano.VolcanoPlanner@5421a86f; TICK = 1/1; PHASE = PRE_PROCESS_MDR;
> COST = {inf}
> 2017-05-02 10:30:32,325 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:829 : PLANNER = org.apache.calcite.plan.
> volcano.VolcanoPlanner@5421a86f; TICK = 2/1; PHASE = PRE_PROCESS; COST =
> {inf}
> 2017-05-02 10:30:32,325 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:829 : PLANNER = org.apache.calcite.plan.
> volcano.VolcanoPlanner@5421a86f; TICK = 3/1; PHASE = OPTIMIZE; COST =
> {inf}
> 2017-05-02 10:30:32,326 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:503 : Pop match: rule [EnumerableSortRule] rels
> [rel#166:LogicalSort.NONE.[](input=rel#165:Subset#3.NONE.[],fetch=10)]
> 2017-05-02 10:30:32,326 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:194 : call#804: Apply rule [EnumerableSortRule] to
> [rel#166:LogicalSort.NONE.[](input=rel#165:Subset#3.NONE.[],fetch=10)]
> 2017-05-02 10:30:32,326 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:217 : call#804 generated 0 successors.
> 2017-05-02 10:30:32,326 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:829 : PLANNER = org.apache.calcite.plan.
> volcano.VolcanoPlanner@5421a86f; TICK = 4/2; PHASE = OPTIMIZE; COST =
> {inf}
> 2017-05-02 10:30:32,326 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:503 : Pop match: rule [OLAPSortRule] rels
> [rel#166:LogicalSort.NONE.[](input=rel#165:Subset#3.NONE.[],fetch=10)]
> 2017-05-02 10:30:32,327 DEBUG [Query c9bf885b-3e1c-4e6f-81ad-d0a98c176a71-75]
> plan.RelOptPlanner:194 : call#795: Apply rule [OLAPSortRule] to
> 

Re: Cube build error on Step 7 Build Base Cuboid Data

2017-05-03 Thread ShaoFeng Shi
Lufeng,

Thanks for the update. I also received reporting about this error recently,
but no finding. Is it because the intermediate table be empty (no data in
the selected time range)? As Kylin allows empty segment, this should not
block the build, please feel free to open a JIRA to us.


[jira] [Created] (KYLIN-2583) Code refactor, move data source statement to query module

2017-05-03 Thread Yifan Zhang (JIRA)
Yifan Zhang created KYLIN-2583:
--

 Summary: Code refactor, move data source statement to query module
 Key: KYLIN-2583
 URL: https://issues.apache.org/jira/browse/KYLIN-2583
 Project: Kylin
  Issue Type: Improvement
Reporter: Yifan Zhang
Assignee: Yifan Zhang
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


Re: Why 'is null' is not evaluable?

2017-05-03 Thread magang
Yang wrote
> Looks like a bug to me.
> 
> On Thu, Apr 27, 2017 at 3:52 PM, magang 

> mg4work@

>  wrote:
> 
>> Hi,
>>
>> When sql contains filter: 'is null'/'is not null', the filter will not be
>> push down to coprocessor, because the CompareTupleFilter.isEvaluable()
>> method return false, is it by design or just a bug?
>>
>> related code is here:
>> https://github.com/apache/kylin/blob/master/core-
>> metadata/src/main/java/org/apache/kylin/metadata/filter/
>> CompareTupleFilter.java#L216
>>
>> --
>> View this message in context: http://apache-kylin.74782.x6.
>> nabble.com/Why-is-null-is-not-evaluable-tp7796.html
>> Sent from the Apache Kylin mailing list archive at Nabble.com.
>>

Thanks Yang! Will log a ticket to track it

--
View this message in context: 
http://apache-kylin.74782.x6.nabble.com/Why-is-null-is-not-evaluable-tp7796p7861.html
Sent from the Apache Kylin mailing list archive at Nabble.com.