Re: [VOTE] Release apache-kylin-2.1.0 (RC1)

2017-08-08 Thread ShaoFeng Shi
Yeah I already did that.

2017-08-08 14:25 GMT+08:00 Luke Han :

> Hi Shaofeng, when canceling one vote, please also update in the original
> thread
>
>
> Best Regards!
> -
>
> Luke Han
>
> On Mon, Aug 7, 2017 at 9:52 AM, 杨浩  wrote:
>
> > +1
> >
> > 2017-08-06 20:39 GMT+08:00 Li Yang :
> >
> > > +1
> > >
> > > mvn test passed
> > >
> > > java version "1.7.0_95"
> > > OpenJDK Runtime Environment (rhel-2.6.4.0.el6_7-x86_64 u95-b00)
> > > OpenJDK 64-Bit Server VM (build 24.95-b01, mixed mode)
> > >
> > >
> > >
> > > On Sat, Aug 5, 2017 at 6:37 PM, ShaoFeng Shi 
> > > wrote:
> > >
> > > > Hi all,
> > > >
> > > > I have created a build for Apache Kylin 2.1.0, release candidate 1.
> > > >
> > > > Changes highlights:
> > > > KYLIN-2506 - Refactor global dictionary
> > > > KYLIN-2575 - Experimental feature: Computed Column
> > > > KYLIN-2579 KYLIN-2580  - Improvement on subqueries
> > > > KYLIN-2633 - Upgrade Spark to 2.1
> > > > KYLIN-2646 - Project level query authorization
> > > >
> > > > And more than 60 bug fixes.
> > > >
> > > > Thanks to everyone who has contributed to this release.
> > > > Here’s release notes:
> > > > https://issues.apache.org/jira/secure/ReleaseNote.jspa?
> > > > projectId=12316121&version=12340443
> > > >
> > > > The commit to be voted upon:
> > > >
> > > > https://github.com/apache/kylin/commit/
> 47b5a0ded63e721736dbe6c4ecf1f0
> > > > 2b0b97ba43
> > > >
> > > > Its hash is 47b5a0ded63e721736dbe6c4ecf1f02b0b97ba43.
> > > >
> > > > The artifacts to be voted on are located here:
> > > > https://dist.apache.org/repos/dist/dev/kylin/apache-kylin-2.1.0-rc1/
> > > >
> > > > (The binary packages for HBase 1.x and CDH 5.7are also provided for
> > > > testing)
> > > >
> > > > The hashes of the artifacts are as follows:
> > > > apache-kylin-2.1.0-src.tar.gz.md5 bb0458bff380f0670ccea57773d809d9
> > > > apache-kylin-2.1.0-src.tar.gz.sha1 5e5eebacdd72ded25508c4947565a1
> > > > ef48784ddb
> > > >
> > > > A staged Maven repository is available for review at:
> > > > https://repository.apache.org/content/repositories/
> > orgapachekylin-1042/
> > > >
> > > > Release artifacts are signed with the following key:
> > > > https://people.apache.org/keys/committer/shaofengshi.asc
> > > >
> > > > Please vote on releasing this package as Apache Kylin 2.1.0.
> > > >
> > > > The vote is open for the next 72 hours and passes if a majority of
> > > > at least three +1 PPMC votes are cast.
> > > >
> > > > [ ] +1 Release this package as Apache Kylin 2.1.0
> > > > [ ]  0 I don't feel strongly about it, but I'm okay with the release
> > > > [ ] -1 Do not release this package because...
> > > >
> > > > Here is my vote:
> > > >
> > > > +1 (binding)
> > > >
> > > > --
> > > > Best regards,
> > > >
> > > > Shaofeng Shi 史少锋
> > > >
> > >
> >
>



-- 
Best regards,

Shaofeng Shi 史少锋


The REST query request failed

2017-08-08 Thread shendandan
Hello!
 I use tableau as kylin's visualization tool. And I want to know can I 
create calculated field in tableau to select?
" if [还款逾期天数] >90 then [本息余额_1] ELSE 0 END " this is my calculated field 
but there is a error when I used the calculated field.
the error is :
   [Kylin][ODBC 1.0(w) Driver]The REST query request failed, the error message 
is: Error while executing SQL "SELECT SUM((CASE WHEN 
("BASEINFO_REPAY_HIS_FACT_PARTATION"."CUR_LATEDAYS" > 90) THEN 
"BASEINFO_REPAY_HIS_FACT_PARTATION"."REMAIN_CAPITAL_INTEREST" ELSE 0 END)) AS 
"sum_Calculation_FCFAIAHBFAGFBCEH_ok" FROM 
"TN_QUERY"."BASEINFO_REPAY_HIS_FACT_PARTATION" 
"BASEINFO_REPAY_HIS_FACT_PARTATION" LEFT JOIN "TN_QUERY"."DIM_BP_PRODUCT" 
"DIM_BP_PRODUCT" ON ("BASEINFO_REPAY_HIS_FACT_PARTATION"."PRODUCT_CODE" = 
"DIM_BP_PRODUCT"."PRODUCT_CODE") LEFT JOIN "TN_QUERY"."DIM_BP_SALES_CHANNEL" 
"DIM_BP_SALES_CHANNEL" ON ("BASEINFO_REPAY_HIS_FACT_PARTATION"."BUSINESS_TYPE" 
= "DIM_BP_SALES_CHANNEL"."CHANNEL_CODE") LEFT JOIN 
"TN_QUERY"."DIM_FUND_CHANNEL" "DIM_FUND_CHANNEL" ON 
("BASEINFO_REPAY_HIS_FACT_PARTATION"."ROOT_CODE" = 
"DIM_FUND_CHANNEL"."FUND_CHANNEL_CODE") LEFT JOIN "TN_QUERY"."DIM_ORG" 
"DIM_ORG" ON ("BASEINFO_REPAY_HIS_FACT_PARTATION"."BRANCH_CODE" = 
"DIM_ORG"."BRANCH_CODE") ": Can't find any realization. Please confirm with 
providers. SQL digest: fact table 
TN_QUERY.BASEINFO_REPAY_HIS_FACT_PARTATION,group by [],filter on [],with 
aggregates[FunctionDesc [expression=SUM, parameter=null, returnType=null]].

Re: sql语句中当in的列表超过一定数据量的时候性能突然变差,如何解决?

2017-08-08 Thread wangzy24
Found by looking at the source code by kylin.query.scan.fuzzykey.max
attribute control, the default is 200, when conditions combined with more
than 200, it will range scan, if less than this value will be according to
the rowkey to obtain, in kylin. If you want to modify the properties of
adding kylin.query.scan.fuzzykey.max attribute

--
View this message in context: 
http://apache-kylin.74782.x6.nabble.com/sql-in-tp8630p8665.html
Sent from the Apache Kylin mailing list archive at Nabble.com.


Re: How to migrate cube from one Kylin to another (for example, Kylin QA to Kylin Production) within the different Hadoop Cluster

2017-08-08 Thread wangzy24
Not to do the whole data migration, it is iterative development, the demand
upgrade, the part of the metadata that we want to migrate, what could be the
solution?



shaofengshi wrote
> Kylin metadata migration is very simple, just follow the "backup/restore
> metadata" guide.
> 
> But if you want to have the Cube data also be moved to new cluster, you
> need migrate:
> 
> 1) Hive tables (metadata and files);
> 2) Files in Kylin HDFS working dir;
> 3) HBase tables (KYLIN_.*)
> 
> There is no a tool to automate these steps. You need do that by your own.
> 
> 2017-08-03 15:06 GMT+08:00 wangzy24 <

> wangzy24@

> >:
> 
>> What would you do if you wanted to do metadata migration between
>> completely
>> independent environments within the different Hadoop Cluster?
>>
>> --
>> View this message in context: http://apache-kylin.74782.x6.
>> nabble.com/How-to-migrate-cube-from-one-Kylin-to-
>> another-for-example-Kylin-QA-to-Kylin-Production-within-
>> the-dir-tp8628.html
>> Sent from the Apache Kylin mailing list archive at Nabble.com.
>>
> 
> 
> 
> -- 
> Best regards,
> 
> Shaofeng Shi 史少锋



--
View this message in context: 
http://apache-kylin.74782.x6.nabble.com/How-to-migrate-cube-from-one-Kylin-to-another-for-example-Kylin-QA-to-Kylin-Production-within-the-dir-tp8628p8666.html
Sent from the Apache Kylin mailing list archive at Nabble.com.


[jira] [Created] (KYLIN-2779) Segment size estimation is wrong for bitmap measure

2017-08-08 Thread Alexander Sterligov (JIRA)
Alexander Sterligov created KYLIN-2779:
--

 Summary: Segment size estimation is wrong for bitmap measure
 Key: KYLIN-2779
 URL: https://issues.apache.org/jira/browse/KYLIN-2779
 Project: Kylin
  Issue Type: Bug
Affects Versions: v2.0.0
Reporter: Alexander Sterligov


If bitmap is used for distinct count, then size estimation is wrong for about 4 
times.

The problem is fixed by kylin.cube.size-estimate-countdistinct-ratio=0.2.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[Draft][REPORT] Apache Kylin - August 2017

2017-08-08 Thread Luke Han
Hi all,

 Here's Kylin board report for this cycle, please help to review and
leave your feedback here.


  Thanks.


## Description:
Apache Kylin is an open source Distributed Analytics Engine designed
to provide SQL interface and multi-dimensional analysis (OLAP) on
Hadoop supporting extremely large datasets.

## Issues:
- there are no issues requiring board attention at this time

## Activity:

- Hadoop & Kylin Meetup hosted in Beijing on 2017-07-14

- Alberto Ramón presented Kylin topic

at Docker Meetup on 2017-06-15

- Yang Li presented Keynote about Apache Kylin
at Tableau User Conf Shenzhen on 2017-07-12

- Luke Han presented Keynote about Apache Kylin
at Strata Hadoop World Beijing on 2017-07-15

- Dong Li presented Apache Kylin  2.x
at Strata Hadoop World Beijing on 2017-07-15

- Luke Han presented Keynote about Apache Kylin
at Strata Hadoop World Beijing on 2017-07-15

- Kaige Liu presented Kylin on AWS
at AWS Summit China in Beijing on 2017-07-26

- Dayue Gao presented Kylin topic
at Druid Meetup Beijing on 2017-08-05



## PMC changes:

- Currently 18 PMC members.
- No new PMC members added in the last 3 months

## Committer base changes:

- Currently 28 committers
- No new committers in the last 3 months.

- Last committer addition: Fri Apr 28 2017 (Alberto Ramón)

## Releases:

- Last release was 2.0.0, released on Sun Apr 30 2017

## Mailing list activity:

- dev@kylin.apache.org:
- 370 subscribers (up 33 in the last 3 months)
- 778 emails sent in the past 3 months, 794 in the previous cycle

- iss...@kylin.apache.org:
- 71 subscribers (up 9 in the last 3 months)
- 1258 emails sent in the past 3 months, 921 in the previous cycle

- u...@kylin.apache.org:
- 286 subscribers (up 10 in the last 3 months)
- 327 emails sent in the past 3 months, 232 in the previous cycle

## JIRA activity:

- 180 JIRA tickets created in the last 3 months
- 124 JIRA tickets closed/resolved in the last 3 months


Use AWS Redshift as Data Source for Apache Kylin

2017-08-08 Thread bsmithab
Hi 

We are looking at option of Using Redshift as data source to create OLAP in
Apache Kylin. Is this possible? if Yes Can we install the apache Kylin on
the Redshift cluster server or Is it mandatory that we need to have hadoop
structure as base on which Apache Kyling can be installed. If we have to use
Redshift as data source what is the approach we need to take. Please guide 

--
View this message in context: 
http://apache-kylin.74782.x6.nabble.com/Use-AWS-Redshift-as-Data-Source-for-Apache-Kylin-tp8669.html
Sent from the Apache Kylin mailing list archive at Nabble.com.


Re: Use AWS Redshift as Data Source for Apache Kylin

2017-08-08 Thread Luke Han
Hi there,
  Today, Kylin only supports ready data from Hive or Kafka.
It doable if you could find a way to expose your data through
Hive interface.
   Or, you could customize a little bit to read data from Redshift.

   And, one idea we have (actually there are many other users are doing
today)
is to setup EMR + Kylin, then upload data from Redshift or other data
source to
EMR/S3, then Kylin could easily to consume through Hive.

  BTW, why you are trying to read data from Redshift and build cube in
Kylin?
What's challenge you are facing now?

  Thanks.
Luke


Best Regards!
-

Luke Han

On Tue, Aug 8, 2017 at 11:31 PM, bsmithab  wrote:

> Hi
>
> We are looking at option of Using Redshift as data source to create OLAP in
> Apache Kylin. Is this possible? if Yes Can we install the apache Kylin on
> the Redshift cluster server or Is it mandatory that we need to have hadoop
> structure as base on which Apache Kyling can be installed. If we have to
> use
> Redshift as data source what is the approach we need to take. Please guide
>
> --
> View this message in context: http://apache-kylin.74782.x6.
> nabble.com/Use-AWS-Redshift-as-Data-Source-for-Apache-Kylin-tp8669.html
> Sent from the Apache Kylin mailing list archive at Nabble.com.
>


Re: Kylin 主从同步经常失败

2017-08-08 Thread lichaojacobs
I am sure it's not the network problem. Because some changes can successfully
send to other nodes. 
So, I think it's the program sometimes can not reponse status with 200,
perhaps 500. But there are no more specific error message to verify that. So
I suggest if you can add retry logic and narrow the try catch scope to log
more detailed message

--
View this message in context: 
http://apache-kylin.74782.x6.nabble.com/Kylin-tp8640p8663.html
Sent from the Apache Kylin mailing list archive at Nabble.com.


[jira] [Created] (KYLIN-2780) QueryController.getMetadata and CacheController.wipeCache may be deadlock

2017-08-08 Thread Fangyuan Deng (JIRA)
Fangyuan Deng created KYLIN-2780:


 Summary: QueryController.getMetadata and CacheController.wipeCache 
may be deadlock
 Key: KYLIN-2780
 URL: https://issues.apache.org/jira/browse/KYLIN-2780
 Project: Kylin
  Issue Type: Bug
  Components: Query Engine
Affects Versions: v2.0.0, v1.5.4.1
Reporter: Fangyuan Deng
Assignee: liyang


When we send update metadata request to the query sever manually,
while some getMetadata requests come,
dead lock may hadoop.





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Re: Timeout when refreshing a cube

2017-08-08 Thread bk_180
Hi Shaofeng Shi, 


Looking for some information on Hbase UI I realized that the node informed
in the log was stopped during the process. So this explaind the timeout,
correct? 

{"url":"http://:7070/kylin/api/jobs/349d62e3-7748-46ee-8009-3085726b5933","exception":"java.lang.RuntimeException:
org.apache.kylin.job.exception.PersistentException:
java.net.SocketTimeoutException: callTimeout=6, callDuration=63012: row
'/execute/349d62e3-7748-46ee-8009-3085726b5933' on table 'kylin_metadata' at
region=kylin_metadata,/dict/VIEW_CREDITO/COD_PROMOTOR/a61c83e7-01e7-4971-9539-dd5c7ed07dff.dict,1500376865680.301b695304237ff0d1a7774bbff83b67.,
/hostname=,/16020,1501960877256, seqNum=163046"}

We are running a Cluster on AWS EMR (hbase in S3) and using the tasknodes
with autoscalling feature turned on, I´m understandig that this node was
decomissioned due this configuration.

Can you see any other problem we migth face?

Thanks for answering my post.

--
View this message in context: 
http://apache-kylin.74782.x6.nabble.com/Timeout-when-refreshing-a-cube-tp8658p8671.html
Sent from the Apache Kylin mailing list archive at Nabble.com.


Re: Use AWS Redshift as Data Source for Apache Kylin

2017-08-08 Thread bsmithab
Can we use ODBC/JDBC connection to directly pull data from Redshift Data
warehouse? 

--
View this message in context: 
http://apache-kylin.74782.x6.nabble.com/Use-AWS-Redshift-as-Data-Source-for-Apache-Kylin-tp8669p8673.html
Sent from the Apache Kylin mailing list archive at Nabble.com.


Re: Use AWS Redshift as Data Source for Apache Kylin

2017-08-08 Thread Luke Han
JDBC is good idea which actually already have some investment here.
Please stay tuned for the progress.

Thanks.


Best Regards!
-

Luke Han

On Wed, Aug 9, 2017 at 12:50 PM, bsmithab  wrote:

> Can we use ODBC/JDBC connection to directly pull data from Redshift Data
> warehouse?
>
> --
> View this message in context: http://apache-kylin.74782.x6.
> nabble.com/Use-AWS-Redshift-as-Data-Source-for-Apache-
> Kylin-tp8669p8673.html
> Sent from the Apache Kylin mailing list archive at Nabble.com.
>


??????[Draft][REPORT] Apache Kylin - August 2017

2017-08-08 Thread ??????
+1.




--  --
??: "Luke Han";;
: 2017??8??8??(??) 7:29
??: "private"; 
"dev"; 

: [Draft][REPORT] Apache Kylin - August 2017



Hi all,

 Here's Kylin board report for this cycle, please help to review and
leave your feedback here.


  Thanks.


## Description:
Apache Kylin is an open source Distributed Analytics Engine designed
to provide SQL interface and multi-dimensional analysis (OLAP) on
Hadoop supporting extremely large datasets.

## Issues:
- there are no issues requiring board attention at this time

## Activity:

- Hadoop & Kylin Meetup hosted in Beijing on 2017-07-14

- Alberto Ram??n presented Kylin topic

at Docker Meetup on 2017-06-15

- Yang Li presented Keynote about Apache Kylin
at Tableau User Conf Shenzhen on 2017-07-12

- Luke Han presented Keynote about Apache Kylin
at Strata Hadoop World Beijing on 2017-07-15

- Dong Li presented Apache Kylin  2.x
at Strata Hadoop World Beijing on 2017-07-15

- Luke Han presented Keynote about Apache Kylin
at Strata Hadoop World Beijing on 2017-07-15

- Kaige Liu presented Kylin on AWS
at AWS Summit China in Beijing on 2017-07-26

- Dayue Gao presented Kylin topic
at Druid Meetup Beijing on 2017-08-05



## PMC changes:

- Currently 18 PMC members.
- No new PMC members added in the last 3 months

## Committer base changes:

- Currently 28 committers
- No new committers in the last 3 months.

- Last committer addition: Fri Apr 28 2017 (Alberto Ram??n)

## Releases:

- Last release was 2.0.0, released on Sun Apr 30 2017

## Mailing list activity:

- dev@kylin.apache.org:
- 370 subscribers (up 33 in the last 3 months)
- 778 emails sent in the past 3 months, 794 in the previous cycle

- iss...@kylin.apache.org:
- 71 subscribers (up 9 in the last 3 months)
- 1258 emails sent in the past 3 months, 921 in the previous cycle

- u...@kylin.apache.org:
- 286 subscribers (up 10 in the last 3 months)
- 327 emails sent in the past 3 months, 232 in the previous cycle

## JIRA activity:

- 180 JIRA tickets created in the last 3 months
- 124 JIRA tickets closed/resolved in the last 3 months