Re:Documentation to test early versions of Kylin on Parquet v2

2020-06-26 Thread Xiaoxiang Yu
Hi, It is a good question, Parquet Storage's source code link is here https://github.com/apache/kylin/tree/kylin-on-parquet-v2 , documentation link is here https://cwiki.apache.org/confluence/display/KYLIN/KIP-1%3A+Parquet+storage (old doc: https://github.com/Kyligence/kylin-on-parquet-v2

Documentation to test early versions of Kylin on Parquet v2

2020-06-24 Thread hubert stefani
Hi, After the exciting webinar on upcoming version of Kylin on Parquet, I would like to test with my team this new approach. Is there an up-to date documentation to help us start ? Hubert

Fwd: See you at: Apache Kylin on Parquet: Introduction to the New Storage Engine

2020-06-16 Thread ShaoFeng Shi
END:VTIMEZONE BEGIN:VEVENT DTSTAMP:20200615T074150Z DTSTART;TZID=America/Los_Angeles:20200617T183000 DTEND;TZID=America/Los_Angeles:20200617T20 STATUS:CONFIRMED SUMMARY:Apache Kylin on Parquet: Introduction to the New Storage Engine DESCRIPTION:Big Data Bellevue (BDB)\nWednesday\, June 17 at 6:30 PM

Re: Kylin with Parquet

2020-06-14 Thread ShaoFeng Shi
Hi Manish, As parquet support encoding, Kylin doesn't need to do that anymore; That means, it is possible to read the origin values directly from the cube files. Best regards, Shaofeng Shi 史少锋 Apache Kylin PMC Email: shaofeng...@apache.org Apache Kylin FAQ: https://kylin.apache.org/docs

Re: Kylin with Parquet

2020-06-13 Thread Manish Jain
It make sense. Are we also planning to change storage encoding or it will remain the same? Will we able read data using normal Hive/Presto queries or it will require Kylin reader only ?! On Sun, 14 Jun 2020 at 7:42 AM, ShaoFeng Shi wrote: > This is a good question; > > One of the purposes of

Re: Kylin with Parquet

2020-06-13 Thread ShaoFeng Shi
This is a good question; One of the purposes of developing the parquet storage is to overcome the limitations of HBase, which also means, to replace HBase; If the new storage is successful, we may stop to maintain the HBase engine. Best regards, Shaofeng Shi 史少锋 Apache Kylin PMC Email:

Re: Kylin with Parquet

2020-06-13 Thread Manish Jain
Ok, thanks. On Sat, 13 Jun 2020 at 8:15 PM, Liukaige wrote: > Hey Manish, > > The new Parquet storage is almost ready but has not been released yet. In > one installation you can only choose one of them, not both together. > I guess the community will continue to maintain HBase version for a

Kylin with Parquet

2020-06-12 Thread Manish Jain
Is Kylin supports storage in Parquet now instead of HBase ? Or it supports both Parquet and Hbase ? -- Best Regards, Manish Jain

[Online Event] Introduction and Quick Start of Kylin on Parquet, April 18, 2020

2020-04-16 Thread George Ni
Kylin users & developers, There will be an online Kylin event this Saturday (April 18). Kylin on HBase has been mature for a long time, but there are still some limitations. Engineers from Kyligence will share their new storage solution, Kylin On Parquet. Date: April 18 Time: 8:00 PM -

Re: Kylin on parquet

2019-06-29 Thread ShaoFeng Shi
Hi David, The Kylin on Parquet is not yet ready for release: there are room to improve the performance. The major feature in Kylin 3.0 is the real-time OLAP, the team is putting more effort on this part. So the progress on parquet storage was slow. If you like this feature, please +1 on the JIRA

Re: Kylin on parquet

2019-06-28 Thread Billy Liu
The feature is still on-going, but I don't think it's a part of 3.0. Kylin 3.0 alpha has been released, most features are finalized for this major version. With Warm regards Billy Liu David Kis 于2019年6月28日周五 下午11:57写道: > > Hi, > > There was a discussion earlier that Kylin will be able to use

Kylin on parquet

2019-06-28 Thread David Kis
Hi, There was a discussion earlier that Kylin will be able to use Parquet file as storage format. Are there are any updates regarding this topic? Will it be implemented in the upcoming 3.0 release? Thanks, David

Re: Re: Evaluate Kylin on Parquet

2019-01-01 Thread ShaoFeng Shi
>> >> page index. The filtering of the dictionary is nothing but a file >> index, >> >> because we can insert an index server here. >> >> >> >> JiaTao Tao 于2018年12月19日周三 下午4:45写道: >> >> >> >> > Hi

Re: Re: Evaluate Kylin on Parquet

2018-12-31 Thread Li Yang
page index(for page > pruning) > >> > etc. We can put all these stuff in such a system and make the > >> separation of > >> > duties cleaner. > >> > > >> > > >> > Ma Gang 于2018年12月19日周三 上午6:31写道: > >> > > >

Re: Re: Evaluate Kylin on Parquet

2018-12-29 Thread ShaoFeng Shi
ibed in: https://issues.apache.org/jira/browse/KYLIN-3490 >> > > >> > > At 2018-12-17 15:36:01, "ShaoFeng Shi" >> wrote: >> > > >> > > The dimension dictionary is a legacy design for HBase storage I think; >> > > because HBase has no data type, ever

Re: Re: Evaluate Kylin on Parquet

2018-12-19 Thread JiaTao Tao
> Work email: shaofeng@kyligence.io > Kyligence Inc: https://kyligence.io/ > > Apache Kylin FAQ: https://kylin.apache.org/docs/gettingstarted/faq.html > Join Kylin user mail group: user-subscr...@kylin.apache.org > Join Kylin dev mail group: dev-subscr...@kylin.apache.org &

Re:Re: Evaluate Kylin on Parquet

2018-12-18 Thread Ma Gang
roup: user-subscr...@kylin.apache.org Join Kylin dev mail group: dev-subscr...@kylin.apache.org Chao Long 于2018年12月17日周一 下午1:23写道: In this PoC, we verified Kylin On Parquet is viable, but the query performance still have room to improve. We can improve it from the following aspects: 1

Re: Evaluate Kylin on Parquet

2018-12-16 Thread ShaoFeng Shi
://kyligence.io/ Apache Kylin FAQ: https://kylin.apache.org/docs/gettingstarted/faq.html Join Kylin user mail group: user-subscr...@kylin.apache.org Join Kylin dev mail group: dev-subscr...@kylin.apache.org Chao Long 于2018年12月17日周一 下午1:23写道: > In this PoC, we verified Kylin On Parquet is via

??????Evaluate Kylin on Parquet

2018-12-16 Thread Chao Long
In this PoC, we verified Kylin On Parquet is viable, but the query performance still have room to improve. We can improve it from the following aspects: 1, Minimize result set serialization time Since Kylin need Object[] data to process, we convert Dataset to RDD, and then convert the &quo

Evaluate Kylin on Parquet

2018-12-14 Thread ShaoFeng Shi
Hello Kylin users, The first version of Kylin on Parquet [1] feature has been staged in Kylin code repository for public review and evaluation. You can check out the "kylin-on-parquet" branch [2] to read the code, and also can make a binary build to run an example. When creating a cub