Re: [DISCUSS] For the dimension default should be no dictionary

2017-03-02 Thread Jacky Li
> nality to high cardinality, this will make most compression and fit for scenario that does not have frequent filter on high card column
> 2) Put high cardinality colu

Re: [DISCUSS] For the dimension default should be no dictionary

2017-03-02 Thread bill.zhou
> umn, it can be put in SORT_COLUMNS option.
>
> If this option is not specified by user, carbon will pick MDK as it is now.
>
> 2. TABLE_DICTIONARY

Re: [DISCUSS] For the dimension default should be no dictionary

2017-03-01 Thread Ravindra Pesala
> S option.
>
> If this option is not specified by user, carbon will pick MDK as it is now.
>
> 2. TABLE_DICTIONARY
> This is to specify the table

Re: [DISCUSS] For the dimension default should be no dictionary

2017-03-01 Thread Kumar Vishal
> specified by user, carbon will pick MDK as it is now.
>
> 2. TABLE_DICTIONARY
> This is to specify the table level dictionary columns. Will create global dictionary for a

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread Ravindra Pesala
> RY
> This is to specify the table level dictionary columns. Will create global dictionary for all columns in this option for every data load.
>
> When to use: The option is designed for accelerating aggregate query,

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread QiangCai
-mailing-list-archive.1130556.n5.nabble.com/DISCUSS-For-the-dimension-default-should-be-no-dictionary-tp8010p8122.html Sent from the Apache CarbonData Mailing List archive mailing list archive at Nabble.com.

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread Jacky Li
> If this option is not specified by the user, all columns are encoded without global dictionary support. A normal shuffle on decoded values will be applied when doing a group by operation.
>
> I think these two opt

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread Jacky Li
>> goal of them is to satisfy the most scenario without deep tuning of the table
>> For advanced user who want to do deep tuning, we can debate to add more options. But we need to identify what scenario is not satisfied by using these two options first.
>>
>> Regards,
>> Jacky
>>
>> --
>> View this message in context: http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/DISCUSS-For-the-dimension-default-should-be-no-dictionary-tp8010p8081.html
>> Sent from the Apache CarbonData Mailing List archive mailing list archive at Nabble.com.
>
> --
> Regards
> Liang

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread Ravindra Pesala
>> goal of them is to satisfy the most scenario without deep tuning of the table
>> For advanced user who want to do deep tuning, we can debate to add more options. But we need to identify what scenario is not satisfied by using these two options first.
>>
>> Regards,
>> Jacky
>>
>> --
>> View this message in context: http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/DISCUSS-For-the-dimension-default-should-be-no-dictionary-tp8010p8081.html
>> Sent from the Apache CarbonData Mailing List archive mailing list archive at Nabble.com.
>
> --
> Regards
> Liang

--
Thanks & Regards,
Ravi

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread bill.zhou
> the dictionary_exclude properties, the dimension will be considered as dictionary by default. I think the default should be no dictionary.
>
> For example, when I did the POC for one customer, the table had 300 columns

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread Liang Chen
> ut we need to identify what scenario is not satisfied by using these two options first.
>
> Regards,
> Jacky
>
> --
> View this message in context: http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/DISCUSS-For-the-dimension-default-should-be-no-dictionary-tp8010p8081.html
> Sent from the Apache CarbonData Mailing List archive mailing list archive at Nabble.com.

--
Regards
Liang

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread Jacky Li
-mailing-list-archive.1130556.n5.nabble.com/DISCUSS-For-the-dimension-default-should-be-no-dictionary-tp8010p8081.html

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-28 Thread Jacky Li
> e will suffer.
>
> Regards,
> Ravindra.
>
> On 26 February 2017 at 20:04, bill.zhou <zgcsky08@> wrote:

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-27 Thread Liang Chen
> Ravindra.
>
> On 26 February 2017 at 20:04, bill.zhou <zgcsky08@> wrote:
>
>> Hi all,
>> Now, when creating a CarbonData table, if a dimension is not added

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-27 Thread Ravindra Pesala
> der as dictionary by default. I think the default should be no dictionary.
>
> For example, when I did the POC for one customer, the table had 300 columns and 200 dimensions, but only 5 columns are used for filters, so he only needs

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-26 Thread bill.zhou
> these 5 columns as dictionary and can leave the other 195 columns as no dictionary.
> But now he needs to add the 195 columns to the dictionary_exclude properties, which wastes time and makes the create table command huge, and will also impact
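For illustration only, here is a minimal Python sketch of the situation the quoted message describes: 200 dimensions, 5 of them used for filters, and the remaining 195 listed by hand in the table properties. It assumes the CarbonData 1.x DDL form (`STORED BY 'carbondata'` with a `DICTIONARY_EXCLUDE` table property). The table name, the column names, the STRING types, and the helper `build_create_table` are hypothetical and do not come from the thread.

```python
def build_create_table(table, columns, no_dict_columns):
    """Build a CREATE TABLE statement (assumed CarbonData 1.x syntax).

    Every column in no_dict_columns is listed in DICTIONARY_EXCLUDE,
    which is what the current default forces the user to spell out.
    """
    col_defs = ", ".join(f"{name} STRING" for name in columns)
    ddl = f"CREATE TABLE {table} ({col_defs}) STORED BY 'carbondata'"
    if no_dict_columns:
        exclude = ",".join(no_dict_columns)
        ddl += f" TBLPROPERTIES ('DICTIONARY_EXCLUDE'='{exclude}')"
    return ddl


# Hypothetical column names standing in for the customer's 200 dimensions.
dimensions = [f"dim_{i}" for i in range(200)]
filter_columns = dimensions[:5]  # the 5 columns that are actually filtered on
excluded = [c for c in dimensions if c not in filter_columns]

ddl = build_create_table("poc_table", dimensions, excluded)
print(len(excluded), "columns must be listed in DICTIONARY_EXCLUDE")
print(len(ddl), "characters in the CREATE TABLE statement")
```

With no dictionary as the default, the exclude list in this sketch would be empty and only the 5 filter columns would need to be named.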

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-26 Thread Kumar Vishal
> e properties, which wastes time and makes the create table command huge, and will also impact the load performance.
>
> So I suggest the dimension default should be no dictionary; this can also help the customer easily kno

Re: [DISCUSS] For the dimension default should be no dictionary

2017-02-26 Thread Ravindra Pesala
> d performance.
>
> So I suggest the dimension default should be no dictionary; this can also help the customer easily know which dictionary columns are useful.
>
> --
> View this message in context: http://apache-carbondata-mailing-list-archive.1130556.n5.nab

[DISCUSS] For the dimension default should be no dictionary

2017-02-26 Thread bill.zhou
nsion-default-should-be-no-dictionary-tp8010.html