Yes (thanks for your help)

My fact table is only 3.9 Millons, I will try Cube_06 with more data
One of my Dim have 800K rows, I want test create this Dim with Buckets on
Hive

2016-12-21 11:25 GMT+01:00 ShaoFeng Shi <shaofeng...@apache.org>:

> Hi Alberto, this is a Great test, the only issue might be the data set is
> too small for Kylin, but the conclusion are the same, like a) enable
> compression can improve overall performance; b) optimize the cube design
> with "hierarchy"/"joint" can reduce the calculations and storage, etc
>
> For "Cube_06" test, usually partition is used for table which has huge
> amount of data (partition can be used for data pruning); Lookup tables
> don't need be partitioned: making all records in 1 single file will be more
> efficient than diving them into 70 files;  \
>
> If you want to compare hive parition/non-partition, suggest you find a
> bigger fact table, e.g 5 or 10 million rows;
>
> 2016-12-21 17:53 GMT+08:00 Alberto Ramón <a.ramonporto...@gmail.com>:
>
> > I attached as PDF,  ... I don't know it this is forbidden in MailList
> >
> > googleDrive
> > <https://drive.google.com/drive/folders/0B-6nZ2q-
> HPTNem1KTTRHbDhpOG8?usp=
> > sharing>
> >  (tell me if there is any problem)
> >
> > 2016-12-21 10:27 GMT+01:00 ShaoFeng Shi <shaofeng...@apache.org>:
> >
> > > Hi Alberto, where I can preview this doc? Thanks!
> > >
> > > 2016-12-21 6:46 GMT+08:00 Alberto Ramón <a.ramonporto...@gmail.com>:
> > >
> > > > I made a small tech notes about my performance tests, the doc is
> > > > unfinished (I need more time, test and knowledge)
> > > > Review my English mistakes is pending
> > > >
> > > > If somebody have any comment, test, more experience , ... feel free
> > make
> > > > any suggestion
> > > >
> > > > Alb
> > > >
> > >
> > >
> > >
> > > --
> > > Best regards,
> > >
> > > Shaofeng Shi 史少锋
> > >
> >
>
>
>
> --
> Best regards,
>
> Shaofeng Shi 史少锋
>

Reply via email to