Re: [DISCUSSION] Regarding to redundancy code and some issues.

Liang Chen Sat, 04 Nov 2017 07:32:19 -0700

+1, all are good proposals.

Regards
Liang



David CaiQiang wrote
> Hi All,
>    Here, I listed the following points to improve the code.
> 
> Redundancy:
> 1. CarbonLoadModel.isDirectLoad
> It is always true, better to remove the related code.
> Now CarbonData doesn't pre-partition the input data by machine node again,
> so it is not required.
> 
> 2. isTableSplitPartition
> in CarbonDataRDDFactory and NewCarbonDataLoadRDD, it is always false,
> better
> to remove the related code also.
> 
> Re-factory:
> 1. CarbonDataRDDFactory.loadCarbonData 
> This method is not readable, it is too large to support load data from the
> input file or select query, support load or insert or update, support
> partition and so on. Better to decouple the code by function.
> 
> 2. Unit Test Case
> There are about 400 CSV files in Unit Test Case.
> Suggesting to unify the input scenario to reduce the CSV file and improve
> the coverage of UT.
> 
> Issue:
> 1. During the data loading, sort_columns should support all datatype
> 
> 2.During the query, the end key uses one byte "0xFF" by default, it is not
> correct.
> 
>    Any question? Any suggesting?
> 
> 
> 
> 
> -----
> Best Regards
> David Cai
> --
> Sent from:
> http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/





--
Sent from: 
http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/

Re: [DISCUSSION] Regarding to redundancy code and some issues.

Reply via email to