[GitHub] [carbondata] Zhangshunyu edited a comment on issue #3637: [CARBONDATA-3721][CARBONDATA-3590] Support Bucket Table

GitBox Fri, 28 Feb 2020 02:45:40 -0800

Zhangshunyu edited a comment on issue #3637: [CARBONDATA-3721][CARBONDATA-3590] 
Support Bucket Table
URL: https://github.com/apache/carbondata/pull/3637#issuecomment-592454295
 
 
   @ravipesala Please check all the new test cases added in 
TableBucketingTestCase and the comment I added in the PR description. We have 
this feature, but it does not work as expected:
   1. All data is stored in one file; it is not clustered by bucket in the 
current code.
   2. A join with a Parquet table returns wrong results. Even between Carbon 
tables, string values are hashed to different hash codes, so the join results 
mismatch. We should use the same hash method as Spark and keep the hash value 
consistent for the same input.
   3. After compaction, the data is stored in the file for bucket id 0.
   4. The new insert flow does not work for bucket tables.
   5. For the remaining issues, please check the added test cases.
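
   To illustrate point 2, here is a minimal sketch of why both sides of a 
bucketed join must hash a key the same way. It is not the CarbonData or Spark 
implementation: the class `BucketSketch`, the helpers `hashA` and `hashB`, and 
the bucket count are hypothetical, and both hash functions are stand-ins 
chosen only to show that two different hashes can place the same string in 
different buckets.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; not CarbonData or Spark code.
class BucketSketch {
    // Map a hash to a bucket id in [0, numBuckets), even for negative hashes.
    static int bucketId(int hash, int numBuckets) {
        return Math.floorMod(hash, numBuckets);
    }

    // Stand-in hash A: Java's String.hashCode (an assumption, not Spark's hash).
    static int hashA(String s) {
        return s.hashCode();
    }

    // Stand-in hash B: 32-bit FNV-1a over the UTF-8 bytes (also not Spark's hash).
    static int hashB(String s) {
        int h = 0x811c9dc5;
        for (byte b : s.getBytes(StandardCharsets.UTF_8)) {
            h ^= (b & 0xff);
            h *= 16777619;
        }
        return h;
    }

    public static void main(String[] args) {
        int numBuckets = 4; // assumed bucket count
        for (String key : new String[] {"alice", "bob", "carol"}) {
            // If the two sides of a join disagree on the bucket for a key,
            // a bucket-to-bucket join never compares the matching rows.
            System.out.printf("%s -> A:%d B:%d%n", key,
                bucketId(hashA(key), numBuckets),
                bucketId(hashB(key), numBuckets));
        }
    }
}
```

   Whenever the two columns differ for a key, a join that pairs bucket N with 
bucket N would miss that key, which is why the writer and the query engine 
need one shared hash method.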


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
