[GitHub] [carbondata] Zhangshunyu commented on a change in pull request #3637: [CARBONDATA-3721][CARBONDATA-3590] Optimize Bucket Table

GitBox Mon, 02 Mar 2020 22:17:57 -0800

Zhangshunyu commented on a change in pull request #3637: 
[CARBONDATA-3721][CARBONDATA-3590] Optimize Bucket Table
URL: https://github.com/apache/carbondata/pull/3637#discussion_r386817746


 ##########
 File path: processing/pom.xml
 ##########
 @@ -45,6 +45,11 @@
       <artifactId>spark-sql_${scala.binary.version}</artifactId>
       <version>${spark.version}</version>
     </dependency>
+    <dependency>
+      <groupId>org.apache.spark</groupId>
 
 Review comment:
   @jackylk if want to keep correct join result with parquet bucket tables, 
need to use same methods to hash the data of each datatype, so the code is 
needed.
   1. copy the code from spark, but there are about 2,000 lines and if we copy 
the code, once spark change them we need to change together, its not a good 
choice, more details pls check the conversations above.
   2. depend on spark-unsafe jar, we just depend 1 jar of spark and the changes 
of diff spark version don't have effect on us since we use it by version 
control in pom.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

[GitHub] [carbondata] Zhangshunyu commented on a change in pull request #3637: [CARBONDATA-3721][CARBONDATA-3590] Optimize Bucket Table

Reply via email to