Grant Henke has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12484 )

Change subject: KUDU-2672: [spark] Optionally repartition to match Kudu 
partitions
......................................................................


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/12484/1/java/kudu-spark/src/main/scala/org/apache/kudu/spark/kudu/KuduContext.scala
File 
java/kudu-spark/src/main/scala/org/apache/kudu/spark/kudu/KuduContext.scala:

http://gerrit.cloudera.org:8080/#/c/12484/1/java/kudu-spark/src/main/scala/org/apache/kudu/spark/kudu/KuduContext.scala@386
PS1, Line 386:     val keyedRdd = rdd.mapPartitions { rows =>
> I also talked to Thomas (an Impala dev) about their approach. He agreed tha
Correct, nothing stops us from making the partitioner serializable and sharing 
it in the future.


http://gerrit.cloudera.org:8080/#/c/12484/1/java/kudu-spark/src/main/scala/org/apache/kudu/spark/kudu/KuduContext.scala@407
PS1, Line 407:     val shuffledRDD = if (writeOptions.repartitionSort) {
> OK, but should we at least change the default for repartitionSort to true?
The default for `repartition` is false. This is because I don't want users jobs 
to change behavior drastically on upgrade. Additionally I would like at least a 
release with this feature before making it the default.

The default for `repartitionSort` is true. Meaning when we repartition we will 
sort by default.



--
To view, visit http://gerrit.cloudera.org:8080/12484
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8763615997bccc08901235841149fc3bacb321e7
Gerrit-Change-Number: 12484
Gerrit-PatchSet: 1
Gerrit-Owner: Grant Henke <[email protected]>
Gerrit-Reviewer: Adar Dembo <[email protected]>
Gerrit-Reviewer: Grant Henke <[email protected]>
Gerrit-Reviewer: Kudu Jenkins (120)
Gerrit-Reviewer: Mike Percy <[email protected]>
Gerrit-Reviewer: Will Berkeley <[email protected]>
Gerrit-Comment-Date: Mon, 25 Feb 2019 20:55:37 +0000
Gerrit-HasComments: Yes

Reply via email to