[ https://issues.apache.org/jira/browse/SPARK-28933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16920584#comment-16920584 ]
Liang-Chi Hsieh commented on SPARK-28933:
-----------------------------------------

This issue was resolved by [https://github.com/apache/spark/pull/25639].

> Reduce unnecessary shuffle in ALS when initializing factors
> -----------------------------------------------------------
>
>                 Key: SPARK-28933
>                 URL: https://issues.apache.org/jira/browse/SPARK-28933
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 3.0.0
>            Reporter: Liang-Chi Hsieh
>            Assignee: Liang-Chi Hsieh
>            Priority: Major
>
> When initializing factors in ALS, we should use {{mapPartitions}} instead of
> the current {{map}}, so that the existing partitioning of the RDD of
> {{InBlock}} is preserved. The RDD of {{InBlock}} is already partitioned by
> src block id, and initializing factors does not change that partitioning.
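
For illustration, the difference between {{map}} and {{mapPartitions}} described in the issue can be reproduced on a toy RDD. This is a minimal sketch, not the code from the pull request: the object name, the {{blocks}} RDD standing in for the RDD of {{InBlock}}, and the scaling step standing in for factor initialization are all assumptions made up for the example. It relies only on {{RDD.mapPartitions}} accepting a {{preservesPartitioning}} flag and on {{RDD.partitioner}}.

```scala
import org.apache.spark.{HashPartitioner, SparkConf, SparkContext}

// Hypothetical stand-alone demo; names here are not taken from ALS.scala.
object PreservePartitioningSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("preserve-partitioning-sketch")
      .setMaster("local[2]")
    val sc = new SparkContext(conf)
    try {
      // Stand-in for the RDD of InBlock: (srcBlockId, payload),
      // explicitly partitioned by block id.
      val blocks = sc.parallelize(0 until 8)
        .map(id => (id, Array.fill(3)(id.toDouble)))
        .partitionBy(new HashPartitioner(4))

      // map() could change the keys, so Spark discards the partitioner.
      val viaMap = blocks.map { case (id, payload) =>
        (id, payload.map(_ * 0.1))
      }

      // mapPartitions() with preservesPartitioning = true keeps the parent's
      // partitioner. This is only safe because the keys are left unchanged.
      val viaMapPartitions = blocks.mapPartitions(
        iter => iter.map { case (id, payload) => (id, payload.map(_ * 0.1)) },
        preservesPartitioning = true)

      assert(viaMap.partitioner.isEmpty)
      assert(viaMapPartitions.partitioner == blocks.partitioner)

      println(s"map:           ${viaMap.partitioner}")
      println(s"mapPartitions: ${viaMapPartitions.partitioner}")
    } finally {
      sc.stop()
    }
  }
}
```

When the partitioner is retained, a later join or cogroup against an RDD with the same partitioner can generally reuse the existing layout rather than shuffling again, which is the saving the issue title refers to.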