Github user shahidki31 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23016#discussion_r234396276

    --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala ---
    @@ -174,6 +174,10 @@ class PrefixSpan private (
         val freqSequences = results.map { case (seq: Array[Int], count: Long) =>
           new FreqSequence(toPublicRepr(seq), count)
         }
    +    // Cache the final RDD to the same storage level as input
    +    freqSequences.persist(data.getStorageLevel)
    --- End diff --

    @srowen Yes. That is the correct approach. I updated the code. Thanks