[ https://issues.apache.org/jira/browse/SPARK-23394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Attila Zsolt Piros updated SPARK-23394:
---------------------------------------
Attachment: Spark_2.4.0-SNAPSHOT.png
Spark_2.2.1.png
> Storage info's Cached Partitions doesn't consider the replications (but
> sc.getRDDStorageInfo does)
> --------------------------------------------------------------------------------------------------
>
> Key: SPARK-23394
> URL: https://issues.apache.org/jira/browse/SPARK-23394
> Project: Spark
> Issue Type: Bug
> Components: Spark Core
> Affects Versions: 2.3.0
> Reporter: Attila Zsolt Piros
> Priority: Major
> Attachments: Screen Shot 2018-02-12 at 11.24.22.png, Spark_2.2.1.png,
> Spark_2.4.0-SNAPSHOT.png
>
>
> Start Spark in local-cluster mode (2 workers, 1 core and 1024 MB each):
> {code:bash}
> $ bin/spark-shell --master local-cluster[2,1,1024]
> {code}
> {code:scala}
> scala> import org.apache.spark.storage.StorageLevel._
> import org.apache.spark.storage.StorageLevel._
> scala> sc.parallelize((1 to 100), 10).persist(MEMORY_AND_DISK_2).count
> res0: Long = 100
>
> scala> sc.getRDDStorageInfo(0).numCachedPartitions
> res1: Int = 20
> {code}
> But on the Storage tab of the web UI, Cached Partitions is 10: the UI counts
> each partition once, while sc.getRDDStorageInfo counts every replica. See the
> attached screenshot.
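
The gap between the two numbers matches the replication factor of the storage level. The following is a rough sketch of that arithmetic for the same spark-shell session, not part of the original report. The value names are hypothetical and chosen only for illustration, and the figures 20 and 10 are taken from the report above rather than measured here.
{code:scala}
import org.apache.spark.storage.StorageLevel

// Replication factor of the storage level used in the report.
val replication = StorageLevel.MEMORY_AND_DISK_2.replication // 2

// The RDD in the report was created with 10 partitions.
val numPartitions = 10

// Counting every replica of every cached partition gives the value
// reported by sc.getRDDStorageInfo(0).numCachedPartitions.
val blocksIncludingReplicas = numPartitions * replication // 20

// Counting each partition once gives the value shown on the Storage tab.
val distinctCachedPartitions = blocksIncludingReplicas / replication // 10
{code}
With a non-replicated level such as MEMORY_AND_DISK the replication factor is 1, so the two counts would be expected to agree.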