srowen commented on a change in pull request #25445: [SPARK-28541][WEBUI] 
Document Storage page
URL: https://github.com/apache/spark/pull/25445#discussion_r315251438
 
 

 ##########
 File path: docs/web-ui.md
 ##########
 @@ -45,6 +45,54 @@ The Storage tab displays the persisted RDDs and DataFrames, 
if any, in the appli
 page shows the storage levels, sizes and partitions of all RDDs, and the 
details page shows the
 sizes and using executors for all partitions in an RDD or DataFrame.
 
+{% highlight scala %}
+scala> import org.apache.spark.storage.StorageLevel._
+import org.apache.spark.storage.StorageLevel._
+
+scala> val rdd = sc.range(0, 100, 1, 5).setName("rdd")
+rdd: org.apache.spark.rdd.RDD[Long] = rdd MapPartitionsRDD[1] at range at 
<console>:27
+
+scala> rdd.persist(MEMORY_ONLY_SER)
+res0: rdd.type = rdd MapPartitionsRDD[1] at range at <console>:27
+
+scala> rdd.count
+res1: Long = 100                                                               
 
+
+scala> val df = Seq((1, "andy"), (2, "bob"), (2, "andy")).toDF("count", "name")
+df: org.apache.spark.sql.DataFrame = [count: int, name: string]
+
+scala> df.persist(DISK_ONLY)
+res2: df.type = [count: int, name: string]
+
+scala> df.count
+res3: Long = 3
+{% endhighlight %}
+
+<p style="text-align: center;">
+  <img src="img/webui-storage-tab.png"
+       title="Storage tab"
+       alt="Storage tab"
+       width="100%" />
+  <!-- Images are downsized intentionally to improve quality on retina 
displays -->
+</p>
+
+After running the above example, we can find two RDDs listed in the Storage 
tab. Basic information like
+storage level, number of partitions and memory overhead are provided. Note 
that the newly persisted RDDs
+or DataFrames are not shown in the tab before they are materialized. To 
monitor a specific RDD or DataFrame,
+make sure an action operation has been triggered.
+
+<p style="text-align: center;">
+  <img src="img/webui-storage-detail.png"
+       title="Storage detail"
+       alt="Storage detail"
+       width="100%" />
+  <!-- Images are downsized intentionally to improve quality on retina 
displays -->
+</p>
+
+You can click the RDD name 'rdd' for obtaining the details of data 
persistance, such as the data
 
 Review comment:
   persistance -> persistence

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to