FridaPolished opened a new pull request #32304:
URL: https://github.com/apache/spark/pull/32304


   **What changes were proposed in this pull request?**
   The original Spark UI on SHS does not have a specific way to explore 
accumulators information, this data is scattered across stages which makes it 
difficult to access it.
    The accumulators information is located on the Stage Page, but this column 
is hard to read since it groups all the accumulators names and values for a 
specific task. This is one of the reasons why understanding the changes of 
counters from a job can become a difficult task.
   The proposed change creates an explore page for Accumulators on the SparkUI. 
The information will be located under a new tab that has an overview page with 
links to check for more details about the accumulators information by a 
particular name or stage.
   [Community discussion on accumulators 
page](http://apache-spark-developers-list.1001551.n3.nabble.com/accumulators-td8814.html)
   
   **Why are the changes needed?**
   A page that allows users to explore accumulators information is a helpful 
feature for users because currently the information is scattered across stages 
and the display on the Stages Page is difficult to read. This page offers easy 
access to accumulators information in detail which translates into a better 
user experience and an increased velocity to solve problems and track 
accumulator changes.
   
   **Does this PR introduce user-facing change?**
   Yes. A new tab will be added to the Spark UI and three new pages will 
dependent on it.
   
   <img width="889" alt="Screen Shot 2021-04-09 at 3 05 39 PM" 
src="https://user-images.githubusercontent.com/45573418/115795081-15b31300-a384-11eb-88c2-56d7454faee5.png";>
   
    **How was this patch tested?**
   This patch was tested manually by using real spark event logs from a Spark 
History Server on a production cluster. I also used unit testing to check for 
contents inside the new data structure used for the Accumulators Overview page.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to