Github user andrewor14 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10835#discussion_r50742371
  
    --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkHadoopUtil.scala 
---
    @@ -370,6 +370,14 @@ object SparkHadoopUtil {
     
       val SPARK_YARN_CREDS_COUNTER_DELIM = "-"
     
    +  /**
    +   * Number of records to update input metrics when reading from 
HadoopRDDs.
    +   *
    +   * Each update is potentially expensive because we need to use 
reflection to access the
    +   * Hadoop FileSystem API of interest (only available in 2.5), so we 
should do this sparingly.
    +   */
    +  private[spark] val UPDATE_INPUT_METRICS_INTERVAL_RECORDS = 1000
    --- End diff --
    
    we used to do this as part of the confusing bytes callback in 
`InputMetrics`. The purpose of that was to keep the UI updated as we read input 
data. This is just a more straightforward way of doing it so we can keep 
`InputMetrics` simple.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to