shardulm94 commented on PR #40637:
URL: https://github.com/apache/spark/pull/40637#issuecomment-1496640034

   Thanks @ShreyeshArangath for this! I think it helps clear a lot of 
unnecessary noise from user logs and keeps the logs manageable.
   
   One thing I noticed is that we set `spark.yarn.report.logging.frequency` to 
`30` by default. I think it is a much more sensible default, especially given 
a) Spark jobs submitted to YARN would usually run longer than a minute 
including the overhead of launching Spark driver b) We always report any state 
change immediately e.g. `ACCEPTED` -> `RUNNING`. I would be pro defaulting to 
`30`. But if we want to eliminate behavior change, we can always default it to 
`1`.
   
   I saw the tests are failing with what seems to be a transient Github 
connection issue.
   ```
   /usr/bin/git -c protocol.version=2 fetch --prune --progress 
--no-recurse-submodules origin +refs/heads/*:refs/remotes/origin/* 
+refs/tags/*:refs/tags/*
     Error: fatal: unable to access 'https://github.com/apache/spark/': The 
requested URL returned error: 429
     The process '/usr/bin/git' failed with exit code 128
   ```
   Can you push an empty commit to re-run the tests?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to