gaborgsomogyi commented on a change in pull request #27398: 
[SPARK-30481][DOCS][FOLLOWUP] Document event log compaction into new section of 
monitoring.md
URL: https://github.com/apache/spark/pull/27398#discussion_r380308311
 
 

 ##########
 File path: docs/monitoring.md
 ##########
 @@ -95,6 +95,49 @@ The history server can be configured as follows:
   </tr>
 </table>
 
+### Applying compaction on rolling event log files
+
+A long-running application (e.g. streaming) can bring a huge single event log 
file which may cost a lot to maintain and
+also requires a bunch of resource to replay per each update in Spark History 
Server.
+
+Enabling <code>spark.eventLog.rolling.enabled</code> and 
<code>spark.eventLog.rolling.maxFileSize</code> would
+let you have rolling event log files instead of single huge event log file 
which may help some scenarios on its own,
+but it still doesn't help you reducing the overall size of logs.
+
+Spark History Server can apply 'compaction' on the rolling event log files to 
reduce the overall size of
 
 Review comment:
   Nit: Not sure what's the intention with this `‘` sign here. Maybe 
`compaction` word is not so special that it must be highlighted. This applies 
more places.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to