[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files
[ https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15726701#comment-15726701 ] Thomas Graves commented on SPARK-18733: --- oh nevermind its looking at lastupdated time. > Spark history server file cleaner excludes in-progress files > > > Key: SPARK-18733 > URL: https://issues.apache.org/jira/browse/SPARK-18733 > Project: Spark > Issue Type: Bug > Components: Web UI >Affects Versions: 2.0.2 >Reporter: Ergin Seyfe > > When we restart history server, it does spend a lot of time to load/replay > incomplete applications which mean the inprogress log files in the log folder. > We have already enabled "spark.history.fs.cleaner.enabled" but seems like > it's skipping the inprogress files. > I checked the log folder and saw that there are many old orphan files. > Probably files left over due to spark-driver failures or OOMs. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files
[ https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15726671#comment-15726671 ] Thomas Graves commented on SPARK-18733: --- yes looks like a dup but I'm not sure on current solution. max age could easily be set to say 1 week and a job could easily run longer then 1 week. I'll post comment on PR about that though. > Spark history server file cleaner excludes in-progress files > > > Key: SPARK-18733 > URL: https://issues.apache.org/jira/browse/SPARK-18733 > Project: Spark > Issue Type: Bug > Components: Web UI >Affects Versions: 2.0.2 >Reporter: Ergin Seyfe > > When we restart history server, it does spend a lot of time to load/replay > incomplete applications which mean the inprogress log files in the log folder. > We have already enabled "spark.history.fs.cleaner.enabled" but seems like > it's skipping the inprogress files. > I checked the log folder and saw that there are many old orphan files. > Probably files left over due to spark-driver failures or OOMs. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files
[ https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15726292#comment-15726292 ] Ergin Seyfe commented on SPARK-18733: - Hi [~vanzin]. I searched the Jira before creating a new one but seems like I missed it. My bad. This definitely looks like duplicate of https://issues.apache.org/jira/browse/SPARK-8617. Let's see what [~tgraves] would say. > Spark history server file cleaner excludes in-progress files > > > Key: SPARK-18733 > URL: https://issues.apache.org/jira/browse/SPARK-18733 > Project: Spark > Issue Type: Bug > Components: Web UI >Affects Versions: 2.0.2 >Reporter: Ergin Seyfe > > When we restart history server, it does spend a lot of time to load/replay > incomplete applications which mean the inprogress log files in the log folder. > We have already enabled "spark.history.fs.cleaner.enabled" but seems like > it's skipping the inprogress files. > I checked the log folder and saw that there are many old orphan files. > Probably files left over due to spark-driver failures or OOMs. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files
[ https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15726084#comment-15726084 ] Marcelo Vanzin commented on SPARK-18733: This is basically a dupe of SPARK-8617; there have been discussions about this in the past but I don't see them in that bug. Maybe [~tgraves] wants to chime in. > Spark history server file cleaner excludes in-progress files > > > Key: SPARK-18733 > URL: https://issues.apache.org/jira/browse/SPARK-18733 > Project: Spark > Issue Type: Bug > Components: Web UI >Affects Versions: 2.0.2 >Reporter: Ergin Seyfe > > When we restart history server, it does spend a lot of time to load/replay > incomplete applications which mean the inprogress log files in the log folder. > We have already enabled "spark.history.fs.cleaner.enabled" but seems like > it's skipping the inprogress files. > I checked the log folder and saw that there are many old orphan files. > Probably files left over due to spark-driver failures or OOMs. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files
[ https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15724362#comment-15724362 ] Apache Spark commented on SPARK-18733: -- User 'seyfe' has created a pull request for this issue: https://github.com/apache/spark/pull/16165 > Spark history server file cleaner excludes in-progress files > > > Key: SPARK-18733 > URL: https://issues.apache.org/jira/browse/SPARK-18733 > Project: Spark > Issue Type: Bug > Components: Web UI >Affects Versions: 2.0.2 >Reporter: Ergin Seyfe > > When we restart history server, it does spend a lot of time to load/replay > incomplete applications which mean the inprogress log files in the log folder. > We have already enabled "spark.history.fs.cleaner.enabled" but seems like > it's skipping the inprogress files. > I checked the log folder and saw that there are many old orphan files. > Probably files left over due to spark-driver failures or OOMs. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org