[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files

2016-12-06 Thread Thomas Graves (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15726701#comment-15726701
 ] 

Thomas Graves commented on SPARK-18733:
---

oh nevermind its looking at lastupdated time. 

> Spark history server file cleaner excludes in-progress files
> 
>
> Key: SPARK-18733
> URL: https://issues.apache.org/jira/browse/SPARK-18733
> Project: Spark
>  Issue Type: Bug
>  Components: Web UI
>Affects Versions: 2.0.2
>Reporter: Ergin Seyfe
>
> When we restart history server, it does spend a lot of time to load/replay  
> incomplete applications which mean the inprogress log files in the log folder.
> We have already enabled "spark.history.fs.cleaner.enabled" but  seems like 
> it's skipping the inprogress files.
> I checked the log folder and saw that there are many old orphan files. 
> Probably files left over due to spark-driver failures or OOMs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files

2016-12-06 Thread Thomas Graves (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15726671#comment-15726671
 ] 

Thomas Graves commented on SPARK-18733:
---

yes looks like a dup but I'm not sure on current solution. max age could easily 
be set to say 1 week and a job could easily run longer then 1 week. I'll post 
comment on PR about that though.

> Spark history server file cleaner excludes in-progress files
> 
>
> Key: SPARK-18733
> URL: https://issues.apache.org/jira/browse/SPARK-18733
> Project: Spark
>  Issue Type: Bug
>  Components: Web UI
>Affects Versions: 2.0.2
>Reporter: Ergin Seyfe
>
> When we restart history server, it does spend a lot of time to load/replay  
> incomplete applications which mean the inprogress log files in the log folder.
> We have already enabled "spark.history.fs.cleaner.enabled" but  seems like 
> it's skipping the inprogress files.
> I checked the log folder and saw that there are many old orphan files. 
> Probably files left over due to spark-driver failures or OOMs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files

2016-12-06 Thread Ergin Seyfe (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15726292#comment-15726292
 ] 

Ergin Seyfe commented on SPARK-18733:
-

Hi [~vanzin]. I searched the Jira before creating a new one but seems like I 
missed it. My bad. This definitely looks like duplicate of 
https://issues.apache.org/jira/browse/SPARK-8617. Let's see what [~tgraves] 
would say.

> Spark history server file cleaner excludes in-progress files
> 
>
> Key: SPARK-18733
> URL: https://issues.apache.org/jira/browse/SPARK-18733
> Project: Spark
>  Issue Type: Bug
>  Components: Web UI
>Affects Versions: 2.0.2
>Reporter: Ergin Seyfe
>
> When we restart history server, it does spend a lot of time to load/replay  
> incomplete applications which mean the inprogress log files in the log folder.
> We have already enabled "spark.history.fs.cleaner.enabled" but  seems like 
> it's skipping the inprogress files.
> I checked the log folder and saw that there are many old orphan files. 
> Probably files left over due to spark-driver failures or OOMs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files

2016-12-06 Thread Marcelo Vanzin (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15726084#comment-15726084
 ] 

Marcelo Vanzin commented on SPARK-18733:


This is basically a dupe of SPARK-8617; there have been discussions about this 
in the past but I don't see them in that bug. Maybe [~tgraves] wants to chime 
in.

> Spark history server file cleaner excludes in-progress files
> 
>
> Key: SPARK-18733
> URL: https://issues.apache.org/jira/browse/SPARK-18733
> Project: Spark
>  Issue Type: Bug
>  Components: Web UI
>Affects Versions: 2.0.2
>Reporter: Ergin Seyfe
>
> When we restart history server, it does spend a lot of time to load/replay  
> incomplete applications which mean the inprogress log files in the log folder.
> We have already enabled "spark.history.fs.cleaner.enabled" but  seems like 
> it's skipping the inprogress files.
> I checked the log folder and saw that there are many old orphan files. 
> Probably files left over due to spark-driver failures or OOMs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-18733) Spark history server file cleaner excludes in-progress files

2016-12-05 Thread Apache Spark (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-18733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15724362#comment-15724362
 ] 

Apache Spark commented on SPARK-18733:
--

User 'seyfe' has created a pull request for this issue:
https://github.com/apache/spark/pull/16165

> Spark history server file cleaner excludes in-progress files
> 
>
> Key: SPARK-18733
> URL: https://issues.apache.org/jira/browse/SPARK-18733
> Project: Spark
>  Issue Type: Bug
>  Components: Web UI
>Affects Versions: 2.0.2
>Reporter: Ergin Seyfe
>
> When we restart history server, it does spend a lot of time to load/replay  
> incomplete applications which mean the inprogress log files in the log folder.
> We have already enabled "spark.history.fs.cleaner.enabled" but  seems like 
> it's skipping the inprogress files.
> I checked the log folder and saw that there are many old orphan files. 
> Probably files left over due to spark-driver failures or OOMs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org