[
https://issues.apache.org/jira/browse/NUTCH-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14904727#comment-14904727
]
ASF GitHub Bot commented on NUTCH-2111:
---------------------------------------
GitHub user kwhitehall opened a pull request:
https://github.com/apache/nutch/pull/64
fix for NUTCH-2111 contributed by kwhitehall
Further investigation showed that changing the temporary path does not get
rid of the tmp files that eat up space. Further, if a selenium grid is
utilized, the location chosen on a given node may not be available on all
nodes. As such, it is best to stay with the default /tmp location and handle
deleting the files there instead. The patch submitted does this.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/kwhitehall/nutch NUTCH-2111
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nutch/pull/64.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #64
----
commit 5ef50e123a0e00eb5b55cdc44dc82c8be8fbf565
Author: Kim Whitehall <[email protected]>
Date: 2015-09-23T16:16:07Z
fix for NUTCH-2111 contributed by kwhitehall
----
> Set temporary file location for selenium tmp files
> --------------------------------------------------
>
> Key: NUTCH-2111
> URL: https://issues.apache.org/jira/browse/NUTCH-2111
> Project: Nutch
> Issue Type: Improvement
> Components: plugin
> Affects Versions: 1.10
> Reporter: Kim Whitehall
>
> When using the selenium plug in (local mode or selenium grid), a large # tmp
> files can be generated for each webdriver executed. The default location for
> selenium is the /tmp library. Thus very quickly (and inadvertently) the
> nutch-selenium interaction can lead to filesystem issues.
> I propose to include a config in nutch-default.xml that allows users to
> specify where they want the selenium tmp files to be written.
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)