[
https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14496751#comment-14496751
]
ASF GitHub Bot commented on NUTCH-1988:
---------------------------------------
GitHub user MJJoyce opened a pull request:
https://github.com/apache/nutch/pull/19
NUTCH-1988 - Add optional flat directory flag to dump command
- Add optional flatdir flag to dump command so that a user can dump
their crawl data to a flat directory instead of the nested structure
added in NUTCH-1957.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/MJJoyce/nutch NUTCH-1988
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nutch/pull/19.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #19
----
commit 40ca3e576781328b9b5afc22548a93bfd3df75bd
Author: Michael Joyce <[email protected]>
Date: 2015-04-15T19:19:07Z
NUTCH-1988 - Add optional flat directory flag to dump command
- Add optional flatdir flag to dump command so that a user can dump
their crawl data to a flat directory instead of the nested structure
added in NUTCH-1957.
----
> Make nested output directory dump optional
> ------------------------------------------
>
> Key: NUTCH-1988
> URL: https://issues.apache.org/jira/browse/NUTCH-1988
> Project: Nutch
> Issue Type: Improvement
> Components: dumpers
> Affects Versions: 1.9
> Reporter: Michael Joyce
> Priority: Minor
> Fix For: 1.10
>
>
> NUTCH-1957 added nested directories to the bin/nutch dump output to help
> avoid naming conflicts in output files. It would be nice to be able to
> specify that you want the older flat directory output as an optional
> parameter.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)