[
https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14496755#comment-14496755
]
Michael Joyce commented on NUTCH-1988:
--------------------------------------
Hi folks. Here's an example output run of this.
{code}
[mjjoyce@machine local]$ bin/nutch dump -outputDir ./foodir -segment
../local_elasticsearch_testt/crawl/segments/
[mjjoyce@machine local]$ bin/nutch dump -flatdir -outputDir ./foodir2 -segment
../local_elasticsearch_testt/crawl/segments/
[mjjoyce@machine local]$ ls -R foodir
foodir:
8f f8
foodir/8f:
a7
foodir/8f/a7:
8d84f847f7310620a9edc4327bbfc133_.html
foodir/f8:
df
foodir/f8/df:
fec7849283af7a0adc77eddefb242b6e_.html
[mjjoyce@machine local]$ ls -R foodir2
foodir2:
8d84f847f7310620a9edc4327bbfc133_.html fec7849283af7a0adc77eddefb242b6e_.html
[mjjoyce@machine local]$
{code}
> Make nested output directory dump optional
> ------------------------------------------
>
> Key: NUTCH-1988
> URL: https://issues.apache.org/jira/browse/NUTCH-1988
> Project: Nutch
> Issue Type: Improvement
> Components: dumpers
> Affects Versions: 1.9
> Reporter: Michael Joyce
> Priority: Minor
> Fix For: 1.10
>
>
> NUTCH-1957 added nested directories to the bin/nutch dump output to help
> avoid naming conflicts in output files. It would be nice to be able to
> specify that you want the older flat directory output as an optional
> parameter.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)