[
https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15000923#comment-15000923
]
Michael Joyce commented on NUTCH-2165:
--------------------------------------
Note, the diff looks massive here. This is really just adding an extra loop
over the parts directories in each segment directory. The tool could probably
use a bit of cleanup love, but we can address that in a later patch.
> FileDumper Util hard codes part-# folder name
> ---------------------------------------------
>
> Key: NUTCH-2165
> URL: https://issues.apache.org/jira/browse/NUTCH-2165
> Project: Nutch
> Issue Type: Bug
> Components: tool
> Affects Versions: 2.3, 1.10
> Reporter: Michael Joyce
> Assignee: Michael Joyce
> Fix For: 2.4, 1.11
>
> Attachments: NUTCH-2165_joyce_11Nov2015.patch
>
>
> Hi folks, [~lewismc] and I were just discussing this off list. It seems that
> the part-##### folders seem to be hard coded to part-00000 in the [FileDumper
> utility|https://github.com/apache/nutch/blob/trunk/src/java/org/apache/nutch/tools/FileDumper.java#L166-L167]
> which could prove problematic.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)