[
https://issues.apache.org/jira/browse/NUTCH-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-1030:
---------------------------------
Patch Info: [Patch Available]
> WebgraphDB program requires manually added directories
> ------------------------------------------------------
>
> Key: NUTCH-1030
> URL: https://issues.apache.org/jira/browse/NUTCH-1030
> Project: Nutch
> Issue Type: Bug
> Affects Versions: 1.4, 2.0
> Reporter: Markus Jelsma
> Priority: Minor
> Fix For: 1.4, 2.0
>
> Attachments: NUTCH-1030-1.4-1.patch
>
>
> The WebgraphDB program exits with only a fatal error in the log, because it
> does not create all the directories it requires.
> First run:
> {code}
> 2011-07-06 01:41:32,150 FATAL webgraph.WebGraph - WebGraph:
> java.io.IOException: No input paths specified in job
> {code}
> Second attempt:
> {code}
> 2011-07-06 01:23:20,626 FATAL webgraph.WebGraph - WebGraph:
> org.apache.hadoop.mapred.InvalidInputException: Input path does not exist:
> file:/home/markus/projects/apache/nutch/branches/branch-1.4/runtime/local/crawl/webgraphdb/outlinks
> {code}
> The first run exits but creates the directory. The second attempt never
> creates the required directory.
> * program must create the directory if it doesn't exist
> * program must write such errors to stdout
> * program must write success output to stdout
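The three requirements above could be sketched roughly as follows. This is only an illustration, not the attached patch: the class WebGraphDirs and the method ensureDirectories are hypothetical names, and java.nio.file stands in for Hadoop's FileSystem so the example runs on its own (with Hadoop, FileSystem.exists and FileSystem.mkdirs would presumably play the same role). Only the "outlinks" directory name is taken from the log above.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Nutch. It uses java.nio.file instead of
// Hadoop's FileSystem so that the sketch is self-contained.
class WebGraphDirs {

    // Creates every missing subdirectory under base and returns the ones
    // that were actually created (empty if all of them already existed).
    static List<Path> ensureDirectories(Path base, String... names) throws IOException {
        List<Path> created = new ArrayList<>();
        for (String name : names) {
            Path dir = base.resolve(name);
            if (!Files.isDirectory(dir)) {
                // Also creates missing parents, e.g. the webgraphdb directory itself.
                Files.createDirectories(dir);
                created.add(dir);
            }
        }
        return created;
    }

    public static void main(String[] args) {
        // The default path is an assumption modelled on the path in the log.
        Path base = Paths.get(args.length > 0 ? args[0] : "crawl/webgraphdb");
        try {
            List<Path> created = ensureDirectories(base, "outlinks");
            // Success is reported on stdout, not only in the log.
            System.out.println("WebGraph: directories ready under " + base
                + " (" + created.size() + " created)");
        } catch (IOException e) {
            // Errors are reported on stdout as well.
            System.out.println("WebGraph: could not create directories: " + e);
            System.exit(1);
        }
    }
}
```

Running the check before the job is configured would mean the first and second attempts behave the same way, since neither depends on a directory left behind by an earlier run.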
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira