WebgraphDB program requires manually added directories
------------------------------------------------------
Key: NUTCH-1030
URL: https://issues.apache.org/jira/browse/NUTCH-1030
Project: Nutch
Issue Type: Bug
Affects Versions: 1.4, 2.0
Reporter: Markus Jelsma
Priority: Minor
Fix For: 1.4, 2.0
The WebgraphDB program exists with only a fatal error in the log. Because it
doesn't make all directories it required
First run:
{code}
2011-07-06 01:41:32,150 FATAL webgraph.WebGraph - WebGraph:
java.io.IOException: No input paths specified in job
{code}
second attempt:
{code}
2011-07-06 01:23:20,626 FATAL webgraph.WebGraph - WebGraph:
org.apache.hadoop.mapred.InvalidInputException: Input path does not exist:
file:/home/markus/projects/apache/nutch/branches/branch-1.4/runtime/local/crawl/webgraphdb/outlinks
{code}
The first run exists but makes the directory. The second attempt will never
create the required directory.
* program must create the directory if it doesn't exist
* program must write such errors to stdout
* program must write success output to stdout
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira