[ http://issues.apache.org/jira/browse/NUTCH-190?page=comments#action_12364151 ]

Chris A. Mattmann commented on NUTCH-190:
-----------------------------------------

+1

I think that this is a needed patch.

> ParseUtil drops reason for failed parse
> ---------------------------------------
>
>          Key: NUTCH-190
>          URL: http://issues.apache.org/jira/browse/NUTCH-190
>      Project: Nutch
>         Type: Bug
>   Components: fetcher
>     Versions: 0.8-dev
>  Environment: linux
>     Reporter: [EMAIL PROTECTED]
>     Priority: Minor
>  Attachments: ParseUtil_drops_failure_reason.patch
>
> Doing the below:
>     Parse parse;
>     ParseStatus parseStatus;
>     try {
>       parse = ParseUtil.parse(content);
>       parseStatus = parse.getData().getStatus();
>     } catch (Exception e) {
>       parseStatus = new ParseStatus(e);
>     }
>     if (!parseStatus.isSuccess()) {
>       LOG.warning("Error parsing: " + url + ": " + parseStatus);
>       parse = null;
>     }
> ...on failure, the LOG.warning never prints out the reason for failure.  
> Here's an example: "Error parsing: 
> http://www.dfrc.nasa.gov/DTRS/1967/PDF/H-478.pdf: failed(0,0)".
> ParseUtil is dropping messages lovingly crafted by parsers.
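
For readers who want to see the general failure mode in isolation, here is a minimal sketch. It does not use the real Nutch classes and is not taken from the attached patch: Status, pdfParser, parseDroppingReason and parseKeepingReason are hypothetical stand-ins, and the failure message is invented for illustration. It only shows how a wrapper that builds a fresh failure status loses the message the parser produced, while a wrapper that returns the parser's own status keeps it.

```java
// Hypothetical sketch; none of these names are the real Nutch API.
public class ParseStatusSketch {

    // Stand-in for a parse status that carries optional reason strings.
    static final class Status {
        final boolean success;
        final String[] args; // human-readable reason(s) for the outcome

        Status(boolean success, String... args) {
            this.success = success;
            this.args = args;
        }

        @Override
        public String toString() {
            String base = success ? "success" : "failed";
            return args.length == 0 ? base : base + ": " + String.join("; ", args);
        }
    }

    // Hypothetical parser that fails and explains why (message is invented).
    static Status pdfParser(String url) {
        return new Status(false, "Can't decrypt PDF document " + url);
    }

    // Lossy wrapper: on failure it builds a new, empty failure status,
    // so the parser's reason never reaches the caller.
    static Status parseDroppingReason(String url) {
        Status s = pdfParser(url);
        return s.success ? s : new Status(false);
    }

    // Preserving wrapper: returns the parser's status untouched.
    static Status parseKeepingReason(String url) {
        return pdfParser(url);
    }

    public static void main(String[] args) {
        String url = "http://example.org/doc.pdf"; // placeholder URL
        System.out.println("Error parsing: " + url + ": " + parseDroppingReason(url));
        System.out.println("Error parsing: " + url + ": " + parseKeepingReason(url));
    }
}
```

Under these assumptions, only the second log line carries a reason; the first shows a bare failure, similar in spirit to the "failed(0,0)" output quoted above.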
