[ 
https://issues.apache.org/jira/browse/NUTCH-1552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13621348#comment-13621348
 ] 

Markus Jelsma commented on NUTCH-1552:
--------------------------------------

At this stage contentType should never be null! What plugins did you use? There 
may be a more serious bug upstream; a document without contentType should not 
exist if you ask me and i've never seen it happen in the billions of records we 
parsed.
                
> possibility of a NPE in index-more plugin
> -----------------------------------------
>
>                 Key: NUTCH-1552
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1552
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 2.2
>            Reporter: kaveh minooie
>         Attachments: NUTCH-1552.patch
>
>
> in line 203 of src/java/org/apache/nutch/indexer/more/MoreIndexingFilter.java 
> the code attempt to read from variable contentType even thou it is possible 
> for it to be null. for me, it happened when I tried to index  
> http://www.pscars.com/ 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to