[
https://issues.apache.org/jira/browse/NUTCH-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1521:
----------------------------------------
Attachment: NUTCH-1521-trunk.patch
Hi Lufeng, I attach a unified patch for your code which I've tested locally and
would like to commit. I added a brief class description and formatted it as per
the Nutch project formatting.
You indicated in the issue description that you may want to add some WARN
logging... this is up to you.
Are you able to provide a similar patch for trunk?
Thank you for the contribution and good catch!
> CrawlDbFilter pass null url to urlNormailzers
> ---------------------------------------------
>
> Key: NUTCH-1521
> URL: https://issues.apache.org/jira/browse/NUTCH-1521
> Project: Nutch
> Issue Type: Bug
> Affects Versions: 1.7
> Reporter: lufeng
> Assignee: lufeng
> Priority: Trivial
> Fix For: 1.7
>
> Attachments: CrawlDbFilter_v1.patch, NUTCH-1521-trunk.patch,
> TestCrawlDbFilter.java
>
>
> urlNormalizers will get null url if we set CRAWLDB_PURGE_404, and it will
> throw NullPointerException. and the WARN Log will output something like this
> "Skipping null NullPointerException".
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira