[
https://issues.apache.org/jira/browse/NUTCH-956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-956:
---------------------------------------
Attachment: NUTCH-956v2.patch
new patch for this issue. It attempts to obtain the contentType by a number of
means. Firstly from the HttpHeaders, then the page contentType. DEBUG logging
accompanies these attempts.
> solrindex issues
> ----------------
>
> Key: NUTCH-956
> URL: https://issues.apache.org/jira/browse/NUTCH-956
> Project: Nutch
> Issue Type: Bug
> Components: indexer
> Affects Versions: nutchgora
> Reporter: Alexis
> Fix For: 2.2
>
> Attachments: NUTCH-956.patch, NUTCH-956v2.patch, solr.patch,
> solr.patch2
>
>
> I ran into a few caveats with solrindex command trying to index documents.
> Please refer to
> http://techvineyard.blogspot.com/2010/12/build-nutch-20.html#solrindex that
> describes my tests.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira