[ 
https://issues.apache.org/jira/browse/CONNECTORS-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13036784#comment-13036784
 ] 

Erlend GarĂ¥sen commented on CONNECTORS-200:
-------------------------------------------

I checked out the latest from trunk and did a test crawl with documents I know 
will return a TikaException due to the following Tika bug:
https://issues.apache.org/jira/browse/TIKA-418

The job ended successfully and MCF did not try to fetch the affected documents 
over and over again even though TikaExceptions were thrown. In other words, it 
seems to work as it should now.

> Solr connector should treat TikaException the same as a 400 response
> --------------------------------------------------------------------
>
>                 Key: CONNECTORS-200
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-200
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: Lucene/SOLR connector
>    Affects Versions: ManifoldCF 0.1, ManifoldCF 0.2, ManifoldCF 0.3
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>
> Solr connector should treat TikaException the same as a 400 response, which 
> is to skip the document.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to