[ 
https://issues.apache.org/jira/browse/SOLR-2875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064691#comment-14064691
 ] 

Shinichiro Abe commented on SOLR-2875:
--------------------------------------

Yes, the binary don't always have any pdf files. This data import will be 
completed successfully  on the source.

> Incorrect url of tika-data-config.xml in example-DIH
> ----------------------------------------------------
>
>                 Key: SOLR-2875
>                 URL: https://issues.apache.org/jira/browse/SOLR-2875
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - DataImportHandler
>    Affects Versions: 4.0-ALPHA
>         Environment: solr boot:java 
> -Dsolr.solr.home=~/trunk/solr/example/example-DIH/solr/tika -jar start.jar 
>            Reporter: Shinichiro Abe
>            Assignee: Koji Sekiguchi
>            Priority: Trivial
>             Fix For: 3.5, 4.0-ALPHA
>
>         Attachments: SOLR-2875.patch
>
>
> The specified url in tika-data-config.xml is not correct path. So when 
> running full-import, exception is thrown.
> {quote}
> 2011/11/04 16:48:26 org.apache.solr.common.SolrException log
> ?v???I: Full Import failed:java.lang.RuntimeException: 
> org.apache.solr.handler.dataimport.DataImportHandlerException: 
> java.lang.RuntimeException: java.io.FileNotFoundException: Could not find 
> file: ../contrib/extraction/src/test/resources/solr-word.pdf
>       at 
> org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:261)
>       at 
> org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372)
>  :
>  :
> Caused by: java.io.FileNotFoundException: Could not find file: 
> ../contrib/extraction/src/test/resources/solr-word.pdf
>       at 
> org.apache.solr.handler.dataimport.FileDataSource.getFile(FileDataSource.java:110)
> {quote}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to