[ https://issues.apache.org/jira/browse/SOLR-2875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064691#comment-14064691 ]
Shinichiro Abe commented on SOLR-2875: -------------------------------------- Yes, the binary don't always have any pdf files. This data import will be completed successfully on the source. > Incorrect url of tika-data-config.xml in example-DIH > ---------------------------------------------------- > > Key: SOLR-2875 > URL: https://issues.apache.org/jira/browse/SOLR-2875 > Project: Solr > Issue Type: Bug > Components: contrib - DataImportHandler > Affects Versions: 4.0-ALPHA > Environment: solr boot:java > -Dsolr.solr.home=~/trunk/solr/example/example-DIH/solr/tika -jar start.jar > Reporter: Shinichiro Abe > Assignee: Koji Sekiguchi > Priority: Trivial > Fix For: 3.5, 4.0-ALPHA > > Attachments: SOLR-2875.patch > > > The specified url in tika-data-config.xml is not correct path. So when > running full-import, exception is thrown. > {quote} > 2011/11/04 16:48:26 org.apache.solr.common.SolrException log > ?v???I: Full Import failed:java.lang.RuntimeException: > org.apache.solr.handler.dataimport.DataImportHandlerException: > java.lang.RuntimeException: java.io.FileNotFoundException: Could not find > file: ../contrib/extraction/src/test/resources/solr-word.pdf > at > org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:261) > at > org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372) > : > : > Caused by: java.io.FileNotFoundException: Could not find file: > ../contrib/extraction/src/test/resources/solr-word.pdf > at > org.apache.solr.handler.dataimport.FileDataSource.getFile(FileDataSource.java:110) > {quote} -- This message was sent by Atlassian JIRA (v6.2#6252) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org