MalformedURLException logging in TikaEntityProcessor
-----------------------------------------------------

                 Key: SOLR-2903
                 URL: https://issues.apache.org/jira/browse/SOLR-2903
             Project: Solr
          Issue Type: Improvement
          Components: contrib - DataImportHandler
            Reporter: Okke Klein
            Priority: Minor

When using TikaEntityProcessor to fetch only certain documents, the logging is filled with SEVERE exceptions. There should be a way to handle this exception with a lot less logging.

17-nov-2011 15:23:34 org.apache.solr.handler.dataimport.BinURLDataSource getData
SEVERE: Exception thrown while getting data
java.net.MalformedURLException: no protocol: null
    at java.net.URL.<init>(URL.java:567)
    at java.net.URL.<init>(URL.java:464)
    at java.net.URL.<init>(URL.java:413)
    at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:80)
    at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:37)
    at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:102)
    at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:237)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:642)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
    at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:311)
    at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:222)
    at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372)
    at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:440)
    at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:421)
17-nov-2011 15:23:34 org.apache.solr.common.SolrException log
SEVERE: Exception in entity : tika:org.apache.solr.handler.dataimport.DataImportHandlerException: Exception in invoking url null Processing Document # 1445
    at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72)
    at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:88)
    at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:37)
    at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:102)
    at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:237)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:642)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
    at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:311)
    at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:222)
    at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372)
    at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:440)
    at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:421)
Caused by: java.net.MalformedURLException: no protocol: null
    at java.net.URL.<init>(URL.java:567)
    at java.net.URL.<init>(URL.java:464)
    at java.net.URL.<init>(URL.java:413)
    at org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:80)
    ... 13 more

--
This message is automatically generated by JIRA.