MalformedURLException logging in TikaEntityProcessor 
-----------------------------------------------------

                 Key: SOLR-2903
                 URL: https://issues.apache.org/jira/browse/SOLR-2903
             Project: Solr
          Issue Type: Improvement
          Components: contrib - DataImportHandler
            Reporter: Okke Klein
            Priority: Minor


When using TikaEntityProcessor to fetch only certain documents, the logging is 
filled with SEVERE exceptions.

There should be a way to handle this exception with a lot less logging.

17-nov-2011 15:23:34 org.apache.solr.handler.dataimport.BinURLDataSource getData
SEVERE: Exception thrown while getting data
java.net.MalformedURLException: no protocol: null
        at java.net.URL.<init>(URL.java:567)
        at java.net.URL.<init>(URL.java:464)
        at java.net.URL.<init>(URL.java:413)
        at 
org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:80)
        at 
org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:37)
        at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:102)
        at 
org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:237)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:642)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
        at 
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:311)
        at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:222)
        at 
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372)
        at 
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:440)
        at 
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:421)
17-nov-2011 15:23:34 org.apache.solr.common.SolrException log
SEVERE: Exception in entity : 
tika:org.apache.solr.handler.dataimport.DataImportHandlerException: Exception 
in invoking url null Processing Document # 1445
        at 
org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72)
        at 
org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:88)
        at 
org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:37)
        at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:102)
        at 
org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:237)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:642)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
        at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:668)
        at 
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:311)
        at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:222)
        at 
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372)
        at 
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:440)
        at 
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:421)
Caused by: java.net.MalformedURLException: no protocol: null
        at java.net.URL.<init>(URL.java:567)
        at java.net.URL.<init>(URL.java:464)
        at java.net.URL.<init>(URL.java:413)
        at 
org.apache.solr.handler.dataimport.BinURLDataSource.getData(BinURLDataSource.java:80)
        ... 13 more

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to