Hi,
I'm getting an error when I import an XML file with DIH.
In this file my <id> is a URL which looks like this:
http://www.example.com/?cp=30_s&st=a&c=655
Apparently the issue is with the "=" character?
Is there any workaround?
Error trace:
rows processed:0 Processing Document # 849
at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72)
at org.apache.solr.handler.dataimport.XPathEntityProcessor.initQuery(XPathEntityProcessor.java:328)
at org.apache.solr.handler.dataimport.XPathEntityProcessor.fetchNextRow(XPathEntityProcessor.java:224)
at org.apache.solr.handler.dataimport.XPathEntityProcessor.nextRow(XPathEntityProcessor.java:204)
at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:238)
at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:591)
at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:617)
at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:267)
at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:186)
at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:353)
at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:411)
at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:392)
Caused by: java.lang.RuntimeException: com.ctc.wstx.exc.WstxUnexpectedCharException: Unexpected character '=' (code 61); expected a semi-colon after the reference for entity 'st'
at [row,col {unknown-source}]: [2,199]
at org.apache.solr.handler.dataimport.XPathRecordReader.streamRecords(XPathRecordReader.java:187)
at org.apache.solr.handler.dataimport.XPathEntityProcessor.initQuery(XPathEntityProcessor.java:318)
... 10 more
Caused by: com.ctc.wstx.exc.WstxUnexpectedCharException: Unexpected character '=' (code 61); expected a semi-colon after the reference for entity 'st'
at [row,col {unknown-source}]: [2,199]
at com.ctc.wstx.sr.StreamScanner.throwUnexpectedChar(StreamScanner.java:648)
at com.ctc.wstx.sr.StreamScanner.parseEntityName(StreamScanner.java:1994)
at com.ctc.wstx.sr.StreamScanner.fullyResolveEntity(StreamScanner.java:1496)
at com.ctc.wstx.sr.BasicStreamReader.skipTokenText(BasicStreamReader.java:3568)
at com.ctc.wstx.sr.BasicStreamReader.skipToken(BasicStreamReader.java:3342)
at com.ctc.wstx.sr.BasicStreamReader.nextFromTree(BasicStreamReader.java:2622)
at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1019)
at org.apache.solr.handler.dataimport.XPathRecordReader$Node.handleStartElement(XPathRecordReader.java:376)
at org.apache.solr.handler.dataimport.XPathRecordReader$Node.parse(XPathRecordReader.java:310)
at org.apache.solr.handler.dataimport.XPathRecordReader$Node.handleStartElement(XPathRecordReader.java:346)
at org.apache.solr.handler.dataimport.XPathRecordReader$Node.parse(XPathRecordReader.java:310)
at org.apache.solr.handler.dataimport.XPathRecordReader$Node.handleStartElement(XPathRecordReader.java:346)
at org.apache.solr.handler.dataimport.XPathRecordReader$Node.parse(XPathRecordReader.java:310)
at org.apache.solr.handler.dataimport.XPathRecordReader$Node.access$200(XPathRecordReader.java:202)
at org.apache.solr.handler.dataimport.XPathRecordReader.streamRecords(XPathRecordReader.java:184)
... 11 more
Apr 13, 2011 4:00:25 PM org.apache.solr.update.DirectUpdateHandler2 rollback
INFO: start rollback
Apr 13, 2011 4:00:25 PM org.apache.solr.update.DirectUpdateHandler2 rollback
INFO: end_rollback
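
A note on the message in the trace: "expected a semi-colon after the reference for entity 'st'" suggests the parser is reading "&st" in the URL as the start of an entity reference, so the character that probably needs attention is the "&" rather than the "=". In XML text a literal "&" has to be written as "&amp;". The sketch below is only an illustration in Python, not DIH code; the <doc> wrapper element and the try_parse helper are made up for the example. It shows the raw URL failing to parse and the escaped one parsing back to the original value.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# The URL from the post; the two '&' characters are what make the XML ill-formed.
url = "http://www.example.com/?cp=30_s&st=a&c=655"


def try_parse(xml_text):
    """Return the text of <id>, or None if the XML is not well-formed.

    Hypothetical helper for this example only.
    """
    try:
        return ET.fromstring(xml_text).findtext("id")
    except ET.ParseError:
        return None


# <doc> is an assumed wrapper element, not the real feed layout.
raw_doc = "<doc><id>" + url + "</id></doc>"
escaped_doc = "<doc><id>" + escape(url) + "</id></doc>"

raw_result = try_parse(raw_doc)          # parse fails on the bare '&'
escaped_result = try_parse(escaped_doc)  # parses, '&amp;' is decoded back to '&'

print(escape(url))
print(raw_result)
print(escaped_result)
```

If the XML file is produced by something you control, escaping the value when the file is written (or wrapping it in a CDATA section) should avoid the error; if it comes from a third party, the file would need to be fixed before DIH reads it.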