Hi,
I am using Solr 3.4 and have configured my DataImportHandler
to pull some data from MySQL and also to index some rich documents from
disk.
This is the part of my db-data-config file where I index the rich-text
documents:
<entity name="resume" dataSource="ds-db"
        query="Select name,js_login_id div 25000 as dir from js_resumes
               where js_login_id='${js_logins.id}' and is_primary = 1
               and deleted=0 and mask_cv != 1"
        pk="resume_name"
        deltaQuery="select js_login_id from js_resumes
                    where modified > '${dataimporter.last_index_time}'
                    and is_primary = 1 and deleted=0"
        parentDeltaQuery="select jsl.id as id
                          from service_request_histories srh, service_requests sr,
                               js_login_screenings jsls, js_logins jsl
                          where jsl.status IN(1,2)
                          and srh.service_request_id = sr.id
                          and jsl.id=jsls.js_login_id
                          and srh.status in ('8','43')
                          and jsls.id=srh.sid
                          and date(srh.created) &lt; date_sub(now(),interval 2 day)
                          and jsl.id = '${js_resumes.js_login_id}'">
    <entity processor="TikaEntityProcessor"
            tikaConfig="tika-config.xml"
            url="http://localhost/resumes-new/resumes${resume.dir}/${js_logins.id}/${resume.name}"
            dataSource="ds-file"
            format="text">
        <field column="text" name="resume" />
    </entity>
</entity>
But after running for some time I get the following error in my error log.
It looks like a missing-method error (NoSuchMethodError) in POI, so I
suspect a jar version mismatch. Can anyone tell me which POI jar version
works with Tika 0.6? Currently I have poi-3.7.jar.
The error I am getting is this:
SEVERE: Exception while processing: js_logins document : SolrInputDocument[{id=id(1.0)={100984}, complete_mobile_number=complete_mobile_number(1.0)={+91 9600067575}, emailid=emailid(1.0)={[email protected]}, full_name=full_name(1.0)={Venkat Ryali}}]:org.apache.solr.handler.dataimport.DataImportHandlerException: java.lang.NoSuchMethodError: org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(Lorg/openxmlformats/schemas/wordprocessingml/x2006/main/CTP;Lorg/apache/poi/xwpf/usermodel/XWPFDocument;)V
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:669)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:622)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:622)
    at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:268)
    at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:187)
    at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:359)
    at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:427)
    at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:408)
Caused by: java.lang.NoSuchMethodError: org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(Lorg/openxmlformats/schemas/wordprocessingml/x2006/main/CTP;Lorg/apache/poi/xwpf/usermodel/XWPFDocument;)V
    at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator$MyXWPFParagraph.<init>(XWPFWordExtractorDecorator.java:163)
    at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator$MyXWPFParagraph.<init>(XWPFWordExtractorDecorator.java:161)
    at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.extractTableContent(XWPFWordExtractorDecorator.java:140)
    at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.buildXHTML(XWPFWordExtractorDecorator.java:91)
    at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:69)
    at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:51)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:120)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:101)
    at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:128)
    at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:238)
    at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:596)
    ... 7 more
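In case it helps narrow this down, here is a rough sketch of how one could check which jar actually supplies the class named in the NoSuchMethodError. It would have to run with the same classpath as Solr to be meaningful. The class name JarLocator and the helper locate are made up for illustration; only standard JDK calls are used.

```java
import java.security.CodeSource;

// Hypothetical helper (not part of Solr, Tika, or POI): reports where a class is loaded from.
class JarLocator {

    // Returns the jar/directory URL that supplies the class, or a short note if it cannot be determined.
    static String locate(String className) {
        try {
            // 'false' avoids running static initializers; we only want the class's origin.
            Class<?> c = Class.forName(className, false, JarLocator.class.getClassLoader());
            CodeSource src = c.getProtectionDomain().getCodeSource();
            if (src == null || src.getLocation() == null) {
                return "(no code source, likely a JDK bootstrap class)";
            }
            return src.getLocation().toString();
        } catch (ClassNotFoundException | LinkageError e) {
            return "(not on classpath)";
        }
    }

    public static void main(String[] args) {
        // Default to the class from the stack trace; another class name can be passed as an argument.
        String name = args.length > 0 ? args[0] : "org.apache.poi.xwpf.usermodel.XWPFParagraph";
        System.out.println(name + " -> " + locate(name));
    }
}
```

If the printed location is not the POI jar I expect, or if more than one POI jar sits in the lib directories, that would point to a version conflict rather than a missing class.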
Please help ASAP.
--
“The more you are willing to accept responsibility for your actions, the
more credibility you will have”
Anuj Kumar