Hi,
Thanks for the reply.
Kindly find the field type schema I am using:
<field name="opennlp_text" type="open_nlp" indexed="true" stored="true"/>
<copyField source="content" dest="opennlp_text"/>
Does the *opennlp_text* field need to be indexed="true"?
<fieldType name="open_nlp" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.OpenNLPTokenizerFactory"
               sentenceModel="opennlp/en-sent.bin"
               tokenizerModel="opennlp/en-token.bin"/>
    <filter class="solr.OpenNLPFilterFactory"
            posTaggerModel="opennlp/en-pos-maxent.bin"/>
    <filter class="solr.OpenNLPLemmatizerFilterFactory"
            dictionary="opennlp/en-lemmatizer.txt"/>
  </analyzer>
</fieldType>
Here the en-lemmatizer.txt file is 7 MB in size. Without lemmatization, the
whole indexing process usually takes 2-3 minutes on average, but with it
indexing has been running for more than 1 hour and has still not finished.
Is the slowdown related to the lemmatizer file?
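To isolate the lemmatizer, I could compare indexing time against a copy of
the field type above with only the lemmatizer filter removed. This is just a
sketch: the name "open_nlp_nolemma" is hypothetical and not part of my
current schema, and everything else is copied from the definition above.

```xml
<!-- Hypothetical comparison field type: same tokenizer and POS filter,
     but no OpenNLPLemmatizerFilterFactory -->
<fieldType name="open_nlp_nolemma" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.OpenNLPTokenizerFactory"
               sentenceModel="opennlp/en-sent.bin"
               tokenizerModel="opennlp/en-token.bin"/>
    <filter class="solr.OpenNLPFilterFactory"
            posTaggerModel="opennlp/en-pos-maxent.bin"/>
  </analyzer>
</fieldType>
```

If indexing with this variant returns to roughly 2-3 minutes, that would
suggest the lemmatizer step is where the time goes.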
Could you please guide me?
Thanks,
Arun