all the formatting information of the file posted to Tika not only the data.
Is that possible with Tika? or do i need use any other module ?
I would like to get your suggestions regarding this.
--
Yours,
S.Selvam
to configure Tika to be able to handle .pst format ? ,I would like
to hear your suggestions.
Note:1) I use VB.NET as a front end tool.
2) Other file contents are properly mapped to content field.
--
Yours,
S.Selvam
of same id(unique field) is posted.
I want to make this by modifying the solr source.Which file do i need to
modify so that i could get the above details in log ?
I tried with DirectUpdateHandler2.java(which removes the duplicate
entries),but efforts in vein.
--
Yours,
S.Selvam
://wiki.apache.org/solr/Deduplication
https://issues.apache.org/jira/browse/SOLR-799
-Hoss
Thank you for your response.I will try it out.
--
Yours,
S.Selvam
On Thu, Jan 22, 2009 at 2:33 PM, S.Selvam Siva s.selvams...@gmail.comwrote:
On Thu, Jan 22, 2009 at 7:12 AM, Chris Hostetter hossman_luc...@fucit.org
wrote:
: what i need is ,to log the existing urlid and new urlid(of course both
will
: not be same) ,when a .xml file of same id(unique
modifying DirectUpdateHandler2 bit easy.
Further,for the current importance of finding duplicate post,i made the
above modification to DirectUpdateHandler2.
Note:And for your information,we are commiting for every 1000 posts.
--
Yours,
S.Selvam