Solr tika and extracting formatting info

2009-07-11 Thread S.Selvam
all the formatting information of the file posted to Tika, not only the data. Is that possible with Tika, or do I need to use another module? I would like to get your suggestions regarding this. -- Yours, S.Selvam

Solr tika and posting .pst files

2009-07-20 Thread S.Selvam
to configure Tika to be able to handle the .pst format? I would like to hear your suggestions. Note: 1) I use VB.NET as a front-end tool. 2) The contents of other file types are properly mapped to the content field. -- Yours, S.Selvam

solr-duplicate post management

2009-01-11 Thread S.Selvam Siva
of the same id (unique field) is posted. I want to do this by modifying the Solr source. Which file do I need to modify so that I can get the above details in the log? I tried DirectUpdateHandler2.java (which removes the duplicate entries), but my efforts were in vain. -- Yours, S.Selvam

Re: solr-duplicate post management

2009-01-22 Thread S.Selvam Siva
://wiki.apache.org/solr/Deduplication https://issues.apache.org/jira/browse/SOLR-799 -Hoss Thank you for your response. I will try it out. -- Yours, S.Selvam

Re: solr-duplicate post management

2009-01-24 Thread S.Selvam Siva
On Thu, Jan 22, 2009 at 2:33 PM, S.Selvam Siva s.selvams...@gmail.com wrote: On Thu, Jan 22, 2009 at 7:12 AM, Chris Hostetter hossman_luc...@fucit.org wrote: : what I need is to log the existing urlid and the new urlid (of course both will : not be the same) when a .xml file of the same id (unique

Re: solr-duplicate post management

2009-01-26 Thread S.Selvam Siva
modifying DirectUpdateHandler2 a bit easy. Further, given the current importance of finding duplicate posts, I made the above modification to DirectUpdateHandler2. Note: for your information, we are committing every 1000 posts. -- Yours, S.Selvam
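
The duplicate logging discussed in this thread can be illustrated outside Solr with a small sketch. This is a standalone Python illustration of the idea only, not Solr's DirectUpdateHandler2 code and not the modification described above; the class `DuplicateLogger`, its `post` and `commit` methods, and the in-memory dictionaries are all hypothetical. It assumes each post carries a unique id and a urlid, and that commits happen after a fixed number of posts (1000 in the thread).

```python
import logging

logger = logging.getLogger("duplicate_posts")


class DuplicateLogger:
    """Hypothetical in-memory stand-in for an index keyed by a unique id."""

    def __init__(self, commit_every=1000):
        self.commit_every = commit_every  # the thread mentions committing every 1000 posts
        self.committed = {}               # unique id -> urlid, visible after a commit
        self.pending = {}                 # unique id -> urlid, posted but not yet committed
        self.duplicates = []              # (unique_id, existing_urlid, new_urlid)
        self.posts_since_commit = 0

    def post(self, unique_id, urlid):
        # Look in uncommitted posts first, then in committed ones, so that
        # duplicates arriving between two commits are also noticed.
        if unique_id in self.pending:
            existing = self.pending[unique_id]
        elif unique_id in self.committed:
            existing = self.committed[unique_id]
        else:
            existing = None

        if existing is not None:
            logger.warning(
                "duplicate id %s: existing urlid=%s, new urlid=%s",
                unique_id, existing, urlid,
            )
            self.duplicates.append((unique_id, existing, urlid))

        # The newer post replaces the older one, as with a unique-key overwrite.
        self.pending[unique_id] = urlid
        self.posts_since_commit += 1
        if self.posts_since_commit >= self.commit_every:
            self.commit()

    def commit(self):
        # Make pending posts visible and reset the counter.
        self.committed.update(self.pending)
        self.pending.clear()
        self.posts_since_commit = 0
```

The sketch checks pending posts as well as committed ones because, with a commit only every 1000 posts, a duplicate can arrive before the earlier document has been committed.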