Hi,

Currently I am indexing documents using Solr, by directly adding files as
'req.addFile(fi);' or by sending the content of the file like
'req.addContentStream(stream);' (with solrj library). Actually, I need to
index the documents from my local network to Solr server running on a remote
network. The reason is, I would like to perform searches on server end and
dont want to store the indexed data at client side. Typically, the approach
is to transfer the entire file content to remote server (since, Solr server
is running on different remote network).

However, I believe, in Lucene the indexed data would only be 1% to 10% of
the original file data. Plz correct me, if I am wrong. So, I want to check
if i would be able to use 'CLucene' project in the client side, and generate
only the analysed data that needs to be stored in an index. Then, i would
transfer this data to the server (through socket or curl upload), and index
the analysed content on the server side. So, with this approach, i want to
avoid transfering entire files and transfer only the indexable portion of
the content as input to the server. Then on the server side, i want to
perform the necessary processing to create the index with this input data.

Is there any way/api to achieve these steps on both the client and server
side using CLucene. Or any way to achieve this by digging into the CLucene
codes/project ?

Regards,
Rahul.
------------------------------------------------------------------------------
Get a FREE DOWNLOAD! and learn more about uberSVN rich system, 
user administration capabilities and model configuration. Take 
the hassle out of deploying and managing Subversion and the 
tools developers use with it. http://p.sf.net/sfu/wandisco-d2d-2
_______________________________________________
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers

Reply via email to