Re: Converting an existing index format to Lucene Index

Edward Drapkin Fri, 25 Feb 2011 00:39:28 -0800

On 2/25/2011 12:26 AM, Lokendra Singh wrote:

Hi all,
I am seeking for some guidelines to directly convert an alreadyexisting index to Lucene index.The index available to me is of a set of <value1,value2> pairs. Whereeach pair is :
< word ,  fileName >
i.e a word as a 'value1', and the 'value2' being the fileNamecontaining that word.
A word might appear in several fileNames as well a same file cancontain multiple copies of a word. For eg, following index is possible:
< "my"  , "file1" >
< "you" , "file2" >
< "my",  "file2" >
< "my", "file1">
My actual problem is that the index available to me is very large insize, hence I am bit reluctant to create 'Document' object for eachfile because for that I will have to read through all the pairs firstand store them in memory. Or I will have to 'update' the 'Document'object of a particular file while iterating through the Pairs of myindex, this 'update', again, is a costly operation.
Please correct me if my understanding of Lucene is wrong or otheralternative ways.
Regards
Lokendra



---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Converting an existing index format to Lucene Index

Reply via email to