Some times, you have to store the raw-text somewhere because you may need it for highlighting for example and can't get access to the original raw-text. If this is the case and you can't use the #ZipLib to enable compressed field, then maybe you can do what I did with Lucene.Net 1.4.3 days. Store the raw-text on a file system where folder compression is enabled. When I did this, I had to make sure the raw-text file name had a name that matched the document ID in Lucene's index so I can get back to it.
-- George Aroush -----Original Message----- From: Digy [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 28, 2007 12:33 PM To: [email protected] Subject: RE: Index Size is Double Hi Laxmilal You don't have to store the raw data in the index for searching, It is needed only if you want to return the data in the search result. So you can use "Lucene.Net.Documents.Field.Store.NO". DIGY -----Original Message----- From: Laxmilal Menaria [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 28, 2007 9:22 AM To: [email protected] Subject: Re: Index Size is Double also wants to store the raw data with index for searching.. On 3/28/07, Laxmilal Menaria <[EMAIL PROTECTED]> wrote: > Hello Everyone, > > I am Lucene.Net user and have Indexed 200 MB data with this methods, > > lucenedoc.Add(New Lucene.Net.Documents.Field(FiledName, fieldvalue, > Lucene.Net.Documents.Field.Store.YES, > Lucene.Net.Documents.Field.Index.TOKENIZED, > Lucene.Net.Documents.Field.TermVector.YES)) > > but when I see the Index Size its show appx Double of raw data (390 > MB), it > is ok or any other methods for reducing the size, Ipsl suggest me asap.. > > Thanks in adavance, > -LM >
