Hi Erick, hi Adrien,

thank you for both of your replies.

>> In a word, "no". When you set stored="true", Solr (well
>> actually Lucene) puts a compressed verbatim copy on disk.

I am using Lucene directly, not Solr, so I had the hope of being able to
configure the "store behavior" at this lower level.

(From the Javadoc it looks like SortedDocValuesField has this sharing
behavior, but AFAIK DocValues are not retrievable as StoredFields are.)

>> "Disk space is cheap" is the usual response here 

If we were talking a server application here, I would agree. But our
application is a search plugin for the Eclipse IDE and for some reason
developers are still surprisingly sensitive about their workspaces
consuming a few hundred megabytes of disk space.

> You can still give an id to each value on the application side if you want
> to avoid repeating values.

I am afraid I don't understand. Do you suggest using IntFields as ID
instead of StringFields, as they are presumably stored more efficiently?

> Otherwise, even without doing anything, things
> should not be too bad thanks to stored fields compression.

AFAICT, the fields are not compressed on disk right now. At least, "grep
-c" finds my field over and over in the index files.

So, how do I enabled stored fields compression. Googling turned up
Store.COMPRESS, but that doesn't exist in 5.2.1.

Best wishes,

Andreas

-- 
Codetrails GmbH
The knowledge transfer company

Robert-Bosch-Str. 7, 64293 Darmstadt
Phone: +49-6151-276-7092
Mobile: +49-170-811-3791
http://www.codetrails.com/

Managing Director: Dr. Marcel Bruch
Handelsregister: Darmstadt HRB 91940

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to