Hi -
our indexed documents currently store solr fields like 'digest' or 'type',
which most of our documents will end up with same value (such as 'sha1' for
field 'digest', or 'message' for field 'type' etc).

on each solr server, we usually have 100 of millions of documents indexed
and with the same value on these fields (fields are stored and indexed).

any suggestion what is the  best approach if we suspect this will be very
inefficient on disk space usage, or is it?

thanks!
Jie



--
View this message in context: 
http://lucene.472066.n3.nabble.com/suggestion-howto-handle-highly-repetitive-valued-field-tp4026104.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to