Hi

I'm indexing w/ IW, flush-by-RAM=off and flush-by-doc=MAX_INT. Whenever
iw.ramSizeInBytes() >= threshold, I commit the changes, serialize the
Directory somewhere and start over with a new Directory and IW instance.

The threshold is currently 32MB. I noticed though that the serialized
Directory is only about half that size (<16 MB). Is that expected? Will I
see that behavior every time (e.g. w/ large stored fields), or is it data
dependent? I assumed the data can affect the compression, but I never
expected a reduction of around 50% going from RAM to disk.

Shai
