Hi Uwe,

> For larger segments it will automatically create CFS files

I was under the impression Lucene packed only smaller segments into CFS
files..... based on this 3 years old comment from Mike:
https://github.com/elastic/elasticsearch/issues/8919 .  Maybe that comment
is out of date now?

Thanks,
Otis
--
Monitoring - Log Management - Alerting - Anomaly Detection
Solr & Elasticsearch Consulting Support Training - http://sematext.com/


On Sun, Apr 23, 2017 at 11:40 AM, Uwe Schindler <[email protected]> wrote:

> Hi Avi,
>
> There is nothing wrong with CFS files. They are just like zip files,
> containing multiple other index files. Sometimes, when you add only few
> documents, IndexWriter starts to merge several older segments to a new
> file. For larger segments it will automatically create CFS files, as those
> segments are unlikely to change. During merging it needs additional disk
> space. At end of merging it will delete old segments, unless they are used
> by older commit points or if Index searchers are referring to them. You
> should have at least 2 or 3 times the original index size on spare for
> indexes that change. Keep in mind, that e.g. on Windows where files in use
> cannot be deleted, you may see older segment for long time.
> As far as I know, depending on merge policy, Sole no longer defaults to
> not use CFS files. For large segments CFS files are better as they use less
> file handles. Smaller segments still use no compounds. So it is a matter of
> segment size by default, like in Lucene.
>
> Uwe
>
>
> Am 23. April 2017 11:50:17 MESZ schrieb Avi Steiner <[email protected]
> >:
>>
>> Hi
>>
>>
>>
>> We have a customer with Solr 5.3.1.
>>
>> The index contains less than 3.5 million docs, and index folder size is
>> about 240GB.
>>
>> I found that the huge files are .cfs files (compound files) that were
>> created lately although only few documents were added.
>>
>> The useCompoundFile parameter is commented in SolrConfig.xml.
>>
>> As far as I understand the default of Solr is false, and of Lucene is
>> true, which means this feature should be disabled.
>>
>> I would like to understand why those files created and why they are so
>> huge.
>>
>>
>>
>> Regards,
>>
>>
>>
>> Avi
>>
>>
>>
>> ------------------------------
>> This email and any attachments thereto may contain private, confidential,
>> and privileged material for the sole use of the intended recipient. Any
>> review, copying, or distribution of this email (or any attachments thereto)
>> by others is strictly prohibited. If you are not the intended recipient,
>> please contact the sender immediately and permanently delete the original
>> and any copies of this email and any attachments thereto.
>
>
> --
> Uwe Schindler
> Achterdiek 19, 28357 Bremen
> https://www.thetaphi.de
>

Reply via email to