Doesn't a segment that contains only deleted documents just get dropped?
Or does it get dropped _after_ the merge and therefore still sit
around?
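
To make the arithmetic in Walter's message below concrete, here is a
rough sketch of the two cases I am asking about. It is a toy model, not
Lucene code: the Segment class, the peak_disk_during_merge function and
the drop_fully_deleted_first flag are all names I made up, and the
10 GB sizes are invented. It assumes the merged segment holds exactly
the live data and that the old files stay on disk until the merge
finishes.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    size_gb: float        # size of the segment files on disk, in GB
    live_fraction: float  # share of documents NOT marked deleted (0.0 to 1.0)


def peak_disk_during_merge(segments, drop_fully_deleted_first=False):
    """Return (peak_gb, final_gb) for merging all segments into one.

    Toy model only (an assumption, not Lucene's real accounting):
    the merged segment contains just the live data, and every input
    segment remains on disk until the merged segment is complete.
    """
    if drop_fully_deleted_first:
        # Hypothetical behaviour: 100%-deleted segments vanish before the merge.
        segments = [s for s in segments if s.live_fraction > 0.0]
    before = sum(s.size_gb for s in segments)
    merged = sum(s.size_gb * s.live_fraction for s in segments)
    return before + merged, merged


# Walter's scenario: one set of segments that is 100% deletes,
# one set that is 0% deletes, 10 GB each (made-up sizes).
index = [Segment(10.0, 0.0), Segment(10.0, 1.0)]

print(peak_disk_during_merge(index))                                 # (30.0, 10.0)
print(peak_disk_during_merge(index, drop_fully_deleted_first=True))  # (20.0, 10.0)
```

So with 10 GB of live data the peak is 3X if the all-deleted segments
stick around until the merge is done, and 2X if they are dropped
first. Which of the two actually happens is exactly my question.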

Regards,
   Alex.
----
Solr Example reading group is starting November 2016, join us at
http://j.mp/SolrERG
Newsletter and resources for Solr beginners and intermediates:
http://www.solr-start.com/


On 29 October 2016 at 08:53, Walter Underwood <wun...@wunderwood.org> wrote:
> It is normal for disk usage to double. Under controlled circumstances,
> it can triple, but that probably won’t happen.
>
> This is the second time today that I’ve sent this information to the list.
>
> It can use nearly 2X the space whenever the largest segment(s) are
> merged, especially if there are only a few smaller segments.
>
> In order to use 3X the space, you need to:
>
> 1. Disable merging.
> 2. Delete all the documents.
> 3. Add all the documents.
> 4. Enable merging.
>
> This causes one complete set of segments that are 100% deletes,
> one set that is 0% deletes, then the merge creates another set that
> is 0% deletes. During the merge, the old files remain while the
> new one is created.
>
> wunder
> Walter Underwood
> wun...@wunderwood.org
> http://observer.wunderwood.org/  (my blog)
>
>
>> On Oct 28, 2016, at 2:41 PM, Alexandre Rafalovitch <arafa...@gmail.com> wrote:
>>
>> 2) Is probably a merge operation. Lucene index segments are not
>> rewritable in place, so the merge creates a new file, does everything
>> to it, then switches to it.
>>
>> I remember the number was that the space could temporarily triple
>> (?!?) though that may have been before the tiered merge policy.
>>
>> 3) It should be safe to delete old log files. It is standard log4j stuff.
>>
>>
>> On 29 October 2016 at 06:55, Jamal, Sarfaraz
>> <sarfaraz.ja...@verizonwireless.com.invalid> wrote:
>>> Hi Guys,
>>>
>>> I am currently investigating the disk space usage of a Solr instance and
>>> have a few questions I thought you guys might be able to help answer.
>>>
>>> First Question
>>> * There is 30 GB worth of autosuggest data in the /tmp folder. Each file
>>> is half a gigabyte.
>>> Is it safe to delete those files?
>>>
>>> Second Question
>>> Also, we notice that at times the disk runs down to only a few
>>> gigabytes available, and then goes back to having more space (the index
>>> file literally grows and then shrinks).
>>>
>>> Third Question
>>> Is it also safe to delete the log files?
>>>
>>> We run a database indexer on a set interval; perhaps that is relevant to
>>> this discussion.
>>>
>>> Sas
>