Re: Elasticsearch 1.1.0 - Optimize broken?

Elliott Bradshaw Wed, 09 Apr 2014 07:07:49 -0700

Hi Adrien,

I did customize my merge policy, although I did so only because I was so 
surprised by the number of segments left over after the load.  I'm pretty 
sure the optimize problem was happening before I made this change, but 
either way here are my settings:


"index" : {
"merge" : {
"policy" : {
"max_merged_segment" : "20gb",
"segments_per_tier" : 5,
"floor_segment" : "10mb"
},
"scheduler" : "concurrentmergescheduler"
}
}

Not sure whether this set up could be a contributing factor or not.  
Nothing really jumps out at me in the logs.  In fact, when i kick off the 
optimize, I don't see any logging at all.  Should I?

I'm running the following command: curl -XPOST 
http://localhost:9200/index/_optimize

Thanks!


On Wednesday, April 9, 2014 8:56:35 AM UTC-4, Adrien Grand wrote:
>
> Hi Elliott,
>
> 1500 segments per shard is certainly way too much, and it is not normal 
> that optimize doesn't manage to reduce the number of segments.
>  - Is there anything suspicious in the logs?
>  - Have you customized the merge policy or scheduler?[1]
>  - Does the issue still reproduce if you restart your cluster?
>
> [1] 
> http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/index-modules-merge.html
>
>
>
> On Wed, Apr 9, 2014 at 2:38 PM, Elliott Bradshaw 
> <[email protected]<javascript:>
> > wrote:
>
>> Any other thoughts on this?  Would 1500 segments per shard be 
>> significantly impacting performance?  Have you guys noticed this behavior 
>> elsewhere?
>>
>> Thanks.
>>
>>
>> On Monday, April 7, 2014 8:56:38 AM UTC-4, Elliott Bradshaw wrote:
>>>
>>> Adrian,
>>>
>>> I ran the following command:
>>>
>>> curl -XPUT http://localhost:9200/_settings -d 
>>> '{"indices.store.throttle.max_bytes_per_sec" : "10gb"}'
>>>
>>> and received a { "acknowledged" : "true" } response.  The logs showed 
>>> "cluster state updated".
>>>
>>> I did have to close my index prior to changing the setting and reopen 
>>> afterward.
>>>
>>>
>>> I've since began another optimize, but again it doesn't look like much 
>>> is happening.  The optimize isn't returning and the total CPU usage on 
>>> every node is holding at about 2% of a single core.  I would copy a 
>>> hot_threads stack trace, but I'm unfortunately on a closed network and this 
>>> isn't possible.  I can tell you that refreshes of hot_threads show vary 
>>> little happening.  The occasional [merge] thread (always in a 
>>> LinkedTransferQueue.awaitMatch() state) or [optimize] (doing nothing on 
>>> a waitForMerge() call) thread shows up, but it's always consuming 0-1% 
>>> CPU.  It sure feels like something isn't right.  Any thoughts?
>>>
>>>
>>>
>>>
>>> On Fri, Apr 4, 2014 at 3:24 PM, Adrien Grand <adrien...@elasticsearch.
>>> com <javascript:>> wrote:
>>>
>>>> Did you see a message in the logs confirming that the setting has been 
>>>> updated? It would be interesting to see the output of hot threads[1] to 
>>>> see 
>>>> what your node is doing.
>>>>
>>>> [1] http://www.elasticsearch.org/guide/en/elasticsearch/
>>>> reference/current/cluster-nodes-hot-threads.html
>>>>
>>>>
>>>> On Fri, Apr 4, 2014 at 7:18 PM, Elliott Bradshaw 
>>>> <[email protected]<javascript:>
>>>> > wrote:
>>>>
>>>>> Yes. I have run max_num_segments=1 every time.
>>>>>
>>>>>
>>>>> On Fri, Apr 4, 2014 at 12:26 PM, Michael Sick <
>>>>> [email protected] <javascript:>> wrote:
>>>>>
>>>>>> Have you tried max_num_segments=1 on your optimize? 
>>>>>>
>>>>>> On Fri, Apr 4, 2014 at 11:27 AM, Elliott Bradshaw 
>>>>>> <[email protected]<javascript:>
>>>>>> > wrote:
>>>>>>
>>>>>>> Any thoughts on this?  I've run optimize several more times, and the 
>>>>>>> number of segments falls each time, but I'm still over 1000 segments 
>>>>>>> per 
>>>>>>> shard.  Has anyone else run into something similar?
>>>>>>>
>>>>>>>
>>>>>>> On Thursday, April 3, 2014 11:21:29 AM UTC-4, Elliott Bradshaw wrote:
>>>>>>>>
>>>>>>>> OK.  Optimize finally returned, so I suppose something was 
>>>>>>>> happening in the background, but I'm still seeing over 6500 segments.  
>>>>>>>> Even 
>>>>>>>> after setting max_num_segments=5.  Does this seem right?  Queries are 
>>>>>>>> a 
>>>>>>>> little faster (350-400ms) but still not great.  Bigdesk is still 
>>>>>>>> showing a 
>>>>>>>> fair amount of file IO.
>>>>>>>>
>>>>>>>> On Thursday, April 3, 2014 8:47:32 AM UTC-4, Elliott Bradshaw wrote:
>>>>>>>>>
>>>>>>>>> Hi All,
>>>>>>>>>
>>>>>>>>> I've recently upgraded to Elasticsearch 1.1.0.  I've got a 4 node 
>>>>>>>>> cluster, each with 64G of ram, with 24G allocated to Elasticsearch on 
>>>>>>>>> each.  I've batch loaded approximately 86 million documents into a 
>>>>>>>>> single 
>>>>>>>>> index (4 shards) and have started benchmarking 
>>>>>>>>> cross_field/multi_match 
>>>>>>>>> queries on them.  The index has one replica and takes up a total of 
>>>>>>>>> 111G.  
>>>>>>>>> I've run several batches of warming queries, but queries are not as 
>>>>>>>>> fast as 
>>>>>>>>> I had hoped, approximately 400-500ms each.  Given that *top *(on 
>>>>>>>>> Centos) shows 5-8 GB of free memory on each server, I would assume 
>>>>>>>>> that the 
>>>>>>>>> entire index has been paged into memory (I had worried about disk 
>>>>>>>>> performance previously, as we are working in a virtualized 
>>>>>>>>> environment).
>>>>>>>>>
>>>>>>>>> A stats query on the index in questions shows that the index is 
>>>>>>>>> composed of > 7000 segments.  This seemed high to me, but maybe it's 
>>>>>>>>> appropriate.  Regardless, I dispatched an optimize command, but I am 
>>>>>>>>> not 
>>>>>>>>> seeing any progress and the command has not returned.  Current merges 
>>>>>>>>> remains at zero, and the segment count is not changing.  Checking out 
>>>>>>>>> hot 
>>>>>>>>> threads in ElasticHQ, I initially saw an optimize call in the stack 
>>>>>>>>> that 
>>>>>>>>> was blocked on a waitForMerge call.  This however has disappeared, 
>>>>>>>>> and I'm 
>>>>>>>>> seeing no evidence that the optimize is occuring.
>>>>>>>>>
>>>>>>>>> Does any of this seem out of the norm or unusual?  Has anyone else 
>>>>>>>>> had similar issues.  This is the second time I have tried to optimize 
>>>>>>>>> an 
>>>>>>>>> index since upgrading.  I've gotten the same result both time.
>>>>>>>>>
>>>>>>>>> Thanks in advance for any help/tips!
>>>>>>>>>
>>>>>>>>> - Elliott
>>>>>>>>>
>>>>>>>>  -- 
>>>>>>> You received this message because you are subscribed to the Google 
>>>>>>> Groups "elasticsearch" group.
>>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>>> send an email to [email protected] <javascript:>.
>>>>>>>  To view this discussion on the web visit 
>>>>>>> https://groups.google.com/d/msgid/elasticsearch/5391291f-
>>>>>>> 5c5e-4088-a1f2-93272beef0bb%40googlegroups.com<https://groups.google.com/d/msgid/elasticsearch/5391291f-5c5e-4088-a1f2-93272beef0bb%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>>> .
>>>>>>>
>>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>>
>>>>>>
>>>>>>  -- 
>>>>>> You received this message because you are subscribed to a topic in 
>>>>>> the Google Groups "elasticsearch" group.
>>>>>> To unsubscribe from this topic, visit https://groups.google.com/d/
>>>>>> topic/elasticsearch/kqTRRADQBwc/unsubscribe.
>>>>>> To unsubscribe from this group and all its topics, send an email to 
>>>>>> [email protected] <javascript:>.
>>>>>> To view this discussion on the web visit https://groups.google.com/d/
>>>>>> msgid/elasticsearch/CAP8axnD7BUziGct2%3Db%
>>>>>> 3DfupaKYFnA5fR2TBsxHoURJumHSyODFA%40mail.gmail.com<https://groups.google.com/d/msgid/elasticsearch/CAP8axnD7BUziGct2%3Db%3DfupaKYFnA5fR2TBsxHoURJumHSyODFA%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>>>>> .
>>>>>>
>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>
>>>>>
>>>>>  -- 
>>>>> You received this message because you are subscribed to the Google 
>>>>> Groups "elasticsearch" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>>> an email to [email protected] <javascript:>.
>>>>>  To view this discussion on the web visit https://groups.google.com/d/
>>>>> msgid/elasticsearch/CAGCt%2BFvoSQTvv%2B6G%3D3GOX27AuYdEwLiW%
>>>>> 3Demc0JTouT9%2BBeUk_A%40mail.gmail.com<https://groups.google.com/d/msgid/elasticsearch/CAGCt%2BFvoSQTvv%2B6G%3D3GOX27AuYdEwLiW%3Demc0JTouT9%2BBeUk_A%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>>>> .
>>>>>  
>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>
>>>>
>>>>
>>>>
>>>> -- 
>>>> Adrien Grand
>>>>  
>>>> -- 
>>>> You received this message because you are subscribed to a topic in the 
>>>> Google Groups "elasticsearch" group.
>>>> To unsubscribe from this topic, visit https://groups.google.com/d/
>>>> topic/elasticsearch/kqTRRADQBwc/unsubscribe.
>>>> To unsubscribe from this group and all its topics, send an email to 
>>>> [email protected] <javascript:>.
>>>> To view this discussion on the web visit https://groups.google.com/d/
>>>> msgid/elasticsearch/CAL6Z4j6sQrPjijV86nYGoGTAQ%3D3cO_pgyYE6%2B3sGjJPr8%
>>>> 2BKDsg%40mail.gmail.com<https://groups.google.com/d/msgid/elasticsearch/CAL6Z4j6sQrPjijV86nYGoGTAQ%3D3cO_pgyYE6%2B3sGjJPr8%2BKDsg%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>>> .
>>>>
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>>  -- 
>> You received this message because you are subscribed to the Google Groups 
>> "elasticsearch" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/elasticsearch/8742280e-922f-4e91-bcb2-6096ca0165e6%40googlegroups.com<https://groups.google.com/d/msgid/elasticsearch/8742280e-922f-4e91-bcb2-6096ca0165e6%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
>
> -- 
> Adrien Grand
>  

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/775e2e32-5891-4514-bd53-32b52c0deb1a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Re: Elasticsearch 1.1.0 - Optimize broken?

Reply via email to