Re: Merge/segment understanding

Han JU Mon, 31 Mar 2014 03:03:36 -0700

Thanks Binh.

I'm curious about this because we're benchmarking our bulk indexing, and 
we've found the fastest bulk indexing strategy to be:


- bulk index with 0 replicas and no refresh, letting ES merge as little as 
possible
- when indexing has finished, optimize the segments
- then add the replicas
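
As a rough sketch of the sequence above, the requests around the bulk load 
could look like the following. It only builds the request list and sends 
nothing. The helper name bulk_load_plan, the index name, the restored values 
(1 replica, "1s" refresh) and max_num_segments=1 are illustrative 
assumptions, not settings from our benchmark. The _optimize endpoint is the 
ES 1.x name; later releases call it _forcemerge.

```python
import json


def bulk_load_plan(index, replicas_after=1, refresh_after="1s"):
    """Return (method, path, body) tuples to issue around a bulk load.

    Hypothetical helper: it describes the requests, it does not send them.
    """
    settings_path = "/%s/_settings" % index

    # Step 1: no replicas and no periodic refresh while indexing.
    before = {"index": {"number_of_replicas": 0, "refresh_interval": "-1"}}

    # Step 3: bring replicas and refresh back once segments are optimized.
    # The restored values are assumptions; use whatever the index needs.
    after = {
        "index": {
            "number_of_replicas": replicas_after,
            "refresh_interval": refresh_after,
        }
    }

    return [
        ("PUT", settings_path, json.dumps(before)),
        # ... the bulk requests themselves (POST /_bulk) go here ...
        # Step 2: merge down the segments (max_num_segments=1 is an assumption).
        ("POST", "/%s/_optimize?max_num_segments=1" % index, None),
        ("PUT", settings_path, json.dumps(after)),
    ]


if __name__ == "__main__":
    for method, path, body in bulk_load_plan("benchmark_index"):
        print(method, path, body or "")
```

Replicas are added last so that the already optimized segments are copied 
to them once, instead of every replica indexing and merging on its own.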

Are there any readings on the details/internals of Lucene? We have the 
book Lucene in Action, but it's mainly about core concepts and usage.

On Friday, March 28, 2014 at 8:32:46 PM UTC+1, Binh Ly wrote:
>
> The indexing buffer could also fill up which will flush to a segment. Also 
> the translog flush is not "exactly" deterministic, for example 
> "index.translog.interval" determines how often to check if the translog 
> needs to be flushed or not. Anyway, I wouldn't worry about it if I were 
> you. About the merge, I'd probably leave the defaults alone unless you are 
> absolutely sure changing them helps you. The more segments there are, the 
> more time it could take to do a merge.
>
