Hello, We use _update_by_query plugin to bulk update the documents. In the tests we've hit an issue where not all the documents are updated because the index is may not be refreshed before we do _update_by_query.
We have refresh interval set to 1 sec and this issue won't happen very often in the real life, as usually there is a longer timeframe between adding and updating a document. Nevertheless we want to solve the issue. Right now I can see two solutions: 1) Migrate _update_by_query to update by _id where possible (this works as documents are *gettable *by id right after they are indexed 2) Issue refresh before all _update_by_query operations The latter solution will make us safe (_refresh is blocking and we'll wait for confirmation before issuing update by's), but what is the performance cost? Is it a major one? For 99% for update_by_queries the refresh is not needed but we have no way to tell upfront. Thanks, Igor -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/57e7df7c-b6ec-4af5-bc83-37880df974c9%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
