Hello,

We use _update_by_query plugin to bulk update the documents. In the tests 
we've hit an issue where not all the documents are updated because the 
index is may not be refreshed before we do _update_by_query.

We have refresh interval set to 1 sec and this issue won't happen very 
often in the real life, as usually there is a longer timeframe between 
adding and updating a document.

Nevertheless we want to solve the issue. Right now I can see two solutions:

1) Migrate _update_by_query to update by _id where possible (this works as 
documents are *gettable *by id right after they are indexed
2) Issue refresh before all _update_by_query operations

The latter solution will make us safe (_refresh is blocking and we'll wait 
for confirmation before issuing update by's), but what is the performance 
cost? Is it a major one? For 99% for update_by_queries the refresh is not 
needed but we have no way to tell upfront.

Thanks,
Igor

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/57e7df7c-b6ec-4af5-bc83-37880df974c9%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to