kennknowles opened a new issue, #19441:
URL: https://github.com/apache/beam/issues/19441

   I am building a pipeline that needs to process all records in a set of 
indexes, each suffixed with a timestamp. I have an alias that matches all of 
these indexes at once. However, I cannot use the alias name in ElasticsearchIO 
as it will try to read stats from this specific index. Because it is an alias 
and not an actual index, the response contains no count for the alias name 
itself and therefore Beam (Dataflow?) will estimate the size as being 0. This 
makes the pipeline end without even executing the query on the alias, even 
though that would have returned loads of documents.
   
   This should be easy to fix as the results of /<aliasname\>/_stats only 
contains indexes references by that alias, so instead of looking for a key 
<aliasname\> in the `indices` key in the returned JSON, it should consider all 
returned indexes and add the estimated sizes together.
   
   Imported from Jira 
[BEAM-6920](https://issues.apache.org/jira/browse/BEAM-6920). Original Jira may 
contain additional context.
   Reported by: MadEgg.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to