sheepdreamofandroids opened a new issue, #22773:
URL: https://github.com/apache/beam/issues/22773

   ### What happened?
   
   We have a relatively simple pipeline reading from Pubsub and writing to 
ElasticsearchIO running in the DataflowRunner.
   
   This fails in beam versions where 
ElasticsearchIO.BulkIO.BulkIOBaseFn.ProcessContextAdapter#output uses 
outputWithTimestamp() and succeeds where it uses output(). The git history 
shows that the implementation bounces back and forth between both 
implementations. ATM we're stuck with version 2.38.0 since that is the most 
recent that works.
   
   I thought this might be related to the pubsub clients using 
outputWithTimestamp with the original publication time. But an inbetween ParDo 
that uses outputWithTimestamp with the current time did not help.
   
   The error we get:
   ```
   java.lang.IllegalArgumentException: Cannot output with timestamp 
2022-08-16T11:03:57.086Z. Output timestamps must be no earlier than the 
timestamp of the current input or timer (2022-08-16T11:33:50.762Z) minus the 
allowed skew (0 milliseconds) and no later than 294247-01-10T04:00:54.775Z. See 
the DoFn#getAllowedTimestampSkew() Javadoc for details on changing the allowed 
skew.
           
org.apache.beam.runners.dataflow.worker.repackaged.org.apache.beam.runners.core.SimpleDoFnRunner.checkTimestamp(SimpleDoFnRunner.java:259)
           
org.apache.beam.runners.dataflow.worker.repackaged.org.apache.beam.runners.core.SimpleDoFnRunner.access$1300(SimpleDoFnRunner.java:85)
           
org.apache.beam.runners.dataflow.worker.repackaged.org.apache.beam.runners.core.SimpleDoFnRunner$DoFnProcessContext.outputWithTimestamp(SimpleDoFnRunner.java:416)
           
org.apache.beam.sdk.io.elasticsearch.ElasticsearchIO$BulkIO$BulkIOBaseFn$ProcessContextAdapter.output(ElasticsearchIO.java:2416)
           
org.apache.beam.sdk.io.elasticsearch.ElasticsearchIO$BulkIO$BulkIOBaseFn.flushAndOutputResults(ElasticsearchIO.java:2436)
           
org.apache.beam.sdk.io.elasticsearch.ElasticsearchIO$BulkIO$BulkIOBaseFn.addAndMaybeFlush(ElasticsearchIO.java:2448)
           
org.apache.beam.sdk.io.elasticsearch.ElasticsearchIO$BulkIO$BulkIOStatefulFn.processElement(ElasticsearchIO.java:2349)
   ```
   
   I'm not sure about the P1 prio.
   
   ### Issue Priority
   
   Priority: 1
   
   ### Issue Component
   
   Component: io-java-elasticsearch


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to