Thomas Groh created BEAM-3646:
---------------------------------
Summary: Add comments about appropriate use of DoFn.Teardown
Key: BEAM-3646
URL: https://issues.apache.org/jira/browse/BEAM-3646
Project: Beam
Issue Type: Bug
Components: sdk-java-core
Reporter: Thomas Groh
Assignee: Thomas Groh
Because the {{Teardown}} method has no relation to the atomicity of processing
and commiting of output, it is EXTREMELY DANGEROUS to use to flush outputs, and
buffered data there is extremely likely to never be flushed. If a DoFn instance
with buffered data is lost (for example, via worker/machine failure), and the
runner has committed the result of processing that input, the data is lost.
Not commenting on this being the case can cause users to believe that
(especially if running a batch pipeline) that their data will be flushed on
pipeline completion. This is very dangerous behavior that we do not warn of
sufficiently.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)