Robert Bradshaw created BEAM-4030:
-------------------------------------
Summary: Add CombineFn.compact, similar to Java
Key: BEAM-4030
URL: https://issues.apache.org/jira/browse/BEAM-4030
Project: Beam
Issue Type: Bug
Components: sdk-py-core
Reporter: Robert Bradshaw
Assignee: Ahmet Altay
Some CombineFns buffer elements in their add_inputs because the cost of a
combining operation can be effectively amortized across many elements. However,
this introduces the extra (possibly higher) cost of serializing larger, more
expensive buffers through shuffle. We should add a CombineFn.compact(self,
accumulator) method (defaulting to the identity), similar to what the Java SDK
provides, which is called when flushing an element from the PGBKCV table.
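
A minimal sketch of the idea, not the proposed implementation. It uses a
standalone class that only mirrors the CombineFn method names, so it runs
without apache_beam installed. The class name BufferingSumFn, the
BUFFER_LIMIT threshold, and the [total, pending] accumulator layout are
assumptions made for illustration.

```python
class BufferingSumFn:
    """Stand-in for a buffering CombineFn (hypothetical, does not import Beam)."""

    BUFFER_LIMIT = 1000  # hypothetical threshold for folding the buffer

    def create_accumulator(self):
        # Accumulator layout (assumed): [running total, pending elements].
        return [0, []]

    def add_input(self, accumulator, element):
        # Buffer the element; fold only once the buffer is large enough,
        # so the combining cost is amortized across many elements.
        accumulator[1].append(element)
        if len(accumulator[1]) >= self.BUFFER_LIMIT:
            accumulator = self.compact(accumulator)
        return accumulator

    def compact(self, accumulator):
        # Fold the pending buffer into the total so that a small
        # accumulator, rather than the whole buffer, gets serialized.
        total, pending = accumulator
        return [total + sum(pending), []]

    def merge_accumulators(self, accumulators):
        merged = self.create_accumulator()
        for total, pending in accumulators:
            merged[0] += total
            merged[1].extend(pending)
        return self.compact(merged)

    def extract_output(self, accumulator):
        return self.compact(accumulator)[0]


fn = BufferingSumFn()
acc = fn.create_accumulator()
for value in (1, 2, 3):
    acc = fn.add_input(acc, value)

buffered = [acc[0], list(acc[1])]  # snapshot before compaction
compacted = fn.compact(acc)        # what a runner would do before shuffle
result = fn.extract_output(compacted)
```

In this sketch, a runner would call compact on each accumulator it flushes
from the PGBKCV table, so the three buffered elements travel through shuffle
as a single total. A CombineFn that does not buffer would keep the identity
default and pay nothing extra.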