[ https://issues.apache.org/jira/browse/BEAM-7989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ismaël Mejía resolved BEAM-7989. -------------------------------- Resolution: Fixed Fix Version/s: 2.16.0 > SparkRunner CacheVisitor counts PCollections from SideInputs > ------------------------------------------------------------ > > Key: BEAM-7989 > URL: https://issues.apache.org/jira/browse/BEAM-7989 > Project: Beam > Issue Type: Bug > Components: runner-spark > Affects Versions: 2.14.0 > Reporter: Kyle Winkelman > Assignee: Kyle Winkelman > Priority: Major > Fix For: 2.16.0 > > Time Spent: 1h 10m > Remaining Estimate: 0h > > The SparkRunner's CacheVisitor looks at all inputs for a > TransformHierarchy.Node. Those inputs include the PCollections from the > PCollectionViews that are supplied as sideInputs. > The SparkRunner should not count these instances of sideInputs as the > PCollections are not actually accessed. They are only accessed when the > CreatePCollectionView Transform is processed. -- This message was sent by Atlassian JIRA (v7.6.14#76016)