[
https://issues.apache.org/jira/browse/BEAM-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marek Simunek updated BEAM-6350:
--------------------------------
Description:
When BroadcastHashJoinTranslator is used for a LeftJoin, a PCollectionView is
created from the right side (as a side input).
If the same right side is used in multiple joins, the PCollectionView is
created multiple times, which is not optimal.
E.g.
{code:java}
LeftJoin.of(left, right)..
LeftJoin.of(anotherLeftPcollection, right)..
{code}
This happens, for example, when we want to handle a skewed join.
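One possible direction, sketched in plain Java so it runs without Beam on the classpath: keep a per-translation cache keyed by the identity of the right-side input, so a second join over the same input gets the view that was already built. `ViewReuseSketch`, `ViewCache` and `getOrCreate` are hypothetical names for illustration, not Beam or Euphoria classes, and the lists below only stand in for a PCollection and its PCollectionView.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

/** Sketch only: plain-Java stand-ins, not Beam or Euphoria classes. */
final class ViewReuseSketch {

  /** Hypothetical cache that builds at most one view per distinct input instance. */
  static final class ViewCache<I, V> {
    // Identity semantics: only references to the very same input object share a view.
    private final Map<I, V> views = new IdentityHashMap<>();

    /** Returns the cached view for this input, building it with the factory on first use. */
    V getOrCreate(I input, Function<I, V> factory) {
      return views.computeIfAbsent(input, factory);
    }

    /** Number of views built so far. */
    int size() {
      return views.size();
    }
  }

  public static void main(String[] args) {
    // Stand-in for the right-side collection shared by two joins.
    List<String> right = Arrays.asList("a", "b");
    ViewCache<List<String>, List<String>> cache = new ViewCache<>();

    // Stand-in for building the side-input view: an unmodifiable copy of the input.
    Function<List<String>, List<String>> buildView =
        in -> Collections.unmodifiableList(new ArrayList<>(in));

    List<String> viewForFirstJoin = cache.getOrCreate(right, buildView);
    List<String> viewForSecondJoin = cache.getOrCreate(right, buildView);

    // Both joins receive the same view object; it was built only once.
    System.out.println(viewForFirstJoin == viewForSecondJoin);
    System.out.println(cache.size());
  }
}
```

Keying on identity rather than `equals` is a deliberate choice in this sketch: two distinct inputs that happen to hold equal elements still get separate views. Where such a cache would live in the translators, and how long it should be kept, is left open here.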
> Reuse PCollectionView when created in translators
> -------------------------------------------------
>
> Key: BEAM-6350
> URL: https://issues.apache.org/jira/browse/BEAM-6350
> Project: Beam
> Issue Type: Improvement
> Components: dsl-euphoria
> Affects Versions: 2.11.0
> Reporter: Marek Simunek
> Assignee: David Moravek
> Priority: Major
>
> When BroadcastHashJoinTranslator is used for a LeftJoin, a PCollectionView is
> created from the right side (as a side input).
> If the same right side is used in multiple joins, the PCollectionView is
> created multiple times, which is not optimal.
> E.g.
> {code:java}
> LeftJoin.of(left, right)..
> LeftJoin.of(anotherLeftPcollection, right)..
> {code}
> This happens, for example, when we want to handle a skewed join.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)