[ 
https://issues.apache.org/jira/browse/BEAM-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marek Simunek updated BEAM-6350:
--------------------------------
    Description: 
If BroadcastHashJoinTranslator is used for a LeftJoin, a PCollectionView is 
created from the right side (as a side input).

If the same right side is used in multiple joins, the PCollectionView is 
created multiple times, which is not optimal.

E.g.
{code:java}
LeftJoin.of(left, right)..
LeftJoin.of(anotherLeftPcollection, right)..

{code}
 This happens, for example, when we want to handle a skewed join.

  was:
If BroadcastHashJoinTranslator is used for a LeftJoin, a PCollectionView is 
created from the right side (as a side input). If the same right side is used 
in multiple joins, the PCollectionView is created multiple times, which is 
not optimal.

E.g.
{code:java}
LeftJoin.of(left, right)..
LeftJoin.of(anotherLeftPcollection, right)..

{code}
 This happens, for example, when we want to handle a skewed join.


> Reuse PCollectionView when created in translators
> -------------------------------------------------
>
>                 Key: BEAM-6350
>                 URL: https://issues.apache.org/jira/browse/BEAM-6350
>             Project: Beam
>          Issue Type: Improvement
>          Components: dsl-euphoria
>    Affects Versions: 2.11.0
>            Reporter: Marek Simunek
>            Assignee: David Moravek
>            Priority: Major
>
> If BroadcastHashJoinTranslator is used for a LeftJoin, a PCollectionView is 
> created from the right side (as a side input).
> If the same right side is used in multiple joins, the PCollectionView is 
> created multiple times, which is not optimal.
> E.g.
> {code:java}
> LeftJoin.of(left, right)..
> LeftJoin.of(anotherLeftPcollection, right)..
> {code}
>  This happens, for example, when we want to handle a skewed join.
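
One possible way to avoid the repeated view creation described above is to cache the view per right-side collection, so that a second join against the same right side gets the view that already exists. The following is only a minimal sketch of that idea, not the actual fix: `ViewCache`, `getOrCreate` and `createdCount` are hypothetical names that do not exist in Beam or Euphoria, and the generic parameters `C` and `V` stand in for the PCollection and PCollectionView types so the example runs without Beam on the classpath.

```java
import java.util.IdentityHashMap;
import java.util.Map;
import java.util.function.Function;

/** Hypothetical helper: keeps at most one view per right-side collection. */
class ViewCache<C, V> {
  // Identity-based map: two joins share a view only if they pass
  // the very same collection instance as the right side.
  private final Map<C, V> views = new IdentityHashMap<>();
  private int created = 0;

  /** Returns the cached view for the collection, creating it on first use. */
  V getOrCreate(C collection, Function<C, V> factory) {
    V view = views.get(collection);
    if (view == null) {
      view = factory.apply(collection);
      views.put(collection, view);
      created++;
    }
    return view;
  }

  /** Number of views that were actually created (cache misses). */
  int createdCount() {
    return created;
  }

  public static void main(String[] args) {
    ViewCache<Object, String> cache = new ViewCache<>();
    Object right = new Object(); // stand-in for the shared right side

    // Two joins against the same right side ask for a view.
    String first = cache.getOrCreate(right, c -> new String("view"));
    String second = cache.getOrCreate(right, c -> new String("view"));

    System.out.println(first == second);      // same instance was reused
    System.out.println(cache.createdCount()); // only one view was created
  }
}
```

In a translator, the factory would presumably be the code that currently builds the side input from the right side. Where such a cache should live (for example per pipeline translation) is left open here.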



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
