[ 
https://issues.apache.org/jira/browse/FLINK-1141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14163466#comment-14163466
 ] 

Ufuk Celebi commented on FLINK-1141:
------------------------------------

Unfortunately 0.8. [~StephanEwen] replied to this thread via email (not 
mirrored to JIRA):

{quote}
Can you add a compiler hint that forces a merge-join? That one is not deadlock 
prone...
{quote}
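
For reference, a minimal sketch of what such a hint could look like on a self-join. It assumes a Flink release whose DataSet API provides {{JoinHint}} and the {{join(other, hint)}} overload; older releases may need a different mechanism. The class name, sample data and output path are hypothetical and are not taken from LargeSelfJoin.java.

```java
import org.apache.flink.api.common.functions.FlatJoinFunction;
import org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint;
import org.apache.flink.api.java.DataSet;
import org.apache.flink.api.java.ExecutionEnvironment;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.util.Collector;

// Hypothetical example class; not the attached LargeSelfJoin.java.
public class SelfJoinWithHint {

    public static void main(String[] args) throws Exception {
        // Local environment with dop=4, as in the reported setup.
        ExecutionEnvironment env = ExecutionEnvironment.createLocalEnvironment(4);

        // Tiny illustrative input; the reported problem needs far more tuples.
        DataSet<Tuple2<Long, Long>> input = env.fromElements(
                new Tuple2<Long, Long>(1L, 10L),
                new Tuple2<Long, Long>(1L, 11L),
                new Tuple2<Long, Long>(2L, 20L));

        // Self-join on field 0, asking the optimizer for a sort-merge join
        // instead of letting it pick a hash-based strategy.
        DataSet<Tuple2<Long, Long>> joined = input
                .join(input, JoinHint.REPARTITION_SORT_MERGE)
                .where(0)
                .equalTo(0)
                .with(new FlatJoinFunction<Tuple2<Long, Long>, Tuple2<Long, Long>, Tuple2<Long, Long>>() {
                    @Override
                    public void join(Tuple2<Long, Long> left,
                                     Tuple2<Long, Long> right,
                                     Collector<Tuple2<Long, Long>> out) {
                        // Emit the pair of values that share the same key.
                        out.collect(new Tuple2<Long, Long>(left.f1, right.f1));
                    }
                });

        // Hypothetical output location.
        joined.writeAsText("/tmp/selfjoin-out");
        env.execute("Self-join with merge-join hint");
    }
}
```

Whether this actually avoids the hang for the reported input size has not been verified here; it only illustrates where the hint would go.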

> Selfjoin fails after DataSet exceeds certain size
> -------------------------------------------------
>
>                 Key: FLINK-1141
>                 URL: https://issues.apache.org/jira/browse/FLINK-1141
>             Project: Flink
>          Issue Type: Bug
>          Components: Local Runtime
>    Affects Versions: 0.6.1-incubating
>         Environment: LocalExecutionEnvironment (dop=4)
>            Reporter: Robert Waury
>            Priority: Minor
>         Attachments: LargeSelfJoin.java
>
>
> As soon as a DataSet exceeds a certain size (1000000 tuples in my example) a 
> Selfjoin with a FlatJoinFunction no longer works. After around a second the 
> Join, DataSource and DataSink threads are all in Wait and don't perform any 
> work (no output files are created) and the job never finishes.
> If I cut the input size in half it works fine.
> My current workaround is to create the DataSet twice and join the two 
> identical DataSets.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
