[GitHub] flink issue #2985: [FLINK-5104] Bipartite graph validation

2016-12-16 Thread mushketyk
Github user mushketyk commented on the issue:

https://github.com/apache/flink/pull/2985
  
Rebased this PR on top of `master`.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink issue #2985: [FLINK-5104] Bipartite graph validation

2016-12-15 Thread greghogan
Github user greghogan commented on the issue:

https://github.com/apache/flink/pull/2985
  
@mushketyk I had meant just with a simple comment noting that with an 
anti-join we could also remove the FlatJoinFunction.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink issue #2985: [FLINK-5104] Bipartite graph validation

2016-12-14 Thread mushketyk
Github user mushketyk commented on the issue:

https://github.com/apache/flink/pull/2985
  
@greghogan Thank you for the review.
Can you give me an example of how you document joins?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink issue #2985: [FLINK-5104] Bipartite graph validation

2016-12-14 Thread mushketyk
Github user mushketyk commented on the issue:

https://github.com/apache/flink/pull/2985
  
Sorry for the failing build. I've removed the unused imports so it should 
be fine now.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink issue #2985: [FLINK-5104] Bipartite graph validation

2016-12-14 Thread greghogan
Github user greghogan commented on the issue:

https://github.com/apache/flink/pull/2985
  
@mushketyk the build is failing due to unused imports (IntelliJ can be 
configured to automatically remove these).

There are so many ways to implement the validator. Projections would reduce 
the data to be transmitted and sorted. I like to document when we are 
performing a different kind of join (here, an anti-join) for the day when these 
are available in Flink. I don't know how counting in the `FlatJoinFunction` 
followed by two `DiscardingOutputFormat` compares with the current 
implementation of a union and count (which is a dummy `OutputFormat`).

We'll need to rebase this PR once FLINK-5311 has been committed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink issue #2985: [FLINK-5104] Bipartite graph validation

2016-12-13 Thread mushketyk
Github user mushketyk commented on the issue:

https://github.com/apache/flink/pull/2985
  
Hi @greghogan ,

Thank you for your feedback. I've updated the PR accordingly. The only 
thing that I did differently is that I've replaced projections and two 
`count()` executions with joins and one `count()` call. What do you think about 
this?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink issue #2985: [FLINK-5104] Bipartite graph validation

2016-12-11 Thread mushketyk
Github user mushketyk commented on the issue:

https://github.com/apache/flink/pull/2985
  
I noticed that `Graph` has the following implementation of the `validate` 
method:

```java
public Boolean validate(GraphValidator validator) throws 
Exception {
return validator.validate(this);
}
```

What is the reason for returning `Boolean` instead of `boolean` here 
(GraphValidator returns `boolean`)?
Shouldn't we change the return value to `boolean`?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---