[
https://issues.apache.org/jira/browse/FLINK-128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Fabian Hueske closed FLINK-128.
-------------------------------
Resolution: Duplicate
Will be fixed with FLINK-1005
> Duplicate keys after chained reducers
> -------------------------------------
>
> Key: FLINK-128
> URL: https://issues.apache.org/jira/browse/FLINK-128
> Project: Flink
> Issue Type: Bug
> Reporter: GitHub Import
> Labels: github-import
> Fix For: pre-apache
>
>
> Hi guys,
> I'm trying to create the homework assignment for my course and as example I
> wanted to write a PACT job that computes the degree distribution of a graph
> (=how many vertices have outdegree x). The input is a file with "source
> target" ids per line.
> I have a Map that parses the data and emits (vertexID, 1), then a reduce that
> sums up the degree per vertexID and emits (degree, 1). The last reduce sums
> up the counts per degree and emits (degree, count) pairs. Unfortunately, I
> see duplicates in the output... I checked my code but can't see any problem.
> It would be super important for me to have this fixed soon.
> Here's the code:
> https://github.com/sscdotopen/graphmining-with-ozone/blob/master/src/main/java/de/tuberlin/dima/aim3/graphmining/OutDegreeDistribution.java
> ---------------- Imported from GitHub ----------------
> Url: https://github.com/stratosphere/stratosphere/issues/128
> Created by: [sscdotopen|https://github.com/sscdotopen]
> Labels: bug, core,
> Assignee: [rmetzger|https://github.com/rmetzger]
> Created at: Fri Oct 04 16:05:16 CEST 2013
> State: open
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)