Re: [PROPOSAL] Structure the Flink Open Source Development

2016-05-13 Thread Fabian Hueske
I like the proposal and especially the goal to improve the metadata and descriptions of JIRA issues. However, I would like to split Client and Optimizer into separate components. I can be a maintainer of the optimizer component (DataSet + SQL are fine as well). Cheers, Fabian 2016-05-13 17:03

[jira] [Created] (FLINK-3912) Typos in Batch Scala API Documentation

2016-05-13 Thread Ignacio N. Lucero Ascencio (JIRA)
Ignacio N. Lucero Ascencio created FLINK-3912: - Summary: Typos in Batch Scala API Documentation Key: FLINK-3912 URL: https://issues.apache.org/jira/browse/FLINK-3912 Project: Flink

[jira] [Created] (FLINK-3911) Sort operation before a group reduce doesn't seem to be implemented on 1.0.2

2016-05-13 Thread Patrice Freydiere (JIRA)
Patrice Freydiere created FLINK-3911: Summary: Sort operation before a group reduce doesn't seem to be implemented on 1.0.2 Key: FLINK-3911 URL: https://issues.apache.org/jira/browse/FLINK-3911

[jira] [Created] (FLINK-3910) New self-join operator

2016-05-13 Thread Greg Hogan (JIRA)
Greg Hogan created FLINK-3910: - Summary: New self-join operator Key: FLINK-3910 URL: https://issues.apache.org/jira/browse/FLINK-3910 Project: Flink Issue Type: New Feature Components:

[jira] [Created] (FLINK-3909) Maven Failsafe plugin may report SUCCESS on failed tests

2016-05-13 Thread Maximilian Michels (JIRA)
Maximilian Michels created FLINK-3909: - Summary: Maven Failsafe plugin may report SUCCESS on failed tests Key: FLINK-3909 URL: https://issues.apache.org/jira/browse/FLINK-3909 Project: Flink

[jira] [Created] (FLINK-3908) FieldParsers error state is not reset correctly to NONE

2016-05-13 Thread Flavio Pompermaier (JIRA)
Flavio Pompermaier created FLINK-3908: - Summary: FieldParsers error state is not reset correctly to NONE Key: FLINK-3908 URL: https://issues.apache.org/jira/browse/FLINK-3908 Project: Flink

Re: [PROPOSAL] Structure the Flink Open Source Development

2016-05-13 Thread Greg Hogan
+1 to better scaling :) Many Jira tickets are good ideas with no current traction. Some have a pull request (usually closed), many have comments or discussion. It seems these old tickets tend to hang around because closing the ticket feels like rejecting the idea. How do we track requested

[jira] [Created] (FLINK-3907) Directed Clustering Coefficient

2016-05-13 Thread Greg Hogan (JIRA)
Greg Hogan created FLINK-3907: - Summary: Directed Clustering Coefficient Key: FLINK-3907 URL: https://issues.apache.org/jira/browse/FLINK-3907 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-3906) Global Clustering Coefficient

2016-05-13 Thread Greg Hogan (JIRA)
Greg Hogan created FLINK-3906: - Summary: Global Clustering Coefficient Key: FLINK-3906 URL: https://issues.apache.org/jira/browse/FLINK-3906 Project: Flink Issue Type: New Feature

Re: [PROPOSAL] Structure the Flink Open Source Development

2016-05-13 Thread Matthias J. Sax
Sounds like a good idea to me. We could include Wikipedia article as well. As was thinking about extending the article anyway (no time so far...), as of Flink 1.x the system is stable in large parts and it might be nice to have a high level system description on Wikipedia, too. -Matthias On

Re: Dataset split/demultiplex

2016-05-13 Thread Gábor Gévay
I would like to add that if your predicate does some heavy-weight computation that you want to avoid duplicating for the filters, then you can insert a map before the filters, where you evaluate the predicate and put the result into a field. Best, Gabor 2016-05-13 11:51 GMT+02:00 Fabian Hueske

Re: [PROPOSAL] Structure the Flink Open Source Development

2016-05-13 Thread Kostas Tzoumas
Should we also add a component "Flink website and wiki" (minus the documentation) with an associated maintainer? On Fri, May 13, 2016 at 12:17 PM, Timo Walther wrote: > +1 for from my side too > > > > On 13.05.2016 06:13, Chiwan Park wrote: > >> +1 for this proposal >> > > >

Re: Dataset split/demultiplex

2016-05-13 Thread Fabian Hueske
Hi, it is true that Gabor's approach of using two filters has a certain overhead. However, the overhead should be reasonable. The data stays on the same node and the filter can be very lightweight. I agree that this is not a very nice solution. However, modifying the DataSet API such that an

Re: [RESULT] [VOTE] Release Apache Flink 1.0.3 (RC3)

2016-05-13 Thread Maximilian Michels
Neat! :) On Thu, May 12, 2016 at 10:15 PM, Wright, Eron wrote: > FYI the brew formula has been updated to 1.0.3. > > $ brew info apache-flink > apache-flink: stable 1.0.3, HEAD > Scalable batch and stream data processing > https://flink.apache.org/ > Not installed > From: >

[jira] [Created] (FLINK-3905) Add KafkaOutputFormat (DataSet API)

2016-05-13 Thread Maximilian Bode (JIRA)
Maximilian Bode created FLINK-3905: -- Summary: Add KafkaOutputFormat (DataSet API) Key: FLINK-3905 URL: https://issues.apache.org/jira/browse/FLINK-3905 Project: Flink Issue Type: New

[jira] [Created] (FLINK-3904) GlobalConfiguration doesn't ensure config has been loaded

2016-05-13 Thread Maximilian Michels (JIRA)
Maximilian Michels created FLINK-3904: - Summary: GlobalConfiguration doesn't ensure config has been loaded Key: FLINK-3904 URL: https://issues.apache.org/jira/browse/FLINK-3904 Project: Flink