Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
I think I found 2 more issues with the web interface. When inside a running job's view: - I think the colorful boxes with the number of tasks in each status show wrong values (or show something else?). I get different values than the ones I see in "Overview" and "Running Jobs" tabs. - In the

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Maximilian Michels
It is supposed to show some general statistics about a job but it is currently just a place holder. The accumulators are shown in the overview. This page should be removed before the release. Thanks, Max On Wed, Oct 28, 2015 at 12:54 PM, Vasiliki Kalavri wrote: > I

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
I see, thank you! +1 for removing before the release :) On 28 October 2015 at 13:06, Sachin Goel wrote: > Those are hard coded values. > What exactly should be there, I'm not sure either. > On Oct 28, 2015 5:25 PM, "Vasiliki Kalavri" >

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Maximilian Michels
@Vasia: This is a CSS problem which manifests because of a long class name. The colored boxes show the status of tasks from your job which you are viewing. Are the number not correct? @Sachin: Could you fix the wrapping of the column? On Wed, Oct 28, 2015 at 2:44 PM, Sachin Goel

Fast strategy for intersect

2015-10-28 Thread Martin Junghanns
Hi all! While working on FLINK-2905, I was wondering what a good (and fast) way to compute the intersect between two data sets (Gelly vertices in my case) with unknown size would be. I came up with three ways to solve this: Consider two sets: DataSet> verticesLeft =

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
I have a question regarding the web interface :) What is the "Job Accumulator/Statistics" tab supposed to show? No matter what job I run, the values are the same (operator=1, parallelism=2, subtasks=3). Are these hard-coded defaults? Thanks! -Vasia. On 28 October 2015 at 10:50, Maximilian

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Sachin Goel
While we're at it, we should also remove the dummy log and stdout tabs for task managers. The work on that hasn't been finished yet. I'll file a jira for both. On Oct 28, 2015 5:39 PM, "Vasiliki Kalavri" wrote: > I see, thank you! +1 for removing before the release :)

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Maximilian Michels
@Vasia: - There are two types of shapes which are colored :) The circles mark the running/finished/cancelled/failed jobs while the squares mark the status of a task within a job (cancelled/running/failed/restart/pending/finished/total). - I can see all four columns in the "Plan" tab on Firefox.

Re: Web Page Issue

2015-10-28 Thread Maximilian Michels
+1 for keeping the Quickstart on the main page but I'm against removing it from the documentation because it is, essentially, a part of the documentation. On Wed, Oct 28, 2015 at 2:25 PM, Matthias J. Sax wrote: > Good point. > > How often does "Quickstart" change? Seems to be

Re: Web Page Issue

2015-10-28 Thread Matthias J. Sax
Good point. How often does "Quickstart" change? Seems to be fairly stable. Maybe we could move it from doc page to main page? Btw: The link "Flink on Windows" is broken in the Quickstart guide. -Matthias On 10/28/2015 02:09 PM, Aljoscha Krettek wrote: > I think the quickstarts should be very

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Sachin Goel
@Max, I will try to get the wrap working [rather ellipsifying the text in this case.]. Not very good with CSS unfortunately. @Vasia, there seems to be different things which are being used to render those two. For the running jobs page, job.tasks is rendered, while for the job page,

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Vasiliki Kalavri
The numbers I see in the overview are different. See https://drive.google.com/file/d/0BzQJrI2eGlyYMHZZUGs2ZFJzaXc/view?usp=sharing vs. https://drive.google.com/file/d/0BzQJrI2eGlyYc3kzMlQ4OXN6a3c/view?usp=sharing -Vasia. On 28 October 2015 at 14:51, Maximilian Michels wrote:

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Maximilian Michels
Yes, that's correct. One is running operators (top of the job view) while the other lists all the parallel tasks (overview page, and detail view in job view). I think it makes sense where they are displayed at the moment. It's just confusing how they are displayed. Could we add a label at the top

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Sachin Goel
I think the squares on top of the job page are showing the status of vertices, not tasks. The squares on overview pages however show the number of tasks. Should we make it vertices or tasks everywhere, for consistency? -- Sachin Goel Computer Science, IIT Delhi m. +91-9871457685 On Wed, Oct 28,

Re: Flink "Material"

2015-10-28 Thread Maximilian Michels
Yes, you can find lots of Flink slides on the Slideshare. On Tue, Oct 27, 2015 at 9:46 PM, Matthias J. Sax wrote: > Hi, > > I just "discovered" that on the Flink "Material" page, a couple of slide > decks are listed (https://flink.apache.org/material.html). I guess, this >

Re: Web Page Issue

2015-10-28 Thread Maximilian Michels
We should be careful not to break links to the docs again. I'm in favor of making it more clear what is the Flink web site and what its documentation is. For me, it would be enough to change "Overview 1.0" to "Documentation 1.0" and have a clear link which says "Back to Flink website". That should

Re: Flink "Material"

2015-10-28 Thread Martin Liesenberg
It might also be useful to link to the training materials which can be found here: http://dataartisans.github.io/flink-training/ Best regards Martin Maximilian Michels schrieb am Mi., 28. Okt. 2015 10:53: > Yes, you can find lots of Flink slides on the Slideshare. > > On Tue,

Re: [DISCUSS] flink-external

2015-10-28 Thread Maximilian Michels
Thanks Matthias! I made a comment. Please open a pull request. On Tue, Oct 27, 2015 at 10:37 PM, Matthias J. Sax wrote: > Just updated this. Improved the layout and added FastR project. > > https://github.com/mjsax/flink-web/tree/flink-external-page > > -Matthias > > On

Re: Web Page Issue

2015-10-28 Thread Fabian Hueske
I agree with Max. Renaming Overview in Documentation and adding a clear link back to the project website are the most important issues, IMO. 2015-10-28 10:59 GMT+01:00 Maximilian Michels : > We should be careful not to break links to the docs again. I'm in > favor of making it

Re: Broken link for master Javadocs

2015-10-28 Thread Suneel Marthi
Thanks for taking care of this. On Wed, Oct 28, 2015 at 1:00 PM, Maximilian Michels wrote: > The issue with our Java Docs has been resolved. The link works again. > > On Tue, Oct 27, 2015 at 3:57 PM, Henry Saputra > wrote: > > Ah thanks Max, sending to

Re: Broken link for master Javadocs

2015-10-28 Thread Maximilian Michels
The issue with our Java Docs has been resolved. The link works again. On Tue, Oct 27, 2015 at 3:57 PM, Henry Saputra wrote: > Ah thanks Max, sending to commits@ is good > > - Henry > > On Tue, Oct 27, 2015 at 2:35 AM, Maximilian Michels wrote: >> Hi

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Maximilian Michels
@Sachin: I've tried it out. It has the tendency to make things a bit harder to read (because it breaks words at arbitrary positions). However, we don't have a better fix. On Wed, Oct 28, 2015 at 4:46 PM, Sachin Goel wrote: > Sorry. Wrong commit. In case you've pulled

Re: Fast strategy for intersect

2015-10-28 Thread Fabian Hueske
I would go for the first solution with the join. This gives the engine the highest degree of freedom: - repartition vs. broadcast-forward - sort-merge vs. hash-join Best, Fabian 2015-10-28 18:45 GMT+01:00 Vasiliki Kalavri : > Hi Martin, > > isn't finding the

Re: Scala 2.10/2.11 Maven dependencies

2015-10-28 Thread Frederick F. Kautz IV
No idea if I get a vote ;) Nevertheless, +1 to have binaries for both versions in Maven and explicitly "scala versioned". Some background on this for those not as familiar with scala versioning: It's considered best practice to label what version of scala a library uses in the artifact id.

Re: Caching information from a stream

2015-10-28 Thread Maximilian Michels
Hi Andra, What you thought of turns out to be one of the core features of the Flink streaming API. Flink's operators support state. State can be partitioned by the the key using keyBy(field). You may use a MapFunction to achieve what you wanted like so: public static void main(String[] args)

Re: [VOTE] Release Apache Flink 0.10.0 (release-0.10.0-rc2)

2015-10-28 Thread Till Rohrmann
-1 from my side. I just found a serious issue with the KryoSerializer (FLINK-2800) which in some cases produced duplicated elements or corrupted data. I opened a PR to fix the issue ( https://github.com/apache/flink/pull/1308). Cheers, Till ​ On Wed, Oct 28, 2015 at 5:38 PM, Maximilian Michels

Re: Scala 2.10/2.11 Maven dependencies

2015-10-28 Thread Fabian Hueske
+1 to have binaries for both versions in Maven and as build to download. 2015-10-26 17:11 GMT+01:00 Theodore Vasiloudis < theodoros.vasilou...@gmail.com>: > +1 for having binaries, I'm working on a Spark application currently with > Scala 2.11 and having to rebuild everything when deploying e.g.

Re: Caching information from a stream

2015-10-28 Thread Maximilian Michels
Oups, forgot the mapper :) static class StatefulMapper extends RichMapFunction, Tuple2> { private OperatorState counter; @Override public Tuple2 map(Tuple2 value) throws Exception { System.out.println("Key: " + value.f0 +

[jira] [Created] (FLINK-2936) ClassCastException when using EventTimeSourceFunction in non-EventTime program

2015-10-28 Thread Fabian Hueske (JIRA)
Fabian Hueske created FLINK-2936: Summary: ClassCastException when using EventTimeSourceFunction in non-EventTime program Key: FLINK-2936 URL: https://issues.apache.org/jira/browse/FLINK-2936

Re: Web Page Issue

2015-10-28 Thread Fabian Hueske
I agree, two Overview links pointing to different locations should be changed. I am not so sure about the Logo issue. IMO, there should be always a text link. The logo link should only be an addition. Maybe we should wait for more opinions, before we continue. The website has been changed a