Re: About Operator and OperatorBase

2015-04-20 Thread Henry Saputra
Since we are talking about common API, what was the original intention or design for this layer? From the doc: "The Common API operator exists only in order for the flink-java and flink-scala packages to not have a dependency on the optimizer." Currently the Java API Operator is converted into

Periodic full stream aggregations

2015-04-20 Thread Gyula Fóra
Hey all, I think we are missing a quite useful feature that could be implemented (with some slight modifications) on top of the current windowing api. We currently provide 2 ways of aggregating (or reducing) over streams: doing a continuous aggregation and always output the aggregated value (whic

[jira] [Created] (FLINK-1915) Faulty plan selection by optimizer

2015-04-20 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-1915: Summary: Faulty plan selection by optimizer Key: FLINK-1915 URL: https://issues.apache.org/jira/browse/FLINK-1915 Project: Flink Issue Type: Bug

Re: [DISCUSS] Replace user@f.a.o with Stackoverflow

2015-04-20 Thread Henry Saputra
@Ted, you are right, decisions should be made on the dev@ list, so I was not saying we could not have user discussions outside of Apache land, but they should also be happening on the user@ list. So I apologize to Ufuk, Robert, and Stephan if my response implied that. Like @Robert has said before, s

Re: [DISCUSS] Replace user@f.a.o with Stackoverflow

2015-04-20 Thread Ted Dunning
Henry, I think that the requirement is that project decisions need to be made on an Apache controlled resource. Not sure how many project decisions are made on the user list. I know I answer a ton of Mahout questions on quora. Having more than one arena for answers and reflecting between them f

Re: Hadoop ETLing with Flink

2015-04-20 Thread Fabian Hueske
> collection = LOAD 'test_data.csv' USING PigStorage(';') >> AS ( >> col1:chararray, >> col2:chararray, >> ); >> >> # use partitions >> STORE collection I

Re: Hadoop ETLing with Flink

2015-04-20 Thread Fabian Hueske
a.csv' USING PigStorage(';') > AS ( > col1:chararray, > col2:chararray, > ); > > # use partitions > STORE collection INTO 'import_table_hcat' USING > org.apache.hcatalog.pig.HCatStorer('datestamp

Re: [DISCUSS] Replace user@f.a.o with Stackoverflow

2015-04-20 Thread Stephan Ewen
I like the searchability of StackOverflow better than that of Mailing List archives, but we need to stick to Apache rules here. I like the idea of posting answers with a lot of effort to StackOverflow. Not sure however how it looks if we post a lot of questions and answer them ourselves. Looks weir

Re: [DISCUSS] Replace user@f.a.o with Stackoverflow

2015-04-20 Thread Kostas Tzoumas
If we clearly preface with a boilerplate text and a link to the users@ question it might look less weird :-) Perhaps we can create a stackoverflow user whose purpose is to create questions that mirror the users@ list? Not sure how much this can be automated though On Mon, Apr 20, 2015 at 4:25 PM,

Re: Question about Flink Project

2015-04-20 Thread Stephan Ewen
I have a side project where a web developer and I are drafting a new version of the runtime monitor interface. I'll send a demo as soon as we have something presentable. What kind of GUI topics are you interested in? On Mon, Apr 20, 2015 at 11:45 AM, Maximilian Michels wrote: > Hi Matthias, >

RE: Hadoop ETLing with Flink

2015-04-20 Thread Papp, Stefan
;import_table_hcat' USING org.apache.hcatalog.pig.HCatStorer('datestamp=20150420'); How would I implement this with Flink? Let us brainstorm about the code snippet... final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment(); CsvReader csvr = env.readCsvFile(fileP

[jira] [Created] (FLINK-1914) Wrong FS while starting YARN session without correct HADOOP_HOME

2015-04-20 Thread JIRA
Zoltán Zvara created FLINK-1914: Summary: Wrong FS while starting YARN session without correct HADOOP_HOME Key: FLINK-1914 URL: https://issues.apache.org/jira/browse/FLINK-1914 Project: Flink

Re: Rework of the window-join semantics

2015-04-20 Thread Stephan Ewen
Perfect! I am eager to see what you came up with! On Sat, Apr 18, 2015 at 2:00 PM, Gyula Fóra wrote: > Hey all, > > We have spent some time with Asterios, Paris and Jonas to finalize the > windowing semantics (both the current features and the window join), and I > think we made very have come u

Re: Hadoop ETLing with Flink

2015-04-20 Thread Robert Metzger
Hi Stefan, you can use Flink to load data into HDFS. The CSV reader is suited for reading delimiter separated text files into the system. But you can also read data from a lot of other sources (avro, jdbc, mongodb, hcatalog). We don't have any utilities to make writing to HCatalog very easy, but

Re: Merge Python API

2015-04-20 Thread Aljoscha Krettek
+1, as I said on the Github PR On Mon, Apr 20, 2015 at 3:00 PM, Till Rohrmann wrote: > +1 > > On Mon, Apr 20, 2015 at 2:50 PM, Timo Walther wrote: > >> +1 >> >> >> On 20.04.2015 14:49, Gyula Fóra wrote: >> >>> +1 >>> >>> On Mon, Apr 20, 2015 at 2:41 PM, Fabian Hueske wrote: >>> >>> +1 >>>

Re: Merge Python API

2015-04-20 Thread Till Rohrmann
+1 On Mon, Apr 20, 2015 at 2:50 PM, Timo Walther wrote: > +1 > > > On 20.04.2015 14:49, Gyula Fóra wrote: > >> +1 >> >> On Mon, Apr 20, 2015 at 2:41 PM, Fabian Hueske wrote: >> >> +1 >>> >>> 2015-04-20 14:39 GMT+02:00 Maximilian Michels : >>> >>> +1 Let's merge it to flink-staging and get som

Re: Merge Python API

2015-04-20 Thread Andra Lungu
+1 from me too, the presentation of the Python API seemed okay, plus having one more project in staging will not hurt. On Mon, Apr 20, 2015 at 2:49 PM, Gyula Fóra wrote: > +1 > > On Mon, Apr 20, 2015 at 2:41 PM, Fabian Hueske wrote: > > > +1 > > > > 2015-04-20 14:39 GMT+02:00 Maximilian Mi

Re: Merge Python API

2015-04-20 Thread Vasiliki Kalavri
+1 On 20 April 2015 at 14:50, Timo Walther wrote: > +1 > > > On 20.04.2015 14:49, Gyula Fóra wrote: > >> +1 >> >> On Mon, Apr 20, 2015 at 2:41 PM, Fabian Hueske wrote: >> >> +1 >>> >>> 2015-04-20 14:39 GMT+02:00 Maximilian Michels : >>> >>> +1 Let's merge it to flink-staging and get some peop

[jira] [Created] (FLINK-1913) Document how to access data in HCatalog

2015-04-20 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-1913: Summary: Document how to access data in HCatalog Key: FLINK-1913 URL: https://issues.apache.org/jira/browse/FLINK-1913 Project: Flink Issue Type: Bug

Re: Merge Python API

2015-04-20 Thread Timo Walther
+1 On 20.04.2015 14:49, Gyula Fóra wrote: +1 On Mon, Apr 20, 2015 at 2:41 PM, Fabian Hueske wrote: +1 2015-04-20 14:39 GMT+02:00 Maximilian Michels : +1 Let's merge it to flink-staging and get some people to use it. On Mon, Apr 20, 2015 at 2:21 PM, Kostas Tzoumas wrote: I'm +1 for thi

Re: Merge Python API

2015-04-20 Thread Gyula Fóra
+1 On Mon, Apr 20, 2015 at 2:41 PM, Fabian Hueske wrote: > +1 > > 2015-04-20 14:39 GMT+02:00 Maximilian Michels : > > > +1 Let's merge it to flink-staging and get some people to use it. > > > > On Mon, Apr 20, 2015 at 2:21 PM, Kostas Tzoumas > > wrote: > > > > > I'm +1 for this > > > > > > On M

Re: Merge Python API

2015-04-20 Thread Fabian Hueske
+1 2015-04-20 14:39 GMT+02:00 Maximilian Michels : > +1 Let's merge it to flink-staging and get some people to use it. > > On Mon, Apr 20, 2015 at 2:21 PM, Kostas Tzoumas > wrote: > > > I'm +1 for this > > > > On Mon, Apr 20, 2015 at 11:03 AM, Robert Metzger > > wrote: > > > > > Hi, > > > > > >

Re: Merge Python API

2015-04-20 Thread Maximilian Michels
+1 Let's merge it to flink-staging and get some people to use it. On Mon, Apr 20, 2015 at 2:21 PM, Kostas Tzoumas wrote: > I'm +1 for this > > On Mon, Apr 20, 2015 at 11:03 AM, Robert Metzger > wrote: > > > Hi, > > > > The Python API pull request [1] has been open for quite some time now. > > >

Re: Merge Python API

2015-04-20 Thread Kostas Tzoumas
I'm +1 for this On Mon, Apr 20, 2015 at 11:03 AM, Robert Metzger wrote: > Hi, > > The Python API pull request [1] has been open for quite some time now. > > I was wondering whether we are planning to merge it or not. > I took a closer look at the Python API a few weeks ago and I think we > shoul

Hadoop ETLing with Flink

2015-04-20 Thread Papp, Stefan
Hi, I want to load CSV files into a Hadoop cluster. How could I do that with Flink? I know I can load data into a CsvReader and then iterate over rows and transform them. Is there an easy way to store the results into HDFS+HCatalog within Flink? Thank you! Stefan Papp Lead Hadoop Consultant

Re: Question about Flink Project

2015-04-20 Thread Maximilian Michels
Hi Matthias, There's the Job Manager's web interface and the web client (started via ./start-webclient.sh). I believe some people were discussing changes to the Job Manager's interface in the past. The web client has some rendering issues and its GUI could be improved quite a bit. Best, Max On M

Question about Flink Project

2015-04-20 Thread Matthias
Hello Developers, are there open GUI-related issues for this project? I am looking for a new topic for my bachelor thesis. Thanks for your help, Matthias Schumacher

Re: [DISCUSS] Replace user@f.a.o with Stackoverflow

2015-04-20 Thread Robert Metzger
I think the mailing lists are indexed by Google as well, through various web-based mirrors. SO is good for questions with clear answers, but if a user wants more interactive feedback (with questions back and forth) the user@ list is probably better. Also, people who are experienced with Apache pr

Merge Python API

2015-04-20 Thread Robert Metzger
Hi, The Python API pull request [1] has been open for quite some time now. I was wondering whether we are planning to merge it or not. I took a closer look at the Python API a few weeks ago and I think we should merge it to expose it to our users to collect feedback. I hope by merging it, we'll f

Re: problem when use this coding please found slove

2015-04-20 Thread Aljoscha Krettek
Hi, what is the difference in the results? You posted this: (2,Customer#2,XSTf4&&NCwDVaWNe6tEgvwfmRchLXak,121.65,AUTOMOBILE) and this: (2,Customer#2,XSTf4&&NCwDVaWNe6tEgvwfmRchLXak,121.65,AUTOMOBILE) Also, how do you fill the LinkedList and how do you pass it to the user function?