[GitHub] flink pull request: [FLINK-377] [FLINK-671] Generic Interface / PA...

2015-02-03 Thread fhueske
Github user fhueske commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72640916
  
Just implemented the basic triangle enumeration job and figured out that 
this example is already included in this PR ;-)
However, when trying to run both programs, I encountered two problems:

1. I had to manually create a file `/tmp/flink_data/output` and give it a 
certain size (100MB worked for me). I'm on OS X.
2. After I had that file, the ./pyflink2.sh command did not print any error 
message but did not respond either. Seemed kind of deadlocked.

The error message for the missing file was not really helpful and could be 
improved.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-377) Create a general purpose framework for language bindings

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303164#comment-14303164
 ] 

ASF GitHub Bot commented on FLINK-377:
--

Github user fhueske commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72640916
  
Just implemented the basic triangle enumeration job and figured out that 
this example is already included in this PR ;-)
However, when trying to run both programs, I encountered two problems:

1. I had to manually create a file `/tmp/flink_data/output` and give it a 
certain size (100MB worked for me). I'm on OS X.
2. After I had that file, the ./pyflink2.sh command did not print any error 
message but did not respond either. Seemed kind of deadlocked.

The error message for the missing file was not really helpful and could be 
improved.


 Create a general purpose framework for language bindings
 

 Key: FLINK-377
 URL: https://issues.apache.org/jira/browse/FLINK-377
 Project: Flink
  Issue Type: Improvement
Reporter: GitHub Import
Assignee: Chesnay Schepler
  Labels: github-import
 Fix For: pre-apache


 A general purpose API to run operators with arbitrary binaries. 
 This will allow to run Stratosphere programs written in Python, JavaScript, 
 Ruby, Go or whatever you like. 
 We suggest using Google Protocol Buffers for data serialization. This is the 
 list of languages that currently support ProtoBuf: 
 https://code.google.com/p/protobuf/wiki/ThirdPartyAddOns 
 Very early prototype with python: 
 https://github.com/rmetzger/scratch/tree/learn-protobuf (basically testing 
 protobuf)
 For Ruby: https://github.com/infochimps-labs/wukong
 Two new students working at Stratosphere (@skunert and @filiphaase) are 
 working on this.
 The reference binding language will be for Python, but other bindings are 
 very welcome.
 The best name for this so far is stratosphere-lang-bindings.
 I created this issue to track the progress (and give everybody a chance to 
 comment on this)
  Imported from GitHub 
 Url: https://github.com/stratosphere/stratosphere/issues/377
 Created by: [rmetzger|https://github.com/rmetzger]
 Labels: enhancement, 
 Assignee: [filiphaase|https://github.com/filiphaase]
 Created at: Tue Jan 07 19:47:20 CET 2014
 State: open



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-1437][Java API] Fixes copy() methods in...

2015-02-03 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/342


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink pull request: [FLINK-1193][java-api][scala-api] Complete sup...

2015-02-03 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/348


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1464) Added ResultTypeQueryable interface to TypeSerializerInputFormat.

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303177#comment-14303177
 ] 

ASF GitHub Bot commented on FLINK-1464:
---

Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/349


 Added ResultTypeQueryable interface to TypeSerializerInputFormat.
 -

 Key: FLINK-1464
 URL: https://issues.apache.org/jira/browse/FLINK-1464
 Project: Flink
  Issue Type: Improvement
  Components: Distributed Runtime, Optimizer
Affects Versions: 0.8, 0.9, 0.8.1
Reporter: Alexander Alexandrov
Assignee: Alexander Alexandrov
Priority: Minor
  Labels: easyfix
 Fix For: 0.9, 0.8.1

   Original Estimate: 6h
  Remaining Estimate: 6h

 It is currently impossible to use the {{TypeSerializerInputFormat}} with 
 generic Tuple types.
 For example, [this example 
 gist|https://gist.github.com/aalexandrov/90bf21f66bf604676f37] fails with a
 {quote}
 Exception in thread main 
 org.apache.flink.api.common.InvalidProgramException: The type returned by the 
 input format could not be automatically determined. Please specify the 
 TypeInformation of the produced type explicitly.
 at 
 org.apache.flink.api.java.ExecutionEnvironment.readFile(ExecutionEnvironment.java:341)
 at SerializedFormatExample$.main(SerializedFormatExample.scala:48)
 at SerializedFormatExample.main(SerializedFormatExample.scala)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at com.intellij.rt.execution.application.AppMain.main(AppMain.java:134)
 {quote}
 exaception. 
 To fix the issue, I changed the constructor to take a {{TypeInformationT}} 
 instad of a {{TypeSerializerT}} argument. If this is indeed a bug, I think 
 that this is a good solution. 
 Unfortunately the fix breaks the API. Feel free to change it if you find a 
 more elegant solution compatible with the 0.8 branch.
 The suggested fix can be found in the GitHub 
 [PR#349|https://github.com/apache/flink/pull/349].



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1437) Bug in PojoSerializer's copy() method

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303175#comment-14303175
 ] 

ASF GitHub Bot commented on FLINK-1437:
---

Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/342


 Bug in PojoSerializer's copy() method
 -

 Key: FLINK-1437
 URL: https://issues.apache.org/jira/browse/FLINK-1437
 Project: Flink
  Issue Type: Bug
  Components: Java API
Reporter: Timo Walther
Assignee: Timo Walther

 The PojoSerializer's {{copy()}} method does not work properly with {{null}} 
 values. An exception could look like:
 {code}
 Caused by: java.io.IOException: Thread 'SortMerger spilling thread' 
 terminated due to an exception: null
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$ThreadBase.run(UnilateralSortMerger.java:792)
 Caused by: java.io.EOFException
   at 
 org.apache.flink.runtime.io.disk.RandomAccessInputView.nextSegment(RandomAccessInputView.java:83)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.advance(AbstractPagedInputView.java:159)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.readByte(AbstractPagedInputView.java:270)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.readUnsignedByte(AbstractPagedInputView.java:277)
   at org.apache.flink.types.StringValue.copyString(StringValue.java:839)
   at 
 org.apache.flink.api.common.typeutils.base.StringSerializer.copy(StringSerializer.java:83)
   at 
 org.apache.flink.api.java.typeutils.runtime.PojoSerializer.copy(PojoSerializer.java:261)
   at 
 org.apache.flink.runtime.operators.sort.NormalizedKeySorter.writeToOutput(NormalizedKeySorter.java:449)
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$SpillingThread.go(UnilateralSortMerger.java:1303)
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$ThreadBase.run(UnilateralSortMerger.java:788)
 {code}
 I'm working on a fix for that...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: Improved exception for missing type of InputFo...

2015-02-03 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/351


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1330) Restructure directory layout

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303260#comment-14303260
 ] 

ASF GitHub Bot commented on FLINK-1330:
---

Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/333


 Restructure directory layout
 

 Key: FLINK-1330
 URL: https://issues.apache.org/jira/browse/FLINK-1330
 Project: Flink
  Issue Type: Improvement
  Components: Build System, Documentation
Reporter: Max Michels
Priority: Minor
  Labels: usability

 When building Flink, the build results can currently be found under 
 flink-root/flink-dist/target/flink-$FLINKVERSION-incubating-SNAPSHOT-bin/flink-$YARNVERSION-$FLINKVERSION-incubating-SNAPSHOT/.
 I think we could improve the directory layout with the following:
 - provide the bin folder in the root by default
 - let the start up and submissions scripts in bin assemble the class path
 - in case the project hasn't been build yet, inform the user
 The changes would make it easier to work with Flink from source.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-377) Create a general purpose framework for language bindings

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303170#comment-14303170
 ] 

ASF GitHub Bot commented on FLINK-377:
--

Github user fhueske commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72641447
  
btw. implementing the program felt quite good. Very nice API, IMO!


 Create a general purpose framework for language bindings
 

 Key: FLINK-377
 URL: https://issues.apache.org/jira/browse/FLINK-377
 Project: Flink
  Issue Type: Improvement
Reporter: GitHub Import
Assignee: Chesnay Schepler
  Labels: github-import
 Fix For: pre-apache


 A general purpose API to run operators with arbitrary binaries. 
 This will allow to run Stratosphere programs written in Python, JavaScript, 
 Ruby, Go or whatever you like. 
 We suggest using Google Protocol Buffers for data serialization. This is the 
 list of languages that currently support ProtoBuf: 
 https://code.google.com/p/protobuf/wiki/ThirdPartyAddOns 
 Very early prototype with python: 
 https://github.com/rmetzger/scratch/tree/learn-protobuf (basically testing 
 protobuf)
 For Ruby: https://github.com/infochimps-labs/wukong
 Two new students working at Stratosphere (@skunert and @filiphaase) are 
 working on this.
 The reference binding language will be for Python, but other bindings are 
 very welcome.
 The best name for this so far is stratosphere-lang-bindings.
 I created this issue to track the progress (and give everybody a chance to 
 comment on this)
  Imported from GitHub 
 Url: https://github.com/stratosphere/stratosphere/issues/377
 Created by: [rmetzger|https://github.com/rmetzger]
 Labels: enhancement, 
 Assignee: [filiphaase|https://github.com/filiphaase]
 Created at: Tue Jan 07 19:47:20 CET 2014
 State: open



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-377] [FLINK-671] Generic Interface / PA...

2015-02-03 Thread fhueske
Github user fhueske commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72641447
  
btw. implementing the program felt quite good. Very nice API, IMO!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Resolved] (FLINK-1193) InvalidTypesException for Array[Array[Tuple2[Int, Double]]]

2015-02-03 Thread Stephan Ewen (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stephan Ewen resolved FLINK-1193.
-
   Resolution: Fixed
Fix Version/s: 0.9

Fixed via 57b3e9f9e50705033200dcca0762a5a65d1aba60

 InvalidTypesException for Array[Array[Tuple2[Int, Double]]]
 ---

 Key: FLINK-1193
 URL: https://issues.apache.org/jira/browse/FLINK-1193
 Project: Flink
  Issue Type: Bug
Reporter: Till Rohrmann
Assignee: Timo Walther
 Fix For: 0.9


 The TypeExtractor cannot handle nested arrays with non-trivial element types. 
 For example, the following code cannot be run
 {code}
 val input = env.fromCollection(List(1,2,3,4))
 val mapped = input.map{
   id = {
 val pair = (1,1.0)
 val a = Array(pair)
 val b = Array(a)
 b
   }
 }
 {code}
 The resulting type Array[Array[Tuple2[Int, Double]]] cannot be extracted. It 
 would be nice, if Flink supported these types.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-377] [FLINK-671] Generic Interface / PA...

2015-02-03 Thread zentol
Github user zentol commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72652471
  
hmm...you are now the second person to report that creating the tmp files 
does not work on OS X. i don't know why that doesn't work. the file creation is 
done from java, is there any magic required there? i can't debug OS X error 
myself at the moment. all i can do on that front is add sanity checks for 
better error reporting.

the included triangle enumeration is kinda odd, even if it runs the output 
is empty; Ive already checked the implementation yesterday and it appears equal 
to the java counterpart. will give it another go.

the plan execution is one of the more fragile parts. generally, when the 
process exits with an error it is noticed. but if for example something is 
missing (like the call to execute) things just get stuck. this is due to the 
fact that information is only ever sent to java, but never received, a complete 
one-way street. since accumulators nor actions were supposed to be implemented 
anytime soon this seemed appropriate, but it seems that requires a change 
already. some timeouts could be useful as well.

@fhueske Thanks for trying it out!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1455) ExternalSortLargeRecordsITCase.testSortWithShortMediumAndLargeRecords: Potential Memory leak

2015-02-03 Thread Stephan Ewen (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303334#comment-14303334
 ] 

Stephan Ewen commented on FLINK-1455:
-

Hmm, this seems a problem of the JVM setup on the test cluster.

The memory that it cannot allocate is not Flink memory, it is Java direct 
memory, which the NIO code internally needs for I/O: Buffers are copied to 
pooled DirectByteBuffers before issuing the I/O request.

Not sure what is happening there, this is a standalone test, no other NIO 
memory consumers in the test. Seems like an issue with the Travis VMs.

 ExternalSortLargeRecordsITCase.testSortWithShortMediumAndLargeRecords: 
 Potential Memory leak
 

 Key: FLINK-1455
 URL: https://issues.apache.org/jira/browse/FLINK-1455
 Project: Flink
  Issue Type: Bug
  Components: Local Runtime
Affects Versions: 0.9
Reporter: Robert Metzger
Priority: Minor

 This error occurred in one of my Travis jobs: 
 https://travis-ci.org/rmetzger/flink/jobs/48343022
 Would be cool if somebody who knows the sorter better could verify/invalidate 
 the issue.
 {code}
 Running org.apache.flink.runtime.operators.sort.ExternalSortLargeRecordsITCase
 java.lang.RuntimeException: Error obtaining the sorted input: Thread 
 'SortMerger spilling thread' terminated due to an exception: Cannot allocate 
 memory
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger.getIterator(UnilateralSortMerger.java:593)
   at 
 org.apache.flink.runtime.operators.sort.ExternalSortLargeRecordsITCase.testSortWithShortMediumAndLargeRecords(ExternalSortLargeRecordsITCase.java:285)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:606)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at 
 org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:264)
   at 
 org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
   at 
 org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:124)
   at 
 org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:200)
   at 
 org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:153)
   at 
 org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
 Caused by: java.io.IOException: Thread 'SortMerger spilling thread' 
 terminated due to an exception: Cannot allocate memory
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$ThreadBase.run(UnilateralSortMerger.java:770)
 Caused by: java.io.IOException: Cannot allocate memory
   at sun.nio.ch.FileDispatcherImpl.write0(Native Method)
   at sun.nio.ch.FileDispatcherImpl.write(FileDispatcherImpl.java:60)
   at sun.nio.ch.IOUtil.writeFromNativeBuffer(IOUtil.java:93)
   at sun.nio.ch.IOUtil.write(IOUtil.java:65)
   at sun.nio.ch.FileChannelImpl.write(FileChannelImpl.java:205)
   at 
 org.apache.flink.runtime.io.disk.iomanager.SegmentWriteRequest.write(AsynchronousFileIOChannel.java:267)
   at 
 org.apache.flink.runtime.io.disk.iomanager.IOManagerAsync$WriterThread.run(IOManagerAsync.java:440)
 Tests run: 5, Failures: 2, Errors: 0, Skipped: 0, Time elapsed: 56.089 sec 
  FAILURE! - in 
 

[GitHub] flink pull request: Allow KeySelectors to implement ResultTypeQuer...

2015-02-03 Thread aalexandrov
Github user aalexandrov commented on the pull request:

https://github.com/apache/flink/pull/354#issuecomment-72660318
  
I would advocate to adding this one as well as a fallback option.

I have a situation where I want to use KeySelector that might return Java 
TupleXX instances parameterized with Scala types, e.g.:

```
class SelectFoo extends KeySelector[Tuple3[Int, Int, String], Tuple3[Int, 
String]] {
override def getKey(v: Tuple3[Int, Int, Int]) = new Tuple2(v.f0, v.f2)
}
``

Even though in this cases  the generic parameters are available, the Scala 
types cannot be inferred because the actual type field type parameters are 
erased by Scala and  are seen only as java.lang.Object from the Java reflection 
API.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink pull request: [FLINK-377] [FLINK-671] Generic Interface / PA...

2015-02-03 Thread qmlmoon
Github user qmlmoon commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72668635
  
I would also like to try out some examples. I'm starting with TPCH queries:D


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (FLINK-1472) Web frontend config overview shows wrong value

2015-02-03 Thread Ufuk Celebi (JIRA)
Ufuk Celebi created FLINK-1472:
--

 Summary: Web frontend config overview shows wrong value
 Key: FLINK-1472
 URL: https://issues.apache.org/jira/browse/FLINK-1472
 Project: Flink
  Issue Type: Bug
  Components: Webfrontend
Affects Versions: master
Reporter: Ufuk Celebi
Priority: Minor


The web frontend shows configuration values even if they could not be correctly 
parsed.

For example I've configured the number of buffers as 123.000, which cannot be 
parsed as an Integer by GlobalConfiguration and the default value is used. 
Still, the web frontend shows the not used 123.000.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: Allow KeySelectors to implement ResultTypeQuer...

2015-02-03 Thread aalexandrov
Github user aalexandrov commented on the pull request:

https://github.com/apache/flink/pull/354#issuecomment-72660810
  
I think that [this StackOverflow article explains my 
problem](http://stackoverflow.com/questions/11586944/how-to-obtain-the-raw-datatype-of-a-parameter-of-a-field-that-is-specialized-in).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink pull request: [FLINK-1318] CsvInputFormat: Made quoted strin...

2015-02-03 Thread StephanEwen
Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/265#issuecomment-72612157
  
Looks good so far. What about this PR is API breaking?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink pull request: Allow KeySelectors to implement ResultTypeQuer...

2015-02-03 Thread StephanEwen
Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/354#issuecomment-72614564
  
So, this pull request can be closed?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1458) Interfaces and abstract classes are not valid types

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302966#comment-14302966
 ] 

ASF GitHub Bot commented on FLINK-1458:
---

Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/357#issuecomment-72614433
  
Looks good to me so far, modulo two comments:
  - Since the type extractor code is changed a lot, it would be good to 
have a test validating that the TypeExtractor allows abstract types and 
interfaces.
  - I would change the log level of the messages (abstract types are less 
performant, type is no POJO) to INFO, since WARN implies to me that something 
is wrong (which is not)


 Interfaces and abstract classes are not valid types
 ---

 Key: FLINK-1458
 URL: https://issues.apache.org/jira/browse/FLINK-1458
 Project: Flink
  Issue Type: Bug
Reporter: John Sandiford
Assignee: Aljoscha Krettek

 I don't know whether this is by design or is a bug, but I am having trouble 
 working with DataSet and traits in scala which is a major limitation.  A 
 simple example is shown below.  
 Compile time warning is 'Type Main.SimpleTrait has no fields that are visible 
 from Scala Type analysis. Falling back to Java Type Analysis...'
 Run time error is 'Interfaces and abstract classes are not valid types: 
 interface Main$SimpleTrait'
 Regards, John
  val env = ExecutionEnvironment.getExecutionEnvironment
   trait SimpleTrait {
 def contains(x: String): Boolean
   }
   class SimpleClass extends SimpleTrait {
 def contains(x: String) = true
   }
   val data: DataSet[Double] = env.fromElements(1.0, 2.0, 3.0, 4.0)
   def f(data: DataSet[Double]): DataSet[SimpleTrait] = {
 data.mapPartition(iterator = {
   Iterator(new SimpleClass)
 })
   }
   val g = f(data)
   g.print()
   env.execute(Simple example)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-1318] CsvInputFormat: Made quoted strin...

2015-02-03 Thread fhueske
Github user fhueske commented on the pull request:

https://github.com/apache/flink/pull/265#issuecomment-72613154
  
The API was just extended, but the parsing logic for strings changed (see 
PR description). So programs that relied on the previous way of parsing will 
fail now.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1318) Make quoted String parsing optional and configurable for CSVInputFormats

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302954#comment-14302954
 ] 

ASF GitHub Bot commented on FLINK-1318:
---

Github user fhueske commented on the pull request:

https://github.com/apache/flink/pull/265#issuecomment-72613154
  
The API was just extended, but the parsing logic for strings changed (see 
PR description). So programs that relied on the previous way of parsing will 
fail now.


 Make quoted String parsing optional and configurable for CSVInputFormats
 

 Key: FLINK-1318
 URL: https://issues.apache.org/jira/browse/FLINK-1318
 Project: Flink
  Issue Type: Improvement
  Components: Java API, Scala API
Affects Versions: 0.8
Reporter: Fabian Hueske
Assignee: Fabian Hueske
Priority: Minor

 With the current implementation of the CSVInputFormat, quoted string parsing 
 kicks in, if the first non-whitespace character of a field is a double quote. 
 I see two issues with this implementation:
 1. Quoted String parsing cannot be disabled
 2. The quoting character is fixed to double quotes ()
 I propose to add parameters to disable quoted String parsing and set the 
 quote character.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-377] [FLINK-671] Generic Interface / PA...

2015-02-03 Thread zentol
Github user zentol commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72744279
  
about error messages going to command-line: the only way i see for that to 
work is by wrapping the complete error message into an exception, since they do 
show up on the command-line.

wc deadlock: i just can't reproduce it. i tried small files (4 words) and 
went up to 750mb with dop=1. can you send me the test data you used?

@qmlmoon THANK YOU! that would have taken me ages to figure out. working on 
a fix right now.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-377) Create a general purpose framework for language bindings

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14304096#comment-14304096
 ] 

ASF GitHub Bot commented on FLINK-377:
--

Github user zentol commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72744279
  
about error messages going to command-line: the only way i see for that to 
work is by wrapping the complete error message into an exception, since they do 
show up on the command-line.

wc deadlock: i just can't reproduce it. i tried small files (4 words) and 
went up to 750mb with dop=1. can you send me the test data you used?

@qmlmoon THANK YOU! that would have taken me ages to figure out. working on 
a fix right now.


 Create a general purpose framework for language bindings
 

 Key: FLINK-377
 URL: https://issues.apache.org/jira/browse/FLINK-377
 Project: Flink
  Issue Type: Improvement
Reporter: GitHub Import
Assignee: Chesnay Schepler
  Labels: github-import
 Fix For: pre-apache


 A general purpose API to run operators with arbitrary binaries. 
 This will allow to run Stratosphere programs written in Python, JavaScript, 
 Ruby, Go or whatever you like. 
 We suggest using Google Protocol Buffers for data serialization. This is the 
 list of languages that currently support ProtoBuf: 
 https://code.google.com/p/protobuf/wiki/ThirdPartyAddOns 
 Very early prototype with python: 
 https://github.com/rmetzger/scratch/tree/learn-protobuf (basically testing 
 protobuf)
 For Ruby: https://github.com/infochimps-labs/wukong
 Two new students working at Stratosphere (@skunert and @filiphaase) are 
 working on this.
 The reference binding language will be for Python, but other bindings are 
 very welcome.
 The best name for this so far is stratosphere-lang-bindings.
 I created this issue to track the progress (and give everybody a chance to 
 comment on this)
  Imported from GitHub 
 Url: https://github.com/stratosphere/stratosphere/issues/377
 Created by: [rmetzger|https://github.com/rmetzger]
 Labels: enhancement, 
 Assignee: [filiphaase|https://github.com/filiphaase]
 Created at: Tue Jan 07 19:47:20 CET 2014
 State: open



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-377] [FLINK-671] Generic Interface / PA...

2015-02-03 Thread qmlmoon
Github user qmlmoon commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72749480
  
Great. Now it works for me


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink pull request: [FLINK-1330] [build] Build creates a link in t...

2015-02-03 Thread StephanEwen
Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/333#issuecomment-72678344
  
I modified the merged version to exclude it from rat. I also tried 
subsequent builds, it worked.

See here: 
https://github.com/apache/flink/commit/0c726f833892a03a12cc92d35e7d249369eb0dc9#diff-600376dffeb79835ede4a0b285078036R1046


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1314) Update website about #flink chat room in freenode IRC

2015-02-03 Thread Robert Metzger (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303569#comment-14303569
 ] 

Robert Metzger commented on FLINK-1314:
---

Sorry, I was too busy today. The change looks good to merge.
Thank you!

 Update website about #flink chat room in freenode IRC
 -

 Key: FLINK-1314
 URL: https://issues.apache.org/jira/browse/FLINK-1314
 Project: Flink
  Issue Type: Task
  Components: Project Website
Reporter: Henry Saputra
Assignee: Henry Saputra
Priority: Minor
 Attachments: FLINK-1314.patch


 Update Flink website to mention the #flink chat room in freenode IRC



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1314) Update website about #flink chat room in freenode IRC

2015-02-03 Thread Henry Saputra (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303567#comment-14303567
 ] 

Henry Saputra commented on FLINK-1314:
--

If no comment, will merge this end of day

 Update website about #flink chat room in freenode IRC
 -

 Key: FLINK-1314
 URL: https://issues.apache.org/jira/browse/FLINK-1314
 Project: Flink
  Issue Type: Task
  Components: Project Website
Reporter: Henry Saputra
Assignee: Henry Saputra
Priority: Minor
 Attachments: FLINK-1314.patch


 Update Flink website to mention the #flink chat room in freenode IRC



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-1458] Allow Interfaces and abstract typ...

2015-02-03 Thread aljoscha
Github user aljoscha commented on the pull request:

https://github.com/apache/flink/pull/357#issuecomment-72642370
  
You're right.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Resolved] (FLINK-1437) Bug in PojoSerializer's copy() method

2015-02-03 Thread Stephan Ewen (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stephan Ewen resolved FLINK-1437.
-
   Resolution: Fixed
Fix Version/s: 0.9

Fixed via fb7ce0e315e270e10c605dcfa269286bc2add47f

 Bug in PojoSerializer's copy() method
 -

 Key: FLINK-1437
 URL: https://issues.apache.org/jira/browse/FLINK-1437
 Project: Flink
  Issue Type: Bug
  Components: Java API
Reporter: Timo Walther
Assignee: Timo Walther
 Fix For: 0.9


 The PojoSerializer's {{copy()}} method does not work properly with {{null}} 
 values. An exception could look like:
 {code}
 Caused by: java.io.IOException: Thread 'SortMerger spilling thread' 
 terminated due to an exception: null
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$ThreadBase.run(UnilateralSortMerger.java:792)
 Caused by: java.io.EOFException
   at 
 org.apache.flink.runtime.io.disk.RandomAccessInputView.nextSegment(RandomAccessInputView.java:83)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.advance(AbstractPagedInputView.java:159)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.readByte(AbstractPagedInputView.java:270)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.readUnsignedByte(AbstractPagedInputView.java:277)
   at org.apache.flink.types.StringValue.copyString(StringValue.java:839)
   at 
 org.apache.flink.api.common.typeutils.base.StringSerializer.copy(StringSerializer.java:83)
   at 
 org.apache.flink.api.java.typeutils.runtime.PojoSerializer.copy(PojoSerializer.java:261)
   at 
 org.apache.flink.runtime.operators.sort.NormalizedKeySorter.writeToOutput(NormalizedKeySorter.java:449)
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$SpillingThread.go(UnilateralSortMerger.java:1303)
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$ThreadBase.run(UnilateralSortMerger.java:788)
 {code}
 I'm working on a fix for that...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-1330] [build] Build creates a link in t...

2015-02-03 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/333


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1458) Interfaces and abstract classes are not valid types

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303187#comment-14303187
 ] 

ASF GitHub Bot commented on FLINK-1458:
---

Github user aljoscha commented on the pull request:

https://github.com/apache/flink/pull/357#issuecomment-72642370
  
You're right.


 Interfaces and abstract classes are not valid types
 ---

 Key: FLINK-1458
 URL: https://issues.apache.org/jira/browse/FLINK-1458
 Project: Flink
  Issue Type: Bug
Reporter: John Sandiford
Assignee: Aljoscha Krettek

 I don't know whether this is by design or is a bug, but I am having trouble 
 working with DataSet and traits in scala which is a major limitation.  A 
 simple example is shown below.  
 Compile time warning is 'Type Main.SimpleTrait has no fields that are visible 
 from Scala Type analysis. Falling back to Java Type Analysis...'
 Run time error is 'Interfaces and abstract classes are not valid types: 
 interface Main$SimpleTrait'
 Regards, John
  val env = ExecutionEnvironment.getExecutionEnvironment
   trait SimpleTrait {
 def contains(x: String): Boolean
   }
   class SimpleClass extends SimpleTrait {
 def contains(x: String) = true
   }
   val data: DataSet[Double] = env.fromElements(1.0, 2.0, 3.0, 4.0)
   def f(data: DataSet[Double]): DataSet[SimpleTrait] = {
 data.mapPartition(iterator = {
   Iterator(new SimpleClass)
 })
   }
   val g = f(data)
   g.print()
   env.execute(Simple example)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (FLINK-1330) Restructure directory layout

2015-02-03 Thread Stephan Ewen (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stephan Ewen resolved FLINK-1330.
-
   Resolution: Fixed
Fix Version/s: 0.9
 Assignee: Stephan Ewen

Fixed via 0c726f833892a03a12cc92d35e7d249369eb0dc9

 Restructure directory layout
 

 Key: FLINK-1330
 URL: https://issues.apache.org/jira/browse/FLINK-1330
 Project: Flink
  Issue Type: Improvement
  Components: Build System, Documentation
Reporter: Max Michels
Assignee: Stephan Ewen
Priority: Minor
  Labels: usability
 Fix For: 0.9


 When building Flink, the build results can currently be found under 
 flink-root/flink-dist/target/flink-$FLINKVERSION-incubating-SNAPSHOT-bin/flink-$YARNVERSION-$FLINKVERSION-incubating-SNAPSHOT/.
 I think we could improve the directory layout with the following:
 - provide the bin folder in the root by default
 - let the start up and submissions scripts in bin assemble the class path
 - in case the project hasn't been build yet, inform the user
 The changes would make it easier to work with Flink from source.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-377) Create a general purpose framework for language bindings

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303266#comment-14303266
 ] 

ASF GitHub Bot commented on FLINK-377:
--

Github user zentol commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72652471
  
hmm...you are now the second person to report that creating the tmp files 
does not work on OS X. i don't know why that doesn't work. the file creation is 
done from java, is there any magic required there? i can't debug OS X error 
myself at the moment. all i can do on that front is add sanity checks for 
better error reporting.

the included triangle enumeration is kinda odd, even if it runs the output 
is empty; Ive already checked the implementation yesterday and it appears equal 
to the java counterpart. will give it another go.

the plan execution is one of the more fragile parts. generally, when the 
process exits with an error it is noticed. but if for example something is 
missing (like the call to execute) things just get stuck. this is due to the 
fact that information is only ever sent to java, but never received, a complete 
one-way street. since accumulators nor actions were supposed to be implemented 
anytime soon this seemed appropriate, but it seems that requires a change 
already. some timeouts could be useful as well.

@fhueske Thanks for trying it out!


 Create a general purpose framework for language bindings
 

 Key: FLINK-377
 URL: https://issues.apache.org/jira/browse/FLINK-377
 Project: Flink
  Issue Type: Improvement
Reporter: GitHub Import
Assignee: Chesnay Schepler
  Labels: github-import
 Fix For: pre-apache


 A general purpose API to run operators with arbitrary binaries. 
 This will allow to run Stratosphere programs written in Python, JavaScript, 
 Ruby, Go or whatever you like. 
 We suggest using Google Protocol Buffers for data serialization. This is the 
 list of languages that currently support ProtoBuf: 
 https://code.google.com/p/protobuf/wiki/ThirdPartyAddOns 
 Very early prototype with python: 
 https://github.com/rmetzger/scratch/tree/learn-protobuf (basically testing 
 protobuf)
 For Ruby: https://github.com/infochimps-labs/wukong
 Two new students working at Stratosphere (@skunert and @filiphaase) are 
 working on this.
 The reference binding language will be for Python, but other bindings are 
 very welcome.
 The best name for this so far is stratosphere-lang-bindings.
 I created this issue to track the progress (and give everybody a chance to 
 comment on this)
  Imported from GitHub 
 Url: https://github.com/stratosphere/stratosphere/issues/377
 Created by: [rmetzger|https://github.com/rmetzger]
 Labels: enhancement, 
 Assignee: [filiphaase|https://github.com/filiphaase]
 Created at: Tue Jan 07 19:47:20 CET 2014
 State: open



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (FLINK-1471) Allow KeySelectors to implement ResultTypeQueryable

2015-02-03 Thread Stephan Ewen (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stephan Ewen resolved FLINK-1471.
-
   Resolution: Fixed
Fix Version/s: 0.9

Fixed via e3f6c9ba69a3e545fdd8f18b7b652fa111ade93e

Thanks for the patch!

 Allow KeySelectors to implement ResultTypeQueryable
 ---

 Key: FLINK-1471
 URL: https://issues.apache.org/jira/browse/FLINK-1471
 Project: Flink
  Issue Type: Bug
  Components: Java API
Affects Versions: 0.9
Reporter: Robert Metzger
Assignee: Timo Walther
 Fix For: 0.9


 See https://github.com/apache/flink/pull/354



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (FLINK-1464) Added ResultTypeQueryable interface to TypeSerializerInputFormat.

2015-02-03 Thread Robert Metzger (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Metzger resolved FLINK-1464.
---
Resolution: Fixed

Fixed via e3f6c9ba69a3e545fdd8f18b7b652fa111ade93e
Thanks for the patch!

The fix has been merged by Stephan Ewen.


 Added ResultTypeQueryable interface to TypeSerializerInputFormat.
 -

 Key: FLINK-1464
 URL: https://issues.apache.org/jira/browse/FLINK-1464
 Project: Flink
  Issue Type: Improvement
  Components: Distributed Runtime, Optimizer
Affects Versions: 0.8, 0.9, 0.8.1
Reporter: Alexander Alexandrov
Assignee: Alexander Alexandrov
Priority: Minor
  Labels: easyfix
 Fix For: 0.9, 0.8.1

   Original Estimate: 6h
  Remaining Estimate: 6h

 It is currently impossible to use the {{TypeSerializerInputFormat}} with 
 generic Tuple types.
 For example, [this example 
 gist|https://gist.github.com/aalexandrov/90bf21f66bf604676f37] fails with a
 {quote}
 Exception in thread main 
 org.apache.flink.api.common.InvalidProgramException: The type returned by the 
 input format could not be automatically determined. Please specify the 
 TypeInformation of the produced type explicitly.
 at 
 org.apache.flink.api.java.ExecutionEnvironment.readFile(ExecutionEnvironment.java:341)
 at SerializedFormatExample$.main(SerializedFormatExample.scala:48)
 at SerializedFormatExample.main(SerializedFormatExample.scala)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at com.intellij.rt.execution.application.AppMain.main(AppMain.java:134)
 {quote}
 exaception. 
 To fix the issue, I changed the constructor to take a {{TypeInformationT}} 
 instad of a {{TypeSerializerT}} argument. If this is indeed a bug, I think 
 that this is a good solution. 
 Unfortunately the fix breaks the API. Feel free to change it if you find a 
 more elegant solution compatible with the 0.8 branch.
 The suggested fix can be found in the GitHub 
 [PR#349|https://github.com/apache/flink/pull/349].



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Reopened] (FLINK-1471) Allow KeySelectors to implement ResultTypeQueryable

2015-02-03 Thread Robert Metzger (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Metzger reopened FLINK-1471:
---

The commit Stephan mentions fixes another issue.
This is still unresolved.

 Allow KeySelectors to implement ResultTypeQueryable
 ---

 Key: FLINK-1471
 URL: https://issues.apache.org/jira/browse/FLINK-1471
 Project: Flink
  Issue Type: Bug
  Components: Java API
Affects Versions: 0.9
Reporter: Robert Metzger
Assignee: Timo Walther
 Fix For: 0.9


 See https://github.com/apache/flink/pull/354



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-592] Add support for Kerberos secured Y...

2015-02-03 Thread rmetzger
GitHub user rmetzger opened a pull request:

https://github.com/apache/flink/pull/358

[FLINK-592] Add support for Kerberos secured YARN setups to Flink.

This pull request is basically a port of @warneke's branch 
(https://github.com/warneke/flink/tree/security) to the latest `master` of 
Flink.

The port has been done mostly by @mxm. 
We tested the change on google compute engine (non-secure setup, to ensure 
that everything is working as before) and a local secure YARN setup with 
Kerberos.

Open issues:
- Test token renewal 

Once the open issues have been resolved, I would like to merge this asap 
because a user was asking for this on the mailing list.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/rmetzger/flink flink592

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/flink/pull/358.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #358


commit 3fc8d47f3f7322285539454c7a80a8cec4ba043f
Author: Max m...@posteo.de
Date:   2015-02-02T15:09:18Z

[FLINK-592] Add support for Kerberos secured YARN setups to Flink.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-377) Create a general purpose framework for language bindings

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14304192#comment-14304192
 ] 

ASF GitHub Bot commented on FLINK-377:
--

Github user zentol commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72752968
  
@qmlmoon sweet. 

@rmetzger errors should show up on the console now. and in the .out file. 
and i suppose by extension in the .log file aswell.


 Create a general purpose framework for language bindings
 

 Key: FLINK-377
 URL: https://issues.apache.org/jira/browse/FLINK-377
 Project: Flink
  Issue Type: Improvement
Reporter: GitHub Import
Assignee: Chesnay Schepler
  Labels: github-import
 Fix For: pre-apache


 A general purpose API to run operators with arbitrary binaries. 
 This will allow to run Stratosphere programs written in Python, JavaScript, 
 Ruby, Go or whatever you like. 
 We suggest using Google Protocol Buffers for data serialization. This is the 
 list of languages that currently support ProtoBuf: 
 https://code.google.com/p/protobuf/wiki/ThirdPartyAddOns 
 Very early prototype with python: 
 https://github.com/rmetzger/scratch/tree/learn-protobuf (basically testing 
 protobuf)
 For Ruby: https://github.com/infochimps-labs/wukong
 Two new students working at Stratosphere (@skunert and @filiphaase) are 
 working on this.
 The reference binding language will be for Python, but other bindings are 
 very welcome.
 The best name for this so far is stratosphere-lang-bindings.
 I created this issue to track the progress (and give everybody a chance to 
 comment on this)
  Imported from GitHub 
 Url: https://github.com/stratosphere/stratosphere/issues/377
 Created by: [rmetzger|https://github.com/rmetzger]
 Labels: enhancement, 
 Assignee: [filiphaase|https://github.com/filiphaase]
 Created at: Tue Jan 07 19:47:20 CET 2014
 State: open



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-377] [FLINK-671] Generic Interface / PA...

2015-02-03 Thread zentol
Github user zentol commented on the pull request:

https://github.com/apache/flink/pull/202#issuecomment-72752968
  
@qmlmoon sweet. 

@rmetzger errors should show up on the console now. and in the .out file. 
and i suppose by extension in the .log file aswell.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink pull request: Allow KeySelectors to implement ResultTypeQuer...

2015-02-03 Thread aljoscha
Github user aljoscha commented on the pull request:

https://github.com/apache/flink/pull/354#issuecomment-72752435
  
Nope, sorry, also have no Idea why this is happening.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] flink pull request: [FLINK-1376] [runtime] Add proper shared slot ...

2015-02-03 Thread StephanEwen
Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/317#issuecomment-72618925
  
I think this is a good fix, overall. There is one issue I would really like 
to fix, and that is the serializability of the `Instance` class. This class is 
not meant to be serialized and moved around, which can be reflected by the fact 
that it holds an Actor Ref, and the necessity to make a lot of the fields 
transient.

I assume that the instance needs to be serialized as part of the 
ExechutionGraph archiving, where the ExecutionGraph is sent via an actor 
message to the archiver.

I would like to solve that differently. The execution graph is cleaned 
before archiving (see #344 ) to reduce memory footprint. At this point, I would 
replace the `Instance` in the Executions with the `Instance Connection Info`, 
which holds all info necessary. Then we won't have to send instances through 
actor messages, which would be the cleaner solution.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1471) Allow KeySelectors to implement ResultTypeQueryable

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303007#comment-14303007
 ] 

ASF GitHub Bot commented on FLINK-1471:
---

Github user rmetzger commented on the pull request:

https://github.com/apache/flink/pull/354#issuecomment-72620298
  
I'll close it.
I've filed a jira for the issue and assigned it to @twalthr: 
https://issues.apache.org/jira/browse/FLINK-1471


 Allow KeySelectors to implement ResultTypeQueryable
 ---

 Key: FLINK-1471
 URL: https://issues.apache.org/jira/browse/FLINK-1471
 Project: Flink
  Issue Type: Bug
  Components: Java API
Affects Versions: 0.9
Reporter: Robert Metzger
Assignee: Timo Walther

 See https://github.com/apache/flink/pull/354



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-1464] Add ResultTypeQueryable interface...

2015-02-03 Thread StephanEwen
Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/349#issuecomment-72631504
  
Looks good to me. I think the API breaking is minor, since it does not 
affect the methods on `DataSet` or on any of the operators. The 
TypeSerializerInputFormat is only used explicitly and is rather new, so it 
should be okay.

+1

Will merge this.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1464) Added ResultTypeQueryable interface to TypeSerializerInputFormat.

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303075#comment-14303075
 ] 

ASF GitHub Bot commented on FLINK-1464:
---

Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/349#issuecomment-72631504
  
Looks good to me. I think the API breaking is minor, since it does not 
affect the methods on `DataSet` or on any of the operators. The 
TypeSerializerInputFormat is only used explicitly and is rather new, so it 
should be okay.

+1

Will merge this.


 Added ResultTypeQueryable interface to TypeSerializerInputFormat.
 -

 Key: FLINK-1464
 URL: https://issues.apache.org/jira/browse/FLINK-1464
 Project: Flink
  Issue Type: Improvement
  Components: Distributed Runtime, Optimizer
Affects Versions: 0.8, 0.9, 0.8.1
Reporter: Alexander Alexandrov
Assignee: Alexander Alexandrov
Priority: Minor
  Labels: easyfix
 Fix For: 0.9, 0.8.1

   Original Estimate: 6h
  Remaining Estimate: 6h

 It is currently impossible to use the {{TypeSerializerInputFormat}} with 
 generic Tuple types.
 For example, [this example 
 gist|https://gist.github.com/aalexandrov/90bf21f66bf604676f37] fails with a
 {quote}
 Exception in thread main 
 org.apache.flink.api.common.InvalidProgramException: The type returned by the 
 input format could not be automatically determined. Please specify the 
 TypeInformation of the produced type explicitly.
 at 
 org.apache.flink.api.java.ExecutionEnvironment.readFile(ExecutionEnvironment.java:341)
 at SerializedFormatExample$.main(SerializedFormatExample.scala:48)
 at SerializedFormatExample.main(SerializedFormatExample.scala)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at com.intellij.rt.execution.application.AppMain.main(AppMain.java:134)
 {quote}
 exaception. 
 To fix the issue, I changed the constructor to take a {{TypeInformationT}} 
 instad of a {{TypeSerializerT}} argument. If this is indeed a bug, I think 
 that this is a good solution. 
 Unfortunately the fix breaks the API. Feel free to change it if you find a 
 more elegant solution compatible with the 0.8 branch.
 The suggested fix can be found in the GitHub 
 [PR#349|https://github.com/apache/flink/pull/349].



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: Improved exception for missing type of InputFo...

2015-02-03 Thread StephanEwen
Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/351#issuecomment-72629129
  
I tried to implement Henry's idea, but I noticed the error message is very 
specific to the site where it is created, since it refers to an alternate 
method of `DataSet`. It does not make sense to have that specific message in a 
different place. I would leave it as it is for now...


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1376) SubSlots are not properly released in case that a TaskManager fatally fails, leaving the system in a corrupted state

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302997#comment-14302997
 ] 

ASF GitHub Bot commented on FLINK-1376:
---

Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/317#issuecomment-72618925
  
I think this is a good fix, overall. There is one issue I would really like 
to fix, and that is the serializability of the `Instance` class. This class is 
not meant to be serialized and moved around, which can be reflected by the fact 
that it holds an Actor Ref, and the necessity to make a lot of the fields 
transient.

I assume that the instance needs to be serialized as part of the 
ExechutionGraph archiving, where the ExecutionGraph is sent via an actor 
message to the archiver.

I would like to solve that differently. The execution graph is cleaned 
before archiving (see #344 ) to reduce memory footprint. At this point, I would 
replace the `Instance` in the Executions with the `Instance Connection Info`, 
which holds all info necessary. Then we won't have to send instances through 
actor messages, which would be the cleaner solution.


 SubSlots are not properly released in case that a TaskManager fatally fails, 
 leaving the system in a corrupted state
 

 Key: FLINK-1376
 URL: https://issues.apache.org/jira/browse/FLINK-1376
 Project: Flink
  Issue Type: Bug
Reporter: Till Rohrmann
Assignee: Till Rohrmann

 In case that the TaskManager fatally fails and some of the failing node's 
 slots are SharedSlots, then the slots are not properly released by the 
 JobManager. This causes that the corresponding job will not be properly 
 failed, leaving the system in a corrupted state.
 The reason for that is that the AllocatedSlot is not aware of being treated 
 as a SharedSlot and thus he cannot release the associated SubSlots.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: Improved exception for missing type of InputFo...

2015-02-03 Thread StephanEwen
Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/351#issuecomment-72623931
  
I'll merge this...


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1437) Bug in PojoSerializer's copy() method

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303028#comment-14303028
 ] 

ASF GitHub Bot commented on FLINK-1437:
---

Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/342#issuecomment-72623753
  
Ah, just saw that the tests are there. My bad...


 Bug in PojoSerializer's copy() method
 -

 Key: FLINK-1437
 URL: https://issues.apache.org/jira/browse/FLINK-1437
 Project: Flink
  Issue Type: Bug
  Components: Java API
Reporter: Timo Walther
Assignee: Timo Walther

 The PojoSerializer's {{copy()}} method does not work properly with {{null}} 
 values. An exception could look like:
 {code}
 Caused by: java.io.IOException: Thread 'SortMerger spilling thread' 
 terminated due to an exception: null
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$ThreadBase.run(UnilateralSortMerger.java:792)
 Caused by: java.io.EOFException
   at 
 org.apache.flink.runtime.io.disk.RandomAccessInputView.nextSegment(RandomAccessInputView.java:83)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.advance(AbstractPagedInputView.java:159)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.readByte(AbstractPagedInputView.java:270)
   at 
 org.apache.flink.runtime.memorymanager.AbstractPagedInputView.readUnsignedByte(AbstractPagedInputView.java:277)
   at org.apache.flink.types.StringValue.copyString(StringValue.java:839)
   at 
 org.apache.flink.api.common.typeutils.base.StringSerializer.copy(StringSerializer.java:83)
   at 
 org.apache.flink.api.java.typeutils.runtime.PojoSerializer.copy(PojoSerializer.java:261)
   at 
 org.apache.flink.runtime.operators.sort.NormalizedKeySorter.writeToOutput(NormalizedKeySorter.java:449)
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$SpillingThread.go(UnilateralSortMerger.java:1303)
   at 
 org.apache.flink.runtime.operators.sort.UnilateralSortMerger$ThreadBase.run(UnilateralSortMerger.java:788)
 {code}
 I'm working on a fix for that...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: Allow KeySelectors to implement ResultTypeQuer...

2015-02-03 Thread rmetzger
Github user rmetzger closed the pull request at:

https://github.com/apache/flink/pull/354


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (FLINK-1471) Allow KeySelectors to implement ResultTypeQueryable

2015-02-03 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-1471:
-

 Summary: Allow KeySelectors to implement ResultTypeQueryable
 Key: FLINK-1471
 URL: https://issues.apache.org/jira/browse/FLINK-1471
 Project: Flink
  Issue Type: Bug
  Components: Java API
Affects Versions: 0.9
Reporter: Robert Metzger
Assignee: Timo Walther


See https://github.com/apache/flink/pull/354



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: Allow KeySelectors to implement ResultTypeQuer...

2015-02-03 Thread rmetzger
Github user rmetzger commented on the pull request:

https://github.com/apache/flink/pull/354#issuecomment-72620298
  
I'll close it.
I've filed a jira for the issue and assigned it to @twalthr: 
https://issues.apache.org/jira/browse/FLINK-1471


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1471) Allow KeySelectors to implement ResultTypeQueryable

2015-02-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14304088#comment-14304088
 ] 

ASF GitHub Bot commented on FLINK-1471:
---

GitHub user twalthr opened a pull request:

https://github.com/apache/flink/pull/359

[FLINK-1471][java-api] Fixes wrong input validation if function has no 
generics

FLINK-1471 was not implemented properly. See also #354.

This PR skips the input validation if no generic parameters are available.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/twalthr/flink UnParameterizedFunctions

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/flink/pull/359.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #359


commit 7f70d7ef500cbb572a66f775d35609ac5903f767
Author: twalthr twal...@apache.org
Date:   2015-02-03T21:51:10Z

[FLINK-1471][java-api] Fixes wrong input validation if function has no 
generics




 Allow KeySelectors to implement ResultTypeQueryable
 ---

 Key: FLINK-1471
 URL: https://issues.apache.org/jira/browse/FLINK-1471
 Project: Flink
  Issue Type: Bug
  Components: Java API
Affects Versions: 0.9
Reporter: Robert Metzger
Assignee: Timo Walther
 Fix For: 0.9


 See https://github.com/apache/flink/pull/354



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)