Re: [Spark Streaming][Problem with DataFrame UDFs]

2016-01-21 Thread Jean-Pierre OCALAN
called 10 times, although it gets consistently called >>> 50 >>> times, but the resulting DF is correct and when executing a count() >>> properly >>> return 10, as expected. >>> >>> I have changed my code to work directly with RDDs using mapPartitions

Re: [Spark Streaming][Problem with DataFrame UDFs]

2016-01-21 Thread Jean-Pierre OCALAN
gt;> >> As additional information, I have set spark.speculation to false and no >> tasks failed. >> >> I am working on a smaller example that would isolate this potential issue, >> but in the meantime I

Re: [Spark Streaming][Problem with DataFrame UDFs]

2016-01-21 Thread Cody Koeninger
alse and no > tasks failed. > > I am working on a smaller example that would isolate this potential issue, > but in the meantime I would like to know if somebody encountered this > issue. > > Thank you. > > > > -- > View this message in context: > http://apache-sp

[Spark Streaming][Problem with DataFrame UDFs]

2016-01-20 Thread jpocalan
.1001560.n3.nabble.com/Spark-Streaming-Problem-with-DataFrame-UDFs-tp26024.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands