heap overflow within seconds : pyspark kinesis stream with Spark 2.1.0

2017-04-22 Thread s t
Hi, I hope I am just missing a very simple point that leaves me stuck on this kind of error. http://stackoverflow.com/questions/43560807/pyspark-streaming-from-kinesis-kills-heap Regards, Serkan

Off heap memory settings and Tungsten

2017-04-22 Thread geoHeil
Hi, I wonder when to enable Spark's off-heap settings. Shouldn't Tungsten enable these automatically in 2.1? http://stackoverflow.com/questions/43330902/spark-off-heap-memory-config-and-tungsten Regards, Georg

Re: how to add new column using regular expression within pyspark dataframe

2017-04-22 Thread Zeming Yu
Thanks a lot! Just another question: how can I extract the minutes as a number? I can use .withColumn('duration_m', split(flight.duration, 'h').getItem(1)) to get strings like '10m', but how do I drop the character "m" at the end? I can use substr(), but what's the function to get the length of
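One possible approach to the question above, sketched in plain Python so that it runs without a Spark installation: capture the digits in front of the "m" with a regular expression instead of splitting and then trimming. The input format ('2h10m') is an assumption based on the '10m' example in the message, and the helper name extract_minutes is hypothetical. In PySpark, a pattern like this could be passed to pyspark.sql.functions.regexp_extract with group index 1, and the result cast to an integer.

```python
import re

# One capture group: the run of digits immediately before an "m".
# Assumed input format: strings such as "2h10m" or "10m".
MINUTES_PATTERN = r"(\d+)m"


def extract_minutes(duration):
    """Return the minutes part of a duration string as an int, or None if absent."""
    match = re.search(MINUTES_PATTERN, duration)
    return int(match.group(1)) if match else None


if __name__ == "__main__":
    for value in ["2h10m", "10m", "3h"]:
        print(value, extract_minutes(value))
```

Matching the digits directly avoids needing the string length at all, which sidesteps the substr() question.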

Re: Spark SQL - Global Temporary View is not behaving as expected

2017-04-22 Thread vincent gromakowski
Look at Alluxio for sharing across drivers, or Spark JobServer. On 22 Apr 2017 at 10:24 AM, "Hemanth Gudela" wrote: > Thanks for your reply. > Creating a table is an option, but such approach slows down reads & writes for a real-time analytics streaming use

Re: Spark SQL - Global Temporary View is not behaving as expected

2017-04-22 Thread Hemanth Gudela
Thanks for your reply. Creating a table is an option, but such an approach slows down reads & writes for a real-time analytics streaming use case that I’m currently working on. If global temporary views were accessible across sessions/Spark contexts, that would have simplified my

Re: Spark SQL - Global Temporary View is not behaving as expected

2017-04-22 Thread Felix Cheung
Cross-session in this context means multiple Spark sessions from the same Spark context. Since you are running two shells, you have different Spark contexts. Do you have to use a temp view? Could you create a table? From: Hemanth Gudela

Spark SQL - Global Temporary View is not behaving as expected

2017-04-22 Thread Hemanth Gudela
Hi, According to the documentation, global temporary views are cross-session accessible. But when I try to query a global temporary view from another spark-shell like this --> Instance 1 of spark-shell

Re: Spark structured streaming: Is it possible to periodically refresh static data frame?

2017-04-22 Thread Hemanth Gudela
Thank you, Georg and Gene, for your ideas. For now, I am using "Futures" to asynchronously run a background thread that periodically creates a new dataframe fetching the latest data from the underlying table, and re-registers the temp view with the same name as used by the main thread’s static dataframe. This
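The periodic-refresh idea described above can be sketched with a background thread in plain Python. This is a stand-in for the Futures-based approach in the message, not the author's code: the class name RefreshingReference and the loader callable are hypothetical. In a Spark job the loader would presumably build the new dataframe and re-register the temp view; here it simply returns a value so that the example runs on its own.

```python
import threading


class RefreshingReference:
    """Holds a value that a background thread reloads at a fixed interval."""

    def __init__(self, loader, interval_seconds):
        self._loader = loader            # hypothetical callable returning fresh data
        self._interval = interval_seconds
        self._lock = threading.Lock()
        self._stop = threading.Event()
        self._value = loader()           # load once so readers always see data
        self._thread = threading.Thread(target=self._run, daemon=True)

    def refresh(self):
        """Reload the data and swap the shared reference under the lock."""
        fresh = self._loader()
        with self._lock:
            self._value = fresh

    def _run(self):
        # Event.wait returns False on timeout and True once stop() was called.
        while not self._stop.wait(self._interval):
            self.refresh()

    def start(self):
        self._thread.start()

    def get(self):
        with self._lock:
            return self._value

    def stop(self):
        self._stop.set()
        if self._thread.is_alive():
            self._thread.join()
```

The loader runs outside the lock, so readers calling get() are blocked only for the brief reference swap and not for the duration of the reload.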