Hi Mich.

So it sounds like what you're really after is a way to apply new stream options 
in runtime without downtime?

BR, Martin
________________________________
From: Mich Talebzadeh <mich.talebza...@gmail.com>
Sent: Tuesday, March 14, 2023 16:39
To: Martin Andersson <martin.anders...@kambi.com>
Cc: Spark dev list <dev@spark.apache.org>
Subject: Re: Adding pause() method to pyspark.sql.streaming.StreamingQuery


EXTERNAL SENDER. Do not click links or open attachments unless you recognize 
the sender and know the content is safe. DO NOT provide your username or 
password.


Hi Martin,

I see the major benefit of the spark stop() method in giving the ability to 
shut down the main topic gracefully. I have explained this in this SPIP
SPIP: Shutting down spark structured streaming when the streaming process 
completed current 
process<https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fissues.apache.org%2Fjira%2Fbrowse%2FSPARK-42485&data=05%7C01%7Cmartin.andersson%40kambi.com%7C2dca72df238543466f2608db24a251fd%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638144051802324559%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=nZi1EIEwxao28460TFv0Brfd6qx52w8vi21dG2OlSo0%3D&reserved=0>

With regard to pause() I saw a request from a member


Spark Structured Streaming] Could we apply new options of 
readStream/writeStream without stopping spark application (zero 
downtime)?<https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.apache.org%2Fthread%2Fynboft2lw1zot70b8rolonpb6mrprnro&data=05%7C01%7Cmartin.andersson%40kambi.com%7C2dca72df238543466f2608db24a251fd%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638144051802324559%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=N64dNtmy%2FC6yAXn8Ju4UjrZq129DksqG9u1nQynfCaY%3D&reserved=0>


I think it would be good to have this paus() added so we can adjust spark 
streaming parameters without shutting down the streaming process., effectively 
with zero streaming downtime. This "change" is a challenge because the 
parameters can only change at the start-up until now.


HTH


 
[https://ci3.googleusercontent.com/mail-sig/AIorK4zholKucR2Q9yMrKbHNn-o1TuS4mYXyi2KO6Xmx6ikHPySa9MLaLZ8t2hrA6AUcxSxDgHIwmKE]
   view my Linkedin 
profile<https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.linkedin.com%2Fin%2Fmich-talebzadeh-ph-d-5205b2%2F&data=05%7C01%7Cmartin.andersson%40kambi.com%7C2dca72df238543466f2608db24a251fd%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638144051802324559%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=Y13vHqLxjyBt1ONR80snOKIG28P%2BQz6GGIjjnYhyUjQ%3D&reserved=0>


 
https://en.everybodywiki.com/Mich_Talebzadeh<https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.everybodywiki.com%2FMich_Talebzadeh&data=05%7C01%7Cmartin.andersson%40kambi.com%7C2dca72df238543466f2608db24a251fd%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638144051802324559%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=tVVB50KpDC1TLE0mGUJsxsIdR4Q8Z8xMqyVHiYL%2F90g%3D&reserved=0>



Disclaimer: Use it at your own risk. Any and all responsibility for any loss, 
damage or destruction of data or any other property which may arise from 
relying on this email's technical content is explicitly disclaimed. The author 
will in no case be liable for any monetary damages arising from such loss, 
damage or destruction.




On Tue, 14 Mar 2023 at 12:33, Martin Andersson 
<martin.anders...@kambi.com<mailto:martin.anders...@kambi.com>> wrote:
Hi Mich.

I'm trying to understand, can you please provide some use-cases where it would 
be beneficial with a pause and how a pause would differ functionally from a 
stop?

Best regards, Martin
________________________________
From: Mich Talebzadeh 
<mich.talebza...@gmail.com<mailto:mich.talebza...@gmail.com>>
Sent: Thursday, March 9, 2023 17:12
To: Spark dev list <dev@spark.apache.org<mailto:dev@spark.apache.org>>
Subject: Adding pause() method to pyspark.sql.streaming.StreamingQuery


EXTERNAL SENDER. Do not click links or open attachments unless you recognize 
the sender and know the content is safe. DO NOT provide your username or 
password.



Hi,


Currently for Spark Streaming we have the following class:


pyspark.sql.streaming.StreamingQuery<https://gbr01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fpyspark.sql.streaming.streamingquery%2F&data=05%7C01%7Cmartin.andersson%40kambi.com%7C2dca72df238543466f2608db24a251fd%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638144051802324559%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=7fenpPrRk1OHAN3vdiXBKjHymExOxsW4rhl7EgBph8I%3D&reserved=0>


There are a number of useful methods, for example stop() which stops the 
streaming process gracefully.


Can we add another method pause() so w can pause the processing. This will come 
handy in a number of occasions?



Thanks



 
[https://ci3.googleusercontent.com/mail-sig/AIorK4zholKucR2Q9yMrKbHNn-o1TuS4mYXyi2KO6Xmx6ikHPySa9MLaLZ8t2hrA6AUcxSxDgHIwmKE]
   view my Linkedin 
profile<https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.linkedin.com%2Fin%2Fmich-talebzadeh-ph-d-5205b2%2F&data=05%7C01%7Cmartin.andersson%40kambi.com%7C2dca72df238543466f2608db24a251fd%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638144051802324559%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=Y13vHqLxjyBt1ONR80snOKIG28P%2BQz6GGIjjnYhyUjQ%3D&reserved=0>


 
https://en.everybodywiki.com/Mich_Talebzadeh<https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.everybodywiki.com%2FMich_Talebzadeh&data=05%7C01%7Cmartin.andersson%40kambi.com%7C2dca72df238543466f2608db24a251fd%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638144051802324559%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=tVVB50KpDC1TLE0mGUJsxsIdR4Q8Z8xMqyVHiYL%2F90g%3D&reserved=0>



Disclaimer: Use it at your own risk. Any and all responsibility for any loss, 
damage or destruction of data or any other property which may arise from 
relying on this email's technical content is explicitly disclaimed. The author 
will in no case be liable for any monetary damages arising from such loss, 
damage or destruction.


Reply via email to