[jira] [Updated] (SPARK-47734) Fix flaky pyspark.sql.dataframe.DataFrame.writeStream doctest by stopping streaming query

2024-04-07 Thread Jungtaek Lim (Jira)


 [ https://issues.apache.org/jira/browse/SPARK-47734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jungtaek Lim updated SPARK-47734:
---------------------------------
Fix Version/s: 3.4.3

> Fix flaky pyspark.sql.dataframe.DataFrame.writeStream doctest by stopping
> streaming query
> ---------------------------------------------------------------------------
>
>                 Key: SPARK-47734
>                 URL: https://issues.apache.org/jira/browse/SPARK-47734
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark, Tests
>    Affects Versions: 4.0.0
>            Reporter: Josh Rosen
>            Assignee: Josh Rosen
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0, 3.5.2, 3.4.3
>
>
> https://issues.apache.org/jira/browse/SPARK-47199 didn't fix the flakiness in
> the pyspark.sql.dataframe.DataFrame.writeStream doctest: the problem is not
> that we are colliding on the test but, rather, that the test starts a
> background thread that writes to a directory and then deletes that directory
> from the main test thread, which is inherently race-prone.
> The fix is simple: stop the streaming query in the doctest itself, as the
> other streaming doctest examples do.
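
The ordering described in the issue can be illustrated with a minimal, stdlib-only sketch. This is not Spark code and not the actual doctest change: the `writer` thread is a hypothetical stand-in for the streaming query's background thread, and setting the event followed by `join()` plays the role of stopping the query before the temporary directory is cleaned up.

```python
import tempfile
import threading
import time
from pathlib import Path


def run_example() -> bool:
    """Start a background writer, stop it, and only then delete its directory."""
    stop_requested = threading.Event()
    errors = []

    def writer(directory: Path) -> None:
        # Hypothetical stand-in for the streaming query's background thread:
        # it keeps creating files until it is asked to stop.
        batch = 0
        try:
            while not stop_requested.is_set():
                (directory / f"batch-{batch}.txt").write_text("data")
                batch += 1
                time.sleep(0.01)
        except OSError as exc:
            # A write that raced with directory deletion would be recorded here.
            errors.append(exc)

    with tempfile.TemporaryDirectory(prefix="writeStream") as d:
        worker = threading.Thread(target=writer, args=(Path(d),))
        worker.start()
        time.sleep(0.1)  # let the writer produce a few files

        # Stop the writer and wait for it to finish *before* leaving the
        # with-block, because leaving the block deletes the directory.
        stop_requested.set()
        worker.join()

    # True when the writer stopped cleanly and the directory was removed.
    return not worker.is_alive() and not errors and not Path(d).exists()


if __name__ == "__main__":
    print(run_example())
```

Without the `set()` and `join()` calls inside the `with` block, the cleanup at the end of the block would run while the writer is still creating files, which is the race the issue describes.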



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-47734) Fix flaky pyspark.sql.dataframe.DataFrame.writeStream doctest by stopping streaming query

2024-04-04 Thread ASF GitHub Bot (Jira)


 [ https://issues.apache.org/jira/browse/SPARK-47734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated SPARK-47734:
-----------------------------------
Labels: pull-request-available  (was: )

> Fix flaky pyspark.sql.dataframe.DataFrame.writeStream doctest by stopping
> streaming query
> ---------------------------------------------------------------------------
>
>                 Key: SPARK-47734
>                 URL: https://issues.apache.org/jira/browse/SPARK-47734
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark, Tests
>    Affects Versions: 4.0.0
>            Reporter: Josh Rosen
>            Assignee: Josh Rosen
>            Priority: Major
>              Labels: pull-request-available
>
> https://issues.apache.org/jira/browse/SPARK-47199 didn't fix the flakiness in
> the pyspark.sql.dataframe.DataFrame.writeStream doctest: the problem is not
> that we are colliding on the test but, rather, that the test starts a
> background thread that writes to a directory and then deletes that directory
> from the main test thread, which is inherently race-prone.
> The fix is simple: stop the streaming query in the doctest itself, as the
> other streaming doctest examples do.


