[
https://issues.apache.org/jira/browse/SPARK-9805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14680997#comment-14680997
]
Joseph K. Bradley commented on SPARK-9805:
------------------------------------------
Example failures:
*
[https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40323/console]
*
[https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40202/console]
> Make streaming PySpark ML tests more robust using termination conditions
> ------------------------------------------------------------------------
>
> Key: SPARK-9805
> URL: https://issues.apache.org/jira/browse/SPARK-9805
> Project: Spark
> Issue Type: Test
> Components: MLlib, PySpark, Streaming
> Affects Versions: 1.5.0
> Reporter: Joseph K. Bradley
> Assignee: Joseph K. Bradley
>
> Recently, PySpark ML streaming tests have been flaky, most likely because of
> the batches not being processed in time. Proposal: Replace the use of
> _ssc_wait (which waits for a fixed amount of time) with a method which waits
> for a fixed amount of time but can terminate early based on a termination
> condition method. With this, we can extend the waiting period (to make tests
> less flaky) but also stop early when possible (making tests faster on
> average).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]