[ 
https://issues.apache.org/jira/browse/SPARK-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14647220#comment-14647220
 ] 

Jeremy Freeman commented on SPARK-9461:
---------------------------------------

I'd definitely be curious to see whether these two tests pass again with a 
relaxed tolerance (say, double the current tolerance, which would have fixed 
the failure in your initial comment) while we get to the core of the issue. 
Maybe try that on your PR?
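
For reference, here is a rough sketch of how the tolerance behaves, using the 
two values from the failure message quoted below. The helper names 
within_places and within_delta are hypothetical, made up for illustration; 
they only mirror my understanding of how unittest's assertAlmostEqual treats 
its places and delta arguments, and are not code from tests.py.

```python
# Values copied from the quoted AssertionError.
a = 9.4243238731093655
b = 9.3216175551722014


def within_places(x, y, places):
    """Hypothetical helper: same check as assertAlmostEqual(x, y, places)."""
    return round(abs(x - y), places) == 0


def within_delta(x, y, delta):
    """Hypothetical helper: same check as assertAlmostEqual(x, y, delta=delta)."""
    return abs(x - y) <= delta


if __name__ == "__main__":
    print("difference:", abs(a - b))
    print("places=1:", within_places(a, b, 1))    # the check that failed
    print("places=0:", within_places(a, b, 0))
    print("delta=0.2:", within_delta(a, b, 0.2))
```

Since places only moves in powers of ten, passing an explicit delta may be 
the easier way to relax the check by a small factor such as two.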

The only other algorithm where this change might be worth making is 
StreamingKMeans, but the tests there are closer to toy examples, and I think 
they will be less susceptible to small differences in convergence (assuming 
that's what's going on here). Have you noticed any of those failing as well?

> Possibly slightly flaky PySpark StreamingLinearRegressionWithTests
> ------------------------------------------------------------------
>
>                 Key: SPARK-9461
>                 URL: https://issues.apache.org/jira/browse/SPARK-9461
>             Project: Spark
>          Issue Type: Bug
>          Components: MLlib, PySpark
>    Affects Versions: 1.5.0
>            Reporter: Joseph K. Bradley
>            Assignee: Jeremy Freeman
>
> [~freeman-lab]
> Check out this failure: 
> [https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/38913/consoleFull]
> It should be deterministic, but do you think it's just slight variations 
> caused by the Python version?  Or do you think it's something odd going on 
> with streaming?  This is the only time I've seen this happen, but I'll post 
> again if I see it more.
> Test failure message:
> {code}
> ======================================================================
> FAIL: test_parameter_accuracy (__main__.StreamingLinearRegressionWithTests)
> Test that coefs are predicted accurately by fitting on toy data.
> ----------------------------------------------------------------------
> Traceback (most recent call last):
>   File "/home/jenkins/workspace/SparkPullRequestBuilder/python/pyspark/mllib/tests.py", line 1282, in test_parameter_accuracy
>     slr.latestModel().weights.array, [10., 10.], 1)
>   File "/home/jenkins/workspace/SparkPullRequestBuilder/python/pyspark/mllib/tests.py", line 1257, in assertArrayAlmostEqual
>     self.assertAlmostEqual(i, j, dec)
>     self.assertAlmostEqual(i, j, dec)
> AssertionError: 9.4243238731093655 != 9.3216175551722014 within 1 places
> {code}



