[ https://issues.apache.org/jira/browse/SPARK-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14647220#comment-14647220 ]
Jeremy Freeman commented on SPARK-9461: --------------------------------------- I'd definitely be curious to see if these two tests pass again with a relaxed tolerance (say double the tolerance, which would have fixed the one in your initial comment), while we get to the core of the issue, maybe try that on your PR? The only other algo to consider making the change in is StreamingKMeans, but the tests there are closer to toy examples, and I think will be less susceptible to small differences in convergence (assuming that's what's going on here), have you you noticed any of those failing as well? > Possibly slightly flaky PySpark StreamingLinearRegressionWithTests > ------------------------------------------------------------------ > > Key: SPARK-9461 > URL: https://issues.apache.org/jira/browse/SPARK-9461 > Project: Spark > Issue Type: Bug > Components: MLlib, PySpark > Affects Versions: 1.5.0 > Reporter: Joseph K. Bradley > Assignee: Jeremy Freeman > > [~freeman-lab] > Check out this failure: > [https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/38913/consoleFull] > It should be deterministic, but do you think it's just slight variations > caused by the Python version? Or do you think it's something odd going on > with streaming? This is the only time I've seen this happen, but I'll post > again if I see it more. > Test failure message: > {code} > ====================================================================== > FAIL: test_parameter_accuracy (__main__.StreamingLinearRegressionWithTests) > Test that coefs are predicted accurately by fitting on toy data. > ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/home/jenkins/workspace/SparkPullRequestBuilder/python/pyspark/mllib/tests.py", > line 1282, in test_parameter_accuracy > slr.latestModel().weights.array, [10., 10.], 1) > File > "/home/jenkins/workspace/SparkPullRequestBuilder/python/pyspark/mllib/tests.py", > line 1257, in assertArrayAlmostEqual > self.assertAlmostEqual(i, j, dec) > AssertionError: 9.4243238731093655 != 9.3216175551722014 within 1 places > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org