Github user srowen commented on the pull request:
https://github.com/apache/spark/pull/8314#issuecomment-151188571
For `JavaDataFrameSuite.testSampleBy` I think you can accept any value
between 1 and 6 for key 0, and 4 and 9 for key 1. These are not-improbable
values given the test -- basically, how many of 33 elements do you choose if
choosing with probability 0.1 and 0.2 respectively.
The `Word2Vec` test does look far too tight, I think. The others, I'm not
as sure. I think the `StreamingKMeansSuite` just needs more points. Let me see
if I can provide a concrete suggestion on these.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]