Github user BryanCutler commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20627#discussion_r169158234
  
    --- Diff: python/pyspark/ml/tests.py ---
    @@ -541,6 +541,16 @@ def test_java_params(self):
             self.assertEqual(evaluator._java_obj.getMetricName(), "r2")
             self.assertEqual(evaluatorCopy._java_obj.getMetricName(), "mae")
     
    +    def test_clustering_evaluator_with_cosine_distance(self):
    +        featureAndPredictions = map(lambda x: (Vectors.dense(x[0]), x[1]),
    +                                    [([1.0, 1.0], 1.0), ([10.0, 10.0], 1.0), ([1.0, 0.5], 2.0),
    +                                     ([10.0, 4.4], 2.0), ([-1.0, 1.0], 3.0), ([-100.0, 90.0], 3.0)])
    +        dataset = self.spark.createDataFrame(featureAndPredictions, ["features", "prediction"])
    +        evaluator = ClusteringEvaluator(predictionCol="prediction", distanceMeasure="cosine")
    +        self.assertEqual(evaluator.getDistanceMeasure(), "cosine")
    +        self.assertEqual(round(evaluator.evaluate(dataset), 5), 0.99267)
    +        self.assertEqual(evaluator._java_obj.getDistanceMeasure(), "cosine")
    --- End diff ---
    
    I don't think it's great to check the parameter on the Java object too. If this were done for every param, it would add a ton of checks and make it impossible to change this plumbing in the future.
    
    There is an existing test that runs basic param checks, and that should be sufficient here, but it doesn't look like the evaluation module is covered by it. Can you add it? It's in `DefaultValuesTests.test_java_params`.
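    For reference, a rough sketch of what that addition might look like, assuming the test keeps a list of `pyspark.ml` modules to scan (the exact structure of `test_java_params` and the signature of the suite's `check_params` helper may differ from this):

    ```python
    import inspect

    import pyspark.ml.clustering
    import pyspark.ml.evaluation  # <- the module to add
    import pyspark.ml.feature
    from pyspark.ml.wrapper import JavaParams

    def test_java_params(self):
        # Assumed shape: iterate the ml modules and run the shared
        # param check on every concrete JavaParams subclass.
        modules = [pyspark.ml.feature, pyspark.ml.clustering,
                   pyspark.ml.evaluation]  # evaluation added here
        for module in modules:
            for name, cls in inspect.getmembers(module, inspect.isclass):
                # Skip models and abstract classes; only instantiate
                # concrete estimators/transformers/evaluators.
                if not name.endswith('Model') and issubclass(cls, JavaParams) \
                        and not inspect.isabstract(cls):
                    # check_params: the suite's existing helper that
                    # compares Python param defaults with the Java side.
                    self.check_params(cls())
    ```

    That way `ClusteringEvaluator` (and any future evaluator param like `distanceMeasure`) gets the Python/Java param check for free, without per-param assertions in individual tests.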

