[GitHub] [spark] LuciferYang commented on pull request #29857: [SPARK-32972][ML] Fix UTs of `mllib` module in Scala 2.13 except RandomForestRegressorSuite

GitBox Wed, 23 Sep 2020 21:56:29 -0700


LuciferYang commented on pull request #29857:
URL: https://github.com/apache/spark/pull/29857#issuecomment-698111573



   cc @srowen The remaining failed case is 
   
   ```
   RandomForestRegressorSuite:
   - training with sample weights *** FAILED ***
     0.756 was not greater than or equal to 0.78 (MLTestingUtils.scala:285)
   ```
   
   
https://github.com/apache/spark/blob/0bc0e91e4015eb98bd2f4bf17da2ec7135b520a9/mllib/src/test/scala/org/apache/spark/ml/regression/RandomForestRegressorSuite.scala#L171-L200
   
   Input `(50, 10, 0.95, 0.78)` with 
   
   ```
   MLTestingUtils.testOversamplingVsWeighting[RandomForestRegressionModel,
           RandomForestRegressor](df.as[LabeledPoint], estimator,
           MLTestingUtils.modelPredictionEquals(df, _ ~= _ relTol 0.2, tol), 
seed)
   ```
   failed.
   
   I found that the following `RandomForest.runBagged` behave differently for 
the same input in Scala 2.12 and Scala 2.13, maybe related to the follow code 
block:
   
   
https://github.com/apache/spark/blob/0bc0e91e4015eb98bd2f4bf17da2ec7135b520a9/mllib/src/main/scala/org/apache/spark/ml/tree/impl/RandomForest.scala#L191-L215
   
   but I am not familiar with this algorithm and I not find root cause, I think 
we need an expert to guide how to fix it
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] LuciferYang commented on pull request #29857: [SPARK-32972][ML] Fix UTs of `mllib` module in Scala 2.13 except RandomForestRegressorSuite

Reply via email to