Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/11989#discussion_r58402132
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/tree/RandomForest.scala ---
@@ -55,10 +55,15 @@ import org.apache.spark.util.Utils
* @param numTrees If 1, then no bootstrapping is used. If > 1, then
bootstrapping is done.
* @param featureSubsetStrategy Number of features to consider for splits
at each node.
* Supported values: "auto", "all", "sqrt",
"log2", "onethird".
+ * Supported numerical values: "(0.0-1.0]",
"[1-n]".
* If "auto" is set, this parameter is set
based on numTrees:
* if numTrees == 1, set to "all";
* if numTrees > 1 (forest) set to "sqrt"
for classification and
* to "onethird" for regression.
+ * If a real value "(0.0-1.0]" is set, this
parameter specifies
+ * the fraction of features in each subset.
+ * If an integer value "[1-n]" is set, this
parameter specifies
--- End diff --
Similar change could be made here. See above note.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]