[
https://issues.apache.org/jira/browse/SPARK-20199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
pralabhkumar updated SPARK-20199:
---------------------------------
Description:
Spark GradientBoostedTreesModel doesn't have featureSubsetStrategy . It Uses
random forest internally ,which have featureSubsetStrategy hardcoded "all". It
should be provided by the user to have randomness at the feature level.
This parameter is available in H2O and XGBoost.
Sample from H2O.ai
gbmParams._col_sample_rate
Please provide the parameter .
was:
Spark GradientBoostedTreesModel doesn't have Column sampling rate parameter .
This parameter is available in H2O and XGBoost.
Sample from H2O.ai
gbmParams._col_sample_rate
Please provide the parameter .
> GradientBoostedTreesModel doesn't have featureSubsetStrategy parameter
> -----------------------------------------------------------------------
>
> Key: SPARK-20199
> URL: https://issues.apache.org/jira/browse/SPARK-20199
> Project: Spark
> Issue Type: Improvement
> Components: ML, MLlib
> Affects Versions: 2.1.0
> Reporter: pralabhkumar
>
> Spark GradientBoostedTreesModel doesn't have featureSubsetStrategy . It Uses
> random forest internally ,which have featureSubsetStrategy hardcoded "all".
> It should be provided by the user to have randomness at the feature level.
> This parameter is available in H2O and XGBoost.
> Sample from H2O.ai
> gbmParams._col_sample_rate
> Please provide the parameter .
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]