Hello group, Happy new year! Does anyone have a working example or template
using the DataFrame API vs. the RDD based APIs. We are wanting to migrate
to using the new DataFrame APIs to take advantage of the *Feature
Importance* function for our Regression Random Forest Models.
We are wanting to move from
import org.apache.spark.mllib.tree.RandomForestimport
org.apache.spark.mllib.tree.model.RandomForestModelimport
org.apache.spark.mllib.util.MLUtils
to
import org.apache.spark.ml.regression.{RandomForestRegressionModel,
RandomForestRegressor}
Is this something that should be fairly straightforward by adjusting
parameters and calling new classes within DASE or is it much more involved
development.
Thank You!
*Shane Johnson | 801.360.3350*
LinkedIn <https://www.linkedin.com/in/shanewjohnson> | Facebook
<https://www.facebook.com/shane.johnson.71653>