Re: RandomSplit with Spark-ML and Dataframe

Olivier Girardot Tue, 19 May 2015 12:53:54 -0700

Thank you !

Le mar. 19 mai 2015 à 21:08, Xiangrui Meng <[email protected]> a écrit :


> In 1.4, we added RAND as a DataFrame expression, which can be used for
> random split. Please check the example here:
>
> https://github.com/apache/spark/blob/master/python/pyspark/ml/tuning.py#L214.
> <https://github.com/apache/spark/blob/master/python/pyspark/ml/tuning.py#L214.-Xiangrui>
> -Xiangrui
> <https://github.com/apache/spark/blob/master/python/pyspark/ml/tuning.py#L214.-Xiangrui>
>
> On Thu, May 7, 2015 at 8:39 AM, Olivier Girardot
> <[email protected]> wrote:
> > Hi,
> > is there any best practice to do like in MLLib a randomSplit of
> > training/cross-validation set with dataframes and the pipeline API ?
> >
> > Regards
> >
> > Olivier.
>

Re: RandomSplit with Spark-ML and Dataframe

Reply via email to