Hi,
i am pyspark user and i want test Randomforest.

i have dataframe with 100 columns
i should give Rdd or data frame to algorithme i transformed my dataframe to
only tow columns
label ands features  columns

 df.label df.features
  0            (517,(0,1,2,333,56 ...
   1           (517,(0,11,0,33,6 ...
    0           (517,(0,1,0,33,8 ...

but i dont have no ieda to transorme data frame like input to data frame i
test the example in offciel web page without succes

please give me example how i can work and specily with test set  .

thanks

Reply via email to