Hi Matthias,
Thanks for the help! In response to your questions:
1. Sorry - this was a typo: the correct schema is: [y: int, features:
vector] - the column "features" was created using Spark's VectorAssembler
and the underlying type is an org.apache.spark.ml.linalg.SparseVector.
well, let's do the following to figure this out:
1) If the schema is indeed [label: Integer, features: SparseVector],
please change the third line to val y = input_data.select("label").
2) For debugging, I would recommend to use a simple script like
"print(sum(X));" and try converting X and