Hello, I just finished setting up a standalone Spark cluster and have moved on to exploring MLlib.
I'm trying to perform Linear Regression on a very simple, contrived dataset. I have which contains

I then ran the following code through the Spark shell (modified very slightly from http://spark.incubator.apache.org/docs/latest/mllib-guide.html):

The problem is that the weights and intercept are extremely off:

It gets a little better if I adjust the step size:

But still doesn't converge on the correct estimates (I would of course expect intercept=0, slope=1).

Any idea what I'm doing wrong? I feel like I must be missing something obvious.

Thanks!

Herb Susmann
SUNY Geneseo
[email protected]

--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Inaccurate-Estimates-from-LinearRegressionWithSGD-tp942.html
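As an illustration of the step-size sensitivity described in the question above, here is a minimal sketch of plain batch gradient descent fitting a line to points on y = x. It is not MLlib's implementation, and MLlib's optimizer may differ in its update rule and step-size schedule. The data, the function name `fit_line`, and the step sizes and iteration counts are all assumptions chosen for illustration; none of them come from the original post.

```python
# Hypothetical data (not the poster's file): ten points lying exactly on y = x.
xs = [float(i) for i in range(1, 11)]
ys = list(xs)


def fit_line(xs, ys, step_size, iterations):
    """Fit y = w*x + b by batch gradient descent on half the mean squared error.

    Illustrative helper only; it does not mirror any Spark or MLlib API.
    """
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(iterations):
        # Residuals of the current fit on every point.
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        # Gradients of (1 / 2n) * sum(error^2) with respect to w and b.
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= step_size * grad_w
        b -= step_size * grad_b
    return w, b


if __name__ == "__main__":
    # Assumed settings: one step size that is too large, one short run, one long run.
    for step_size, iterations in [(1.0, 20), (0.01, 100), (0.01, 10000)]:
        w, b = fit_line(xs, ys, step_size, iterations)
        print(f"step={step_size} iters={iterations} slope={w:.4g} intercept={b:.4g}")
```

With these assumed values, the largest step size makes the estimates blow up, the short run with the small step size leaves a visibly nonzero intercept, and only the long run approaches slope 1 and intercept 0. The intercept is the slow part because the unscaled x values make the loss surface much steeper in the slope direction than in the intercept direction.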
