Hello,

I just finished setting up a standalone Spark cluster and have moved on to
exploring MLlib.

I'm trying to perform Linear Regression on a very simple, contrived dataset.
I have  which contains


I then ran the following code through the Spark shell (modified very
slightly from
http://spark.incubator.apache.org/docs/latest/mllib-guide.html):



The problem is that the weights and intercept are extremely off:


It gets a little better if I adjust the step size:


But still doesn't converge on the correct estimates (I would of course
expect intercept=0, slope=1). Any idea what I'm doing wrong? I feel like I
must be missing something obvious.

Thanks!
Herb Susmann
SUNY Geneseo
[email protected]



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Inaccurate-Estimates-from-LinearRegressionWithSGD-tp942.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

Reply via email to