Github user tengpeng commented on a diff in the pull request:
https://github.com/apache/spark/pull/19638#discussion_r150394001
--- Diff:
mllib/src/test/scala/org/apache/spark/ml/regression/LinearRegressionSuite.scala
---
@@ -764,13 +764,17 @@ class LinearRegressionSuite
(Intercept) 6.3022157 0.0018600 3388 <2e-16 ***
V2 4.6982442 0.0011805 3980 <2e-16 ***
V3 7.1994344 0.0009044 7961 <2e-16 ***
+
+ # R code for r2adj
--- End diff --
The confusion came from my incorrect assumption that RStudio names columns
the same way the R shell does when importing data. RStudio uses `X_` for
all variables, including the dependent variable.
I have followed @sethah's suggestion and replaced the call with
`lm_fit <- lm(V1 ~ V2 + V3, data = d1)`. We should be fine here.
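
For reference, the r2adj value that the R code is meant to reproduce follows the standard adjusted R-squared formula. Below is a minimal sketch in Python, assuming a model with an intercept and `p` predictors fit on `n` rows. The function name `adjusted_r2` and the input values are illustrative only and are not taken from the test suite.

```python
def adjusted_r2(r2: float, n: int, p: int) -> float:
    """Adjusted R^2 for a model with an intercept and p predictors fit on n rows."""
    if n - p - 1 <= 0:
        raise ValueError("need more observations than fitted parameters")
    # Penalize R^2 by the ratio of total to residual degrees of freedom.
    return 1.0 - (1.0 - r2) * (n - 1) / (n - p - 1)


# Two predictors (V2 and V3), as in the lm call above; r2 and n are made-up inputs.
print(adjusted_r2(0.5, 11, 2))  # 0.375
```

With a fit as tight as the one in the summary table, the adjusted value should sit very close to the plain R-squared, since the penalty factor approaches 1 as `n` grows.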