ericl commented on a change in pull request #24302: [SPARK-27392][SQL] TestHive
test tables should be placed in shared test state, not per session
URL: https://github.com/apache/spark/pull/24302#discussion_r276486793
##########
File path:
mllib/src/test/java/org/apache/spark/mllib/regression/JavaRidgeRegressionSuite.java
##########
@@ -55,7 +56,8 @@ public void runRidgeRegressionUsingConstructor() {
int numFeatures = 20;
List<LabeledPoint> data = generateRidgeData(2 * numExamples, numFeatures, 10.0);
- JavaRDD<LabeledPoint> testRDD = jsc.parallelize(data.subList(0, numExamples));
+ JavaRDD<LabeledPoint> testRDD = jsc.parallelize(
+ new ArrayList<LabeledPoint>(data.subList(0, numExamples)));
Review comment:
The issue here is that list views, such as the one returned by subList(), are
not serializable. That would block any future automated check requiring the
arguments to parallelize() to be serializable. Copying the view into a new
ArrayList fixes the issue and does not affect the behaviour of the test.
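
To illustrate the point outside of Spark, here is a minimal standalone sketch.
It assumes the backing list is a plain java.util.ArrayList (the concrete type
returned by generateRidgeData is not shown in this diff), and the class name
SubListSerializationDemo and helper canSerialize are made up for the example.
It only exercises standard Java serialization, not parallelize() itself.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.NotSerializableException;
import java.io.ObjectOutputStream;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SubListSerializationDemo {

  // Hypothetical helper: returns true if standard Java serialization accepts
  // the object, false if it is rejected with NotSerializableException.
  public static boolean canSerialize(Object obj) {
    try (ObjectOutputStream out =
        new ObjectOutputStream(new ByteArrayOutputStream())) {
      out.writeObject(obj);
      return true;
    } catch (NotSerializableException e) {
      return false;
    } catch (IOException e) {
      // Any other I/O failure is unexpected for an in-memory stream.
      throw new UncheckedIOException(e);
    }
  }

  public static void main(String[] args) {
    List<Double> data = new ArrayList<>(Arrays.asList(1.0, 2.0, 3.0, 4.0));

    // subList() returns a view backed by `data`; the view class used by
    // ArrayList does not implement Serializable.
    List<Double> view = data.subList(0, 2);

    // Copying the view into a fresh ArrayList gives an independent,
    // serializable list with the same elements.
    List<Double> copy = new ArrayList<>(view);

    System.out.println("view serializable: " + canSerialize(view));
    System.out.println("copy serializable: " + canSerialize(copy));
  }
}
```

With a standard JDK ArrayList as the backing list, this prints "false" for the
view and "true" for the copy, while both lists hold the same elements.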