[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-12-01 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/9889

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-12-01 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-161057518 Thanks, merging to master and 1.6.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-160872230 @marmbrus Please check the latest changes. Feel free to let me know if we need more changes. Thank you!

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-160846047 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/4

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-160846044 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-160845838 **[Test build #46925 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46925/consoleFull)** for PR 9889 at commit [`b8d287a`](https://git

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-160823405 **[Test build #46925 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46925/consoleFull)** for PR 9889 at commit [`b8d287a`](https://gith

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r46231420 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/CacheSuite.scala --- @@ -0,0 +1,89 @@ +/* + * Licensed to the Apache Software Foundation (ASF

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-160803517 Thank you! @marmbrus Will make the changes soon.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r46222399 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/CacheSuite.scala --- @@ -0,0 +1,89 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r46222418 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/CacheSuite.scala --- @@ -0,0 +1,89 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r46222335 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLContext.scala --- @@ -339,6 +339,15 @@ class SQLContext private[sql]( } /**

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r46222305 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/CacheSuite.scala --- @@ -0,0 +1,89 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-30 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r4664 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/CacheSuite.scala --- @@ -0,0 +1,89 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-26 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r46000945 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -579,11 +580,50 @@ class Dataset[T] private[sql]( */ def takeAsList

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-26 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159969380 @marmbrus Not sure if the latest code changes resolve all your concerns. Please let me know if you have any suggestions. Thank you! Have a good Thanksgiving D

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159665597 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159665598 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/4

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-25 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159665401 **[Test build #46689 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46689/consoleFull)** for PR 9889 at commit [`92ede39`](https://git

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-25 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159623533 **[Test build #46689 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46689/consoleFull)** for PR 9889 at commit [`92ede39`](https://gith

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-25 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159620126 retest this please

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-25 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159618199 please retest it

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159614171 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/4

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159614168 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159527582 @rxin Sure, will do the Python testing first. Thanks!

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159527365 @gatorsmile Just FYI: if you have time, the Python tests are probably much more important than the more complicated caching case.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159526861 Now I understand your concern. Thank you for the example! I added it to the newly created test suite `CacheSuite`. I saw the failure and thus used `i

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45836772 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -579,11 +580,50 @@ class Dataset[T] private[sql]( */ def takeAsList(num

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45836459 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -579,11 +580,50 @@ class Dataset[T] private[sql]( */ def takeAsLi

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159473750 I wouldn't block merging an initial version of this feature on this, but it would also be nice if we could support the following (this might be hard though): `

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824732 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -17,6 +17,8 @@ package org.apache.spark.sql +import org.apach

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824715 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -579,11 +580,50 @@ class Dataset[T] private[sql]( */ def takeAsList

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824631 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetSuite.scala --- @@ -213,6 +215,56 @@ class DatasetSuite extends QueryTest with SharedSQLContex

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824550 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/Queryable.scala --- @@ -27,6 +28,7 @@ private[sql] trait Queryable { def schema: Stru

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159471621 It would be great to also have tests that ensure that things like `.as[Class]` do not break caching.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824400 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala --- @@ -75,12 +75,12 @@ private[sql] class CacheManager extends Logging {

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824320 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetSuite.scala --- @@ -213,6 +215,56 @@ class DatasetSuite extends QueryTest with SharedSQLContex

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824255 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetSuite.scala --- @@ -213,6 +215,56 @@ class DatasetSuite extends QueryTest with SharedSQLContex

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824179 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetSuite.scala --- @@ -213,6 +215,56 @@ class DatasetSuite extends QueryTest with SharedSQLContex

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/9889#discussion_r45824083 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -579,11 +580,50 @@ class Dataset[T] private[sql]( */ def takeAsList

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-24 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-159444815 cc @marmbrus I will let you merge this one.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158860879 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158860880 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/4

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158860840 **[Test build #46510 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46510/consoleFull)** for PR 9889 at commit [`251`](https://git

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-22 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158851730 @marmbrus Do these newly added test cases resolve your concerns?

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158851741 **[Test build #46510 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46510/consoleFull)** for PR 9889 at commit [`251`](https://gith

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-22 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158789433 I see, will give it a try. Thanks!

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-22 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158787958 I'm worried the existing caching mechanisms might not work on dataset operations. Do we have a good notion of equality for encoders and lambda functions? Can you add
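
The equality question raised in the comment above can be illustrated without Spark. The following is only a sketch: `MapNode` is a hypothetical stand-in for a plan node that carries a user function and is not a Spark class. It shows that in plain Scala two textually identical lambdas are distinct objects, so any structure that compares them with `==` treats them as different, which is one reason a cache lookup based on plan equality could miss.

```scala
object LambdaEqualityDemo {
  // Hypothetical plan node holding a user-supplied function (not a Spark class).
  final case class MapNode(child: String, f: Int => Int)

  // Two lambdas with identical source text are still separate function objects.
  val inc1: Int => Int = x => x + 1
  val inc2: Int => Int = x => x + 1

  val a = MapNode("t", inc1)
  val b = MapNode("t", inc2) // same shape, different function object
  val c = MapNode("t", inc1) // same shape, same function object

  def main(args: Array[String]): Unit = {
    // Case-class equality compares fields with ==, and function values
    // fall back to reference equality.
    println(a == b) // false
    println(a == c) // true
  }
}
```

Under this assumption, a lookup keyed on such nodes would only hit when the very same function instance is reused, not when an equivalent lambda is written again.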

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158705760 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158705761 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/4

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-21 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158705743 **[Test build #46485 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46485/consoleFull)** for PR 9889 at commit [`c135e1f`](https://git

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158698890 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158698891 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/4

[GitHub] spark pull request: [SPARK-11905] [SQL] Support Persist/Cache and ...

2015-11-21 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9889#issuecomment-158698821 **[Test build #46485 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46485/consoleFull)** for PR 9889 at commit [`c135e1f`](https://gith