[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame
GitHub user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/4351#issuecomment-72778536

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26703/
Test FAILed.

---
If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA.
---
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame
GitHub user davies opened a pull request:

    https://github.com/apache/spark/pull/4351

    [WIP] [SPARK-5577] Python udf for DataFrame

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/davies/spark python_udf

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/4351.patch

To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message:

    This closes #4351

commit 3ab26614b5278edce6e8571e5c51fe0b67e3124e
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T08:08:00Z

    add more tests for DataFrame

commit 6040ba73431cc22d8d777555db6b35241275bdce
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T09:09:36Z

    fix docs

commit 9ab78b4262961deafe0256c8c28d2911a4c07b0a
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T09:10:54Z

    Merge branch 'master' of github.com:apache/spark into fix_df

    Conflicts:
        sql/core/src/main/scala/org/apache/spark/sql/Column.scala

commit 78ebcfa6ba750e081f6b5c7b07c8d04f32c2d4d6
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T09:12:02Z

    add sql_test.py in run_tests

commit 35ccb9f5721266a3a25df7e5f6d4b2c98f5f18d5
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T09:23:16Z

    fix build

commit 8dd19a912e8595dddeec56fea964ab40b5b9f738
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T18:00:04Z

    fix tests in python 2.6

commit c052f6fe0aaaf688a8f08e0fe04abdeea8933448
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T18:44:36Z

    Merge branch 'master' of github.com:apache/spark into fix_df

commit 83c92fedc4f69dfff909d61899c906cea357498f
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T20:21:08Z

    address comments

commit 467332cacca8754f04271a70bbaf15c8f2afd5c6
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T20:34:16Z

    support string in cast()

commit dd9919f115d3b8f4b66d213c4a57bc832ed8ed57
Author: Davies Liu dav...@databricks.com
Date: 2015-02-03T22:17:09Z

    fix tests

commit 1e4766485b20629a9cee12fc1c4751fc427cc569
Author: Davies Liu dav...@databricks.com
Date: 2015-02-04T01:24:15Z

    Merge branch 'master' of github.com:apache/spark into python_udf
[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame
GitHub user davies commented on a diff in the pull request:

    https://github.com/apache/spark/pull/4351#discussion_r24063741

    --- Diff: python/pyspark/sql.py ---
    @@ -2263,18 +2263,6 @@ def subtract(self, other):
             return DataFrame(getattr(self._jdf, "except")(other._jdf), self.sql_ctx)

    -    def sample(self, withReplacement, fraction, seed=None):
    --- End diff --

There are two sample() definitions.
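As background for the comment about two sample() definitions: when a Python class body defines the same method name twice, the later definition silently replaces the earlier one. The sketch below illustrates this with a hypothetical stand-in class (`Frame` and its return strings are made up for illustration, and are not the real pyspark `DataFrame`), so it shows only the general language behaviour, not the actual contents of `python/pyspark/sql.py`.

```python
# Hypothetical stand-in class, NOT the real pyspark DataFrame.
# It only demonstrates how Python treats a method defined twice.
class Frame(object):
    def sample(self, fraction):
        # First definition: rebound (and lost) by the definition below.
        return "first definition"

    def sample(self, withReplacement, fraction, seed=None):
        # Second definition: the class body is executed top to bottom,
        # so this one is what Frame.sample refers to afterwards.
        return "second definition"


frame = Frame()
result = frame.sample(False, 0.5)
print(result)  # prints "second definition"

# Calling with the first signature fails, because only the second
# definition survives and it requires the `fraction` argument.
try:
    frame.sample(0.5)
    first_signature_works = True
except TypeError:
    first_signature_works = False
```

Because the earlier definition is unreachable in a case like this, deleting one of the duplicates does not remove the method itself; which copy to keep depends on the signatures and docstrings involved.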
[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame
GitHub user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4351#issuecomment-72786425

[Test build #26705 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26705/consoleFull) for PR 4351 at commit [`c6d0d59`](https://github.com/apache/spark/commit/c6d0d592738c7bb459852d60287925d8f0a30a4b).
* This patch **passes all tests**.
* This patch merges cleanly.
* This patch adds the following public classes _(experimental)_:
  * `class FPGrowthModel[Item: ClassTag](`
  * `class Dsl(object):`
  * `class ExamplePointUDT(UserDefinedType):`
  * `class SQLTests(ReusedPySparkTestCase):`
[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame
GitHub user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/4351#issuecomment-72786434

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26705/
Test PASSed.
[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame
GitHub user rxin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/4351#discussion_r24062243

    --- Diff: python/pyspark/sql.py ---
    @@ -2263,18 +2263,6 @@ def subtract(self, other):
             return DataFrame(getattr(self._jdf, "except")(other._jdf), self.sql_ctx)

    -    def sample(self, withReplacement, fraction, seed=None):
    --- End diff --

Why are we removing sample?
[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame
GitHub user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4351#issuecomment-72777617

[Test build #26705 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26705/consoleFull) for PR 4351 at commit [`c6d0d59`](https://github.com/apache/spark/commit/c6d0d592738c7bb459852d60287925d8f0a30a4b).
* This patch merges cleanly.