[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame

2015-02-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4351#issuecomment-72778536
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26703/
Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame

2015-02-03 Thread davies
GitHub user davies opened a pull request:

https://github.com/apache/spark/pull/4351

[WIP] [SPARK-5577] Python udf for DataFrame



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/davies/spark python_udf

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/4351.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #4351


commit 3ab26614b5278edce6e8571e5c51fe0b67e3124e
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T08:08:00Z

add more tests for DataFrame

commit 6040ba73431cc22d8d777555db6b35241275bdce
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T09:09:36Z

fix docs

commit 9ab78b4262961deafe0256c8c28d2911a4c07b0a
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T09:10:54Z

Merge branch 'master' of github.com:apache/spark into fix_df

Conflicts:
sql/core/src/main/scala/org/apache/spark/sql/Column.scala

commit 78ebcfa6ba750e081f6b5c7b07c8d04f32c2d4d6
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T09:12:02Z

add sql_test.py in run_tests

commit 35ccb9f5721266a3a25df7e5f6d4b2c98f5f18d5
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T09:23:16Z

fix build

commit 8dd19a912e8595dddeec56fea964ab40b5b9f738
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T18:00:04Z

fix tests in python 2.6

commit c052f6fe0aaaf688a8f08e0fe04abdeea8933448
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T18:44:36Z

Merge branch 'master' of github.com:apache/spark into fix_df

commit 83c92fedc4f69dfff909d61899c906cea357498f
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T20:21:08Z

address comments

commit 467332cacca8754f04271a70bbaf15c8f2afd5c6
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T20:34:16Z

support string in cast()

commit dd9919f115d3b8f4b66d213c4a57bc832ed8ed57
Author: Davies Liu dav...@databricks.com
Date:   2015-02-03T22:17:09Z

fix tests

commit 1e4766485b20629a9cee12fc1c4751fc427cc569
Author: Davies Liu dav...@databricks.com
Date:   2015-02-04T01:24:15Z

Merge branch 'master' of github.com:apache/spark into python_udf




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame

2015-02-03 Thread davies
Github user davies commented on a diff in the pull request:

https://github.com/apache/spark/pull/4351#discussion_r24063741
  
--- Diff: python/pyspark/sql.py ---
@@ -2263,18 +2263,6 @@ def subtract(self, other):
 
 return DataFrame(getattr(self._jdf, except)(other._jdf), 
self.sql_ctx)
 
-def sample(self, withReplacement, fraction, seed=None):
--- End diff --

there are two sample().


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame

2015-02-03 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4351#issuecomment-72786425
  
  [Test build #26705 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26705/consoleFull)
 for   PR 4351 at commit 
[`c6d0d59`](https://github.com/apache/spark/commit/c6d0d592738c7bb459852d60287925d8f0a30a4b).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `class FPGrowthModel[Item: ClassTag](`
  * `class Dsl(object):`
  * `class ExamplePointUDT(UserDefinedType):`
  * `class SQLTests(ReusedPySparkTestCase):`



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame

2015-02-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4351#issuecomment-72786434
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/26705/
Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame

2015-02-03 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/4351#discussion_r24062243
  
--- Diff: python/pyspark/sql.py ---
@@ -2263,18 +2263,6 @@ def subtract(self, other):
 
 return DataFrame(getattr(self._jdf, except)(other._jdf), 
self.sql_ctx)
 
-def sample(self, withReplacement, fraction, seed=None):
--- End diff --

why are we removing sample?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [WIP] [SPARK-5577] Python udf for DataFrame

2015-02-03 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4351#issuecomment-72777617
  
  [Test build #26705 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26705/consoleFull)
 for   PR 4351 at commit 
[`c6d0d59`](https://github.com/apache/spark/commit/c6d0d592738c7bb459852d60287925d8f0a30a4b).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org