[jira] [Resolved] (SPARK-24408) Move abs function to math_funcs group

2018-06-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-24408. - Resolution: Fixed Fix Version/s: 2.4.0 Thanks for helping improve our docs :) > Move abs

[jira] [Assigned] (SPARK-24408) Move abs function to math_funcs group

2018-06-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-24408: --- Assignee: Jacek Laskowski > Move abs function to math_funcs group >

[jira] [Updated] (SPARK-24408) Move abs function to math_funcs group

2018-06-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-24408: Description: A few math function ( {{abs}} )  are is in {{math_funcs}}  group. It should really be. (was:

[jira] [Resolved] (SPARK-23120) Add PMML pipeline export support to PySpark

2018-06-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-23120. - Resolution: Fixed Fix Version/s: 2.4.0 > Add PMML pipeline export support to PySpark >

[jira] [Assigned] (SPARK-14712) spark.ml LogisticRegressionModel.toString should summarize model

2018-06-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-14712: --- Assignee: Bravo Zhang > spark.ml LogisticRegressionModel.toString should summarize model >

[jira] [Resolved] (SPARK-14712) spark.ml LogisticRegressionModel.toString should summarize model

2018-06-28 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-14712. - Resolution: Fixed Fix Version/s: 2.4.0 > spark.ml LogisticRegressionModel.toString should

[jira] [Reopened] (SPARK-24444) Improve pandas_udf GROUPED_MAP docs to explain column assignment

2018-06-01 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reopened SPARK-2: - Code for 2.3.1 isn't merged, re-opening. > Improve pandas_udf GROUPED_MAP docs to explain column

[jira] [Updated] (SPARK-24283) Make standard scaler work without legacy MLlib

2018-05-15 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-24283: Labels: starter (was: ) > Make standard scaler work without legacy MLlib >

[jira] [Created] (SPARK-24283) Make standard scaler work without legacy MLlib

2018-05-15 Thread holdenk (JIRA)
holdenk created SPARK-24283: --- Summary: Make standard scaler work without legacy MLlib Key: SPARK-24283 URL: https://issues.apache.org/jira/browse/SPARK-24283 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-24282) Add support for PMML export for the Standard Scaler Stage

2018-05-15 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16475957#comment-16475957 ] holdenk commented on SPARK-24282: - I'm going to work on this one. > Add support for PMML export for the

[jira] [Assigned] (SPARK-24282) Add support for PMML export for the Standard Scaler Stage

2018-05-15 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-24282: --- Assignee: holdenk > Add support for PMML export for the Standard Scaler Stage >

[jira] [Created] (SPARK-24282) Add support for PMML export for the Standard Scaler Stage

2018-05-15 Thread holdenk (JIRA)
holdenk created SPARK-24282: --- Summary: Add support for PMML export for the Standard Scaler Stage Key: SPARK-24282 URL: https://issues.apache.org/jira/browse/SPARK-24282 Project: Spark Issue Type:

[jira] [Updated] (SPARK-24195) sc.addFile for local:/ path is broken

2018-05-14 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-24195: Labels: starter (was: ) > sc.addFile for local:/ path is broken > - >

[jira] [Updated] (SPARK-23940) High-order function: transform_values(map<K, V1>, function<K, V1, V2>) → map<K, V2>

2018-05-14 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-23940: Labels: starter (was: ) > High-order function: transform_values(map, function) → >

[jira] [Updated] (SPARK-24005) Remove usage of Scala’s parallel collection

2018-05-14 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-24005: Labels: starter (was: ) > Remove usage of Scala’s parallel collection >

[jira] [Updated] (SPARK-24102) RegressionEvaluator should use sample weight data

2018-05-14 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-24102: Labels: starter (was: ) > RegressionEvaluator should use sample weight data >

[jira] [Updated] (SPARK-24103) BinaryClassificationEvaluator should use sample weight data

2018-05-14 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-24103: Labels: starter (was: ) > BinaryClassificationEvaluator should use sample weight data >

[jira] [Assigned] (SPARK-24262) Fix typo in UDF error message

2018-05-13 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-24262: --- Assignee: Kelley Robinson > Fix typo in UDF error message > - > >

[jira] [Created] (SPARK-24262) Fix typo in UDF error message

2018-05-13 Thread holdenk (JIRA)
holdenk created SPARK-24262: --- Summary: Fix typo in UDF error message Key: SPARK-24262 URL: https://issues.apache.org/jira/browse/SPARK-24262 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-24184) Allow escape comma in spark files

2018-05-04 Thread holdenk (JIRA)
holdenk created SPARK-24184: --- Summary: Allow escape comma in spark files Key: SPARK-24184 URL: https://issues.apache.org/jira/browse/SPARK-24184 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-23842) accessing java from PySpark lambda functions

2018-04-26 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-23842. - Resolution: Won't Fix Not supported by the current design, alternatives do exist though. > accessing

[jira] [Commented] (SPARK-23842) accessing java from PySpark lambda functions

2018-04-26 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454585#comment-16454585 ] holdenk commented on SPARK-23842: - So the py4j gateway only exists on the driver program, on the worker

[jira] [Created] (SPARK-23853) Skip doctests which require hive support built in PySpark

2018-04-02 Thread holdenk (JIRA)
holdenk created SPARK-23853: --- Summary: Skip doctests which require hive support built in PySpark Key: SPARK-23853 URL: https://issues.apache.org/jira/browse/SPARK-23853 Project: Spark Issue Type:

[jira] [Created] (SPARK-23851) Investigate pip install edit mode unicode errors

2018-04-02 Thread holdenk (JIRA)
holdenk created SPARK-23851: --- Summary: Investigate pip install edit mode unicode errors Key: SPARK-23851 URL: https://issues.apache.org/jira/browse/SPARK-23851 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-23851) Investigate pip install edit mode unicode errors

2018-04-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423081#comment-16423081 ] holdenk commented on SPARK-23851: - Happening with pip 9.0.3 > Investigate pip install edit mode unicode

[jira] [Updated] (SPARK-23836) Support returning StructType to the level support in GroupedMap Arrow's "scalar" UDFS (or similar)

2018-04-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-23836: Summary: Support returning StructType to the level support in GroupedMap Arrow's "scalar" UDFS (or

[jira] [Commented] (SPARK-23836) Support returning StructType & MapType in Arrow's "scalar" UDFS (or similar)

2018-04-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422942#comment-16422942 ] holdenk commented on SPARK-23836: - Oh wait, I missunderstood our support of structype - I'll rephrase as

[jira] [Commented] (SPARK-21187) Complete support for remaining Spark data types in Arrow Converters

2018-04-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422938#comment-16422938 ] holdenk commented on SPARK-21187: - So Arrays are listed as crossed off but it seems like we don't

[jira] [Commented] (SPARK-23836) Support returning StructType & MapType in Arrow's "scalar" UDFS (or similar)

2018-04-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422926#comment-16422926 ] holdenk commented on SPARK-23836: - I'm going to take a quick crack at this this week. > Support

[jira] [Commented] (SPARK-23836) Support returning StructType & MapType in Arrow's "scalar" UDFS (or similar)

2018-04-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422922#comment-16422922 ] holdenk commented on SPARK-23836: - [~hyukjin.kwon] its a good question, that one seems to be more just

[jira] [Commented] (SPARK-23836) Support returning StructType & MapType in Arrow's "scalar" UDFS (or similar)

2018-03-30 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421044#comment-16421044 ] holdenk commented on SPARK-23836: - cc [~bryanc] > Support returning StructType & MapType in Arrow's

[jira] [Created] (SPARK-23836) Support returning StructType & MapType in Arrow's "scalar" UDFS (or similar)

2018-03-30 Thread holdenk (JIRA)
holdenk created SPARK-23836: --- Summary: Support returning StructType & MapType in Arrow's "scalar" UDFS (or similar) Key: SPARK-23836 URL: https://issues.apache.org/jira/browse/SPARK-23836 Project: Spark

[jira] [Commented] (SPARK-22809) pyspark is sensitive to imports with dots

2018-03-27 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16416038#comment-16416038 ] holdenk commented on SPARK-22809: - This _should_ be resolved by SPARK-23169 but I'll double check when

[jira] [Updated] (SPARK-23672) Document Support returning lists in Arrow UDFs

2018-03-26 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-23672: Description: Documenting the support for returning lists for individual inputs on non-grouped data inside

[jira] [Updated] (SPARK-23672) Document Support returning lists in Arrow UDFs

2018-03-26 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-23672: Summary: Document Support returning lists in Arrow UDFs (was: Support returning lists in Arrow UDFs) >

[jira] [Updated] (SPARK-23672) Support returning lists in Arrow UDFs

2018-03-26 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-23672: Description: Consider to add support for returning lists for individual inputs on non-grouped data inside

[jira] [Resolved] (SPARK-23783) Add new generic export trait for ML pipelines

2018-03-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-23783. - Resolution: Fixed Fix Version/s: 2.4.0 > Add new generic export trait for ML pipelines >

[jira] [Assigned] (SPARK-23783) Add new generic export trait for ML pipelines

2018-03-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-23783: --- Assignee: holdenk > Add new generic export trait for ML pipelines >

[jira] [Assigned] (SPARK-11239) PMML export for ML linear regression

2018-03-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-11239: --- Assignee: holdenk > PMML export for ML linear regression > > >

[jira] [Resolved] (SPARK-11239) PMML export for ML linear regression

2018-03-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-11239. - Resolution: Fixed Fix Version/s: 2.4.0 > PMML export for ML linear regression >

[jira] [Created] (SPARK-23783) Add new generic export trait for ML pipelines

2018-03-23 Thread holdenk (JIRA)
holdenk created SPARK-23783: --- Summary: Add new generic export trait for ML pipelines Key: SPARK-23783 URL: https://issues.apache.org/jira/browse/SPARK-23783 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-21685) Params isSet in scala Transformer triggered by _setDefault in pyspark

2018-03-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-21685: --- Assignee: Bryan Cutler > Params isSet in scala Transformer triggered by _setDefault in pyspark >

[jira] [Resolved] (SPARK-21685) Params isSet in scala Transformer triggered by _setDefault in pyspark

2018-03-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-21685. - Resolution: Fixed Fix Version/s: 2.4.0 > Params isSet in scala Transformer triggered by

[jira] [Commented] (SPARK-23672) Support returning lists in Arrow UDFs

2018-03-19 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16405574#comment-16405574 ] holdenk commented on SPARK-23672: - cc [~bryanc] for thoughts. > Support returning lists in Arrow UDFs >

[jira] [Resolved] (SPARK-15009) PySpark CountVectorizerModel should be able to construct from vocabulary list

2018-03-16 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-15009. - Resolution: Fixed Fix Version/s: 2.4.0 Target Version/s: 2.4.0 Thanks for fixing this

[jira] [Assigned] (SPARK-15009) PySpark CountVectorizerModel should be able to construct from vocabulary list

2018-03-16 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-15009: --- Assignee: Bryan Cutler > PySpark CountVectorizerModel should be able to construct from vocabulary

[jira] [Created] (SPARK-23672) Support returning lists in Arrow UDFs

2018-03-13 Thread holdenk (JIRA)
holdenk created SPARK-23672: --- Summary: Support returning lists in Arrow UDFs Key: SPARK-23672 URL: https://issues.apache.org/jira/browse/SPARK-23672 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-20087) Include accumulators / taskMetrics when sending TaskKilled to onTaskEnd listeners

2018-02-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16352980#comment-16352980 ] holdenk commented on SPARK-20087: - I've given up on changing the accumulator API until Spark 3+ is

[jira] [Commented] (SPARK-4502) Spark SQL reads unneccesary nested fields from Parquet

2018-01-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16340488#comment-16340488 ] holdenk commented on SPARK-4502: [~sameerag] understand this is a pretty big change to try and get in at

[jira] [Commented] (SPARK-22809) pyspark is sensitive to imports with dots

2018-01-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16336983#comment-16336983 ] holdenk commented on SPARK-22809: - oh wait it should work in 0.4.2, I'll poke at that PR more. > pyspark

[jira] [Commented] (SPARK-22809) pyspark is sensitive to imports with dots

2018-01-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16336979#comment-16336979 ] holdenk commented on SPARK-22809: - [~ueshin]: we can push this out to 2.3.1 given we are already in the

[jira] [Updated] (SPARK-22809) pyspark is sensitive to imports with dots

2018-01-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22809: Target Version/s: 2.3.1, 2.4.0 (was: 2.3.0) > pyspark is sensitive to imports with dots >

[jira] [Created] (SPARK-23120) Add PMML pipeline export support to PySpark

2018-01-16 Thread holdenk (JIRA)
holdenk created SPARK-23120: --- Summary: Add PMML pipeline export support to PySpark Key: SPARK-23120 URL: https://issues.apache.org/jira/browse/SPARK-23120 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-22809) pyspark is sensitive to imports with dots

2018-01-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16314308#comment-16314308 ] holdenk commented on SPARK-22809: - Note we used a simple {code:python} def foo(x): return

[jira] [Updated] (SPARK-22809) pyspark is sensitive to imports with dots

2018-01-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22809: Affects Version/s: 2.2.1 Target Version/s: 2.3.0 > pyspark is sensitive to imports with dots >

[jira] [Reopened] (SPARK-22809) pyspark is sensitive to imports with dots

2018-01-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reopened SPARK-22809: - Assignee: holdenk After further investigation this turns out to be an issue maye have been fixed

[jira] [Updated] (SPARK-22406) pyspark version tag is wrong on PyPi

2018-01-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22406: Fix Version/s: 2.1.2 2.2.1 > pyspark version tag is wrong on PyPi >

[jira] [Resolved] (SPARK-22406) pyspark version tag is wrong on PyPi

2018-01-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22406. - Resolution: Fixed We've posted 2.2.1 & 2.1.2 to PyPi with out the post issue. > pyspark version tag is

[jira] [Commented] (SPARK-22406) pyspark version tag is wrong on PyPi

2018-01-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16314233#comment-16314233 ] holdenk commented on SPARK-22406: - Yes. I'll close this. > pyspark version tag is wrong on PyPi >

[jira] [Assigned] (SPARK-22521) VectorIndexerModel support handle unseen categories via handleInvalid: Python API

2017-11-21 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-22521: --- Assignee: Weichen Xu > VectorIndexerModel support handle unseen categories via handleInvalid:

[jira] [Resolved] (SPARK-22521) VectorIndexerModel support handle unseen categories via handleInvalid: Python API

2017-11-21 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22521. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19753

[jira] [Commented] (SPARK-22274) User-defined aggregation functions with pandas udf

2017-11-17 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16256919#comment-16256919 ] holdenk commented on SPARK-22274: - Wonderful, do ping me on the PR then :) > User-defined aggregation

[jira] [Commented] (SPARK-22274) Used-defined aggregation functions with pandas udf

2017-11-16 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16254944#comment-16254944 ] holdenk commented on SPARK-22274: - Is anyone working on this currently? > Used-defined aggregation

[jira] [Commented] (SPARK-22274) Used-defined aggregation functions with pandas udf

2017-11-15 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16253635#comment-16253635 ] holdenk commented on SPARK-22274: - See also the discussion in

[jira] [Resolved] (SPARK-6802) User Defined Aggregate Function Refactoring

2017-11-15 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-6802. Resolution: Fixed > User Defined Aggregate Function Refactoring >

[jira] [Commented] (SPARK-6802) User Defined Aggregate Function Refactoring

2017-11-15 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16253634#comment-16253634 ] holdenk commented on SPARK-6802: Oh wait sorry I re-opened the wrong issue. > User Defined Aggregate

[jira] [Updated] (SPARK-10915) Add support for UDAFs in Python

2017-11-15 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-10915: Affects Version/s: 2.3.0 > Add support for UDAFs in Python > --- > >

[jira] [Created] (SPARK-22525) Spark download page doesn't update package name based package type

2017-11-15 Thread holdenk (JIRA)
holdenk created SPARK-22525: --- Summary: Spark download page doesn't update package name based package type Key: SPARK-22525 URL: https://issues.apache.org/jira/browse/SPARK-22525 Project: Spark

[jira] [Resolved] (SPARK-7146) Should ML sharedParams be a public API?

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-7146. Resolution: Fixed Fix Version/s: 2.3.0 Target Version/s: 2.3.0 Exposed the ML params as a

[jira] [Assigned] (SPARK-7146) Should ML sharedParams be a public API?

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-7146: -- Assignee: holdenk > Should ML sharedParams be a public API? > ---

[jira] [Assigned] (SPARK-22406) pyspark version tag is wrong on PyPi

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-22406: --- Assignee: holdenk > pyspark version tag is wrong on PyPi > > >

[jira] [Commented] (SPARK-22406) pyspark version tag is wrong on PyPi

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239432#comment-16239432 ] holdenk commented on SPARK-22406: - Due to restrictions on PyPI, no. We can try and fix this in 2.2.1

[jira] [Updated] (SPARK-22406) pyspark version tag is wrong on PyPi

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22406: Target Version/s: 2.2.1 > pyspark version tag is wrong on PyPi > > >

[jira] [Assigned] (SPARK-22401) Missing 2.1.2 tag in git

2017-11-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-22401: --- Assignee: holdenk > Missing 2.1.2 tag in git > > > Key:

[jira] [Resolved] (SPARK-22401) Missing 2.1.2 tag in git

2017-11-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22401. - Resolution: Fixed > Missing 2.1.2 tag in git > > > Key:

[jira] [Commented] (SPARK-22202) Release tgz content differences for python and R

2017-10-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16193247#comment-16193247 ] holdenk commented on SPARK-22202: - [~felixcheung] for Python I think it would not be bad to be

[jira] [Commented] (SPARK-22167) Spark Packaging w/R distro issues

2017-10-03 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16189309#comment-16189309 ] holdenk commented on SPARK-22167: - I agree we could improve this, I think though that swapping

[jira] [Updated] (SPARK-22083) When dropping multiple blocks to disk, Spark should release all locks on a failure

2017-10-03 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22083: Fix Version/s: 2.1.2 > When dropping multiple blocks to disk, Spark should release all locks on a >

[jira] [Updated] (SPARK-18971) Netty issue may cause the shuffle client hang

2017-10-03 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18971: Fix Version/s: 2.1.2 > Netty issue may cause the shuffle client hang >

[jira] [Updated] (SPARK-22167) Spark Packaging w/R distro issues

2017-10-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22167: Fix Version/s: 2.3.0 2.2.1 2.1.2 > Spark Packaging w/R distro issues

[jira] [Resolved] (SPARK-22167) Spark Packaging w/R distro issues

2017-10-02 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22167. - Resolution: Fixed > Spark Packaging w/R distro issues > - > >

[jira] [Updated] (SPARK-22167) Spark Packaging w/R distro issues

2017-09-30 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22167: Target Version/s: 2.1.2, 2.2.1, 2.3.0 (was: 2.1.2, 2.3.0) > Spark Packaging w/R distro issues >

[jira] [Updated] (SPARK-22167) Spark Packaging w/R distro issues

2017-09-30 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22167: Target Version/s: 2.1.2, 2.3.0 (was: 2.1.2) > Spark Packaging w/R distro issues >

[jira] [Commented] (SPARK-22167) Spark Packaging w/R distro issues

2017-09-30 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16187124#comment-16187124 ] holdenk commented on SPARK-22167: - So some more debugging, it does not appear to be a race condition.

[jira] [Commented] (SPARK-22167) Spark Packaging w/R distro issues

2017-09-29 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16186008#comment-16186008 ] holdenk commented on SPARK-22167: - So for some reason the R directory in the hadoop 2.7 build looks like:

[jira] [Commented] (SPARK-22167) Spark Packaging w/R distro issues

2017-09-29 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16185959#comment-16185959 ] holdenk commented on SPARK-22167: - Here is the build log

[jira] [Issue Comment Deleted] (SPARK-22167) Spark Packaging w/R distro issues

2017-09-29 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22167: Comment: was deleted (was: [^build.log]) > Spark Packaging w/R distro issues >

[jira] [Commented] (SPARK-22167) Spark Packaging w/R distro issues

2017-09-29 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16185956#comment-16185956 ] holdenk commented on SPARK-22167: - [^build.log] > Spark Packaging w/R distro issues >

[jira] [Created] (SPARK-22167) Spark Packaging w/R distro issues

2017-09-29 Thread holdenk (JIRA)
holdenk created SPARK-22167: --- Summary: Spark Packaging w/R distro issues Key: SPARK-22167 URL: https://issues.apache.org/jira/browse/SPARK-22167 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-22138) Allow retry during release-build

2017-09-29 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22138. - Resolution: Fixed Fix Version/s: 2.1.2 2.3.0 2.2.1 Issue

[jira] [Resolved] (SPARK-22129) Spark release scripts ignore the GPG_KEY and always sign with your default key

2017-09-29 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22129. - Resolution: Fixed Fix Version/s: 2.1.2 2.3.0 2.2.1 Issue

[jira] [Created] (SPARK-22138) Allow retry during release-build

2017-09-26 Thread holdenk (JIRA)
holdenk created SPARK-22138: --- Summary: Allow retry during release-build Key: SPARK-22138 URL: https://issues.apache.org/jira/browse/SPARK-22138 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-22129) Spark release scripts ignore the GPG_KEY and always sign with your default key

2017-09-26 Thread holdenk (JIRA)
holdenk created SPARK-22129: --- Summary: Spark release scripts ignore the GPG_KEY and always sign with your default key Key: SPARK-22129 URL: https://issues.apache.org/jira/browse/SPARK-22129 Project: Spark

[jira] [Commented] (SPARK-18136) Make PySpark pip install works on windows

2017-09-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16179172#comment-16179172 ] holdenk commented on SPARK-18136: - [~fobofindia09] So currently we're working on a 2.1.2 release, and I

[jira] [Updated] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18136: Fix Version/s: (was: 2.1.2) > Make PySpark pip install works on windows >

[jira] [Updated] (SPARK-18136) Make PySpark pip install works on windows

2017-09-23 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18136: Fix Version/s: 2.1.3 > Make PySpark pip install works on windows >

[jira] [Updated] (SPARK-16625) Oracle JDBC table creation fails with ORA-00902: invalid datatype

2017-09-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-16625: Fix Version/s: 2.1.2 > Oracle JDBC table creation fails with ORA-00902: invalid datatype >

[jira] [Resolved] (SPARK-22071) Improve release build scripts to check correct JAVA version is being used for build

2017-09-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22071. - Resolution: Fixed Fix Version/s: 2.1.2 2.3.0 2.2.1 Issue

[jira] [Resolved] (SPARK-22072) Allow the same shell params to be used for all of the different steps in release-build

2017-09-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-22072. - Resolution: Fixed Fix Version/s: 2.1.2 2.3.0 2.2.1 Issue

[jira] [Updated] (SPARK-22072) Allow the same shell params to be used for all of the different steps in release-build

2017-09-20 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22072: Description: The jenkins script currently sets SPARK_VERSION to different values depending on what action

<    1   2   3   4   5   6   7   8   9   10   >