[jira] [Resolved] (SPARK-23109) ML 2.3 QA: API: Python API coverage

2018-01-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-23109. -- Resolution: Done > ML 2.3 QA: API: Python API coverage > --- >

[jira] [Commented] (SPARK-23109) ML 2.3 QA: API: Python API coverage

2018-01-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16343665#comment-16343665 ] Bryan Cutler commented on SPARK-23109: -- Thanks [~mlnick], yes this is done. > ML 2.

[jira] [Comment Edited] (SPARK-23109) ML 2.3 QA: API: Python API coverage

2018-01-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16332698#comment-16332698 ] Bryan Cutler edited comment on SPARK-23109 at 1/29/18 5:25 PM:

[jira] [Comment Edited] (SPARK-23109) ML 2.3 QA: API: Python API coverage

2018-01-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16332698#comment-16332698 ] Bryan Cutler edited comment on SPARK-23109 at 1/29/18 5:26 PM:

[jira] [Created] (SPARK-23258) Should not split Arrow record batches based on row count

2018-01-29 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-23258: Summary: Should not split Arrow record batches based on row count Key: SPARK-23258 URL: https://issues.apache.org/jira/browse/SPARK-23258 Project: Spark Issu

[jira] [Commented] (SPARK-25344) Break large PySpark unittests into smaller files

2018-11-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16685613#comment-16685613 ] Bryan Cutler commented on SPARK-25344: -- [~hyukjin.kwon] no problem, I can take on M

[jira] [Comment Edited] (SPARK-26200) Column values are incorrectly transposed when a field in a PySpark Row requires serialization

2018-11-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16704092#comment-16704092 ] Bryan Cutler edited comment on SPARK-26200 at 11/30/18 12:56 AM: -

[jira] [Commented] (SPARK-26200) Column values are incorrectly transposed when a field in a PySpark Row requires serialization

2018-11-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16704092#comment-16704092 ] Bryan Cutler commented on SPARK-26200: -- I think this is a duplicate of https://iss

[jira] [Commented] (SPARK-26200) Column values are incorrectly transposed when a field in a PySpark Row requires serialization

2018-11-30 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16705224#comment-16705224 ] Bryan Cutler commented on SPARK-26200: -- Thanks [~davidlyness], I'll mark this as a

[jira] [Resolved] (SPARK-26200) Column values are incorrectly transposed when a field in a PySpark Row requires serialization

2018-11-30 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-26200. -- Resolution: Duplicate > Column values are incorrectly transposed when a field in a PySpark Row

[jira] [Assigned] (SPARK-25274) Improve toPandas with Arrow by sending out-of-order record batches

2018-12-06 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler reassigned SPARK-25274: Assignee: Bryan Cutler > Improve toPandas with Arrow by sending out-of-order record batch

[jira] [Resolved] (SPARK-25274) Improve toPandas with Arrow by sending out-of-order record batches

2018-12-06 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-25274. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 22275 https://git

[jira] [Assigned] (SPARK-24333) Add fit with validation set to spark.ml GBT: Python API

2018-12-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler reassigned SPARK-24333: Assignee: Huaxin Gao > Add fit with validation set to spark.ml GBT: Python API >

[jira] [Resolved] (SPARK-24333) Add fit with validation set to spark.ml GBT: Python API

2018-12-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-24333. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 21465 [https://gi

[jira] [Commented] (SPARK-26315) auto cast threshold from Integer to Float in approxSimilarityJoin of BucketedRandomProjectionLSHModel

2018-12-11 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16717873#comment-16717873 ] Bryan Cutler commented on SPARK-26315: -- I believe {{def approxSimilarityJoin(...)}}

[jira] [Commented] (SPARK-9844) File appender race condition during SparkWorker shutdown

2016-02-17 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15150913#comment-15150913 ] Bryan Cutler commented on SPARK-9844: - This error is benign for the most part, once it

[jira] [Created] (SPARK-13500) Add an example for LDA in PySpark

2016-02-25 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-13500: Summary: Add an example for LDA in PySpark Key: SPARK-13500 URL: https://issues.apache.org/jira/browse/SPARK-13500 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-13500) Add an example for LDA in PySpark

2016-02-25 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168021#comment-15168021 ] Bryan Cutler commented on SPARK-13500: -- I'm working on it :D > Add an example for L

[jira] [Resolved] (SPARK-13500) Add an example for LDA in PySpark

2016-02-26 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-13500. -- Resolution: Duplicate this example and others are being added as part of this > Add an example

[jira] [Resolved] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2016-02-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-11219. -- Resolution: Done Fix Version/s: 2.0.0 > Make Parameter Description Format Consistent in

[jira] [Commented] (SPARK-13430) Expose ml summary function in PySpark for classification and regression models

2016-02-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15172775#comment-15172775 ] Bryan Cutler commented on SPARK-13430: -- I can work on adding this > Expose ml summa

[jira] [Created] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-13625: Summary: PySpark-ML method to get list of params for an obj should not check property attr Key: SPARK-13625 URL: https://issues.apache.org/jira/browse/SPARK-13625 Pro

[jira] [Commented] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15176655#comment-15176655 ] Bryan Cutler commented on SPARK-13625: -- I have a fix for this, will post PR soon >

[jira] [Updated] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-13625: - Description: In PySpark params.__init__.py, the method {{Param.params()}} returns a list of Para

[jira] [Updated] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-13625: - Description: In PySpark params.__init__.py, the method {{Param.params()}} returns a list of Para

[jira] [Updated] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-13625: - Description: In PySpark params.__init__.py, the method {{Param.params()}} returns a list of Para

[jira] [Commented] (SPARK-13602) o.a.s.deploy.worker.DriverRunner may leak the driver processes

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15176787#comment-15176787 ] Bryan Cutler commented on SPARK-13602: -- Hi [~zsxwing], mind if I work on this one?

[jira] [Commented] (SPARK-13602) o.a.s.deploy.worker.DriverRunner may leak the driver processes

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15177169#comment-15177169 ] Bryan Cutler commented on SPARK-13602: -- Great! Thanks :D > o.a.s.deploy.worker.Driv

[jira] [Commented] (SPARK-13691) Scala and Python generate inconsistent results

2016-03-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15183438#comment-15183438 ] Bryan Cutler commented on SPARK-13691: -- The reason for this is that Pyspark serializ

[jira] [Commented] (SPARK-13967) Add binary toggle Param to PySpark CountVectorizer

2016-03-18 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15201803#comment-15201803 ] Bryan Cutler commented on SPARK-13967: -- Sure, I'd like to do this - thanks! > Add b

[jira] [Created] (SPARK-13937) PySpark ML JavaWrapper, variable _java_obj should not be static

2016-03-18 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-13937: Summary: PySpark ML JavaWrapper, variable _java_obj should not be static Key: SPARK-13937 URL: https://issues.apache.org/jira/browse/SPARK-13937 Project: Spark

[jira] [Commented] (SPARK-13963) Add binary toggle Param to ml.HashingTF

2016-03-18 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1520#comment-1520 ] Bryan Cutler commented on SPARK-13963: -- Hi [~mlnick], mind if I work on this? > Add

[jira] [Commented] (SPARK-13691) Scala and Python generate inconsistent results

2016-03-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15197775#comment-15197775 ] Bryan Cutler commented on SPARK-13691: -- Since the problem comes from the structure o

[jira] [Commented] (SPARK-13937) PySpark ML JavaWrapper, variable _java_obj should not be static

2016-03-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15197805#comment-15197805 ] Bryan Cutler commented on SPARK-13937: -- I'll submit a PR for this > PySpark ML Java

[jira] [Created] (SPARK-14087) PySpark ML JavaModel does not properly own params after being fit

2016-03-22 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-14087: Summary: PySpark ML JavaModel does not properly own params after being fit Key: SPARK-14087 URL: https://issues.apache.org/jira/browse/SPARK-14087 Project: Spark

[jira] [Updated] (SPARK-14087) PySpark ML JavaModel does not properly own params after being fit

2016-03-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-14087: - Attachment: feature.py > PySpark ML JavaModel does not properly own params after being fit >

[jira] [Commented] (SPARK-14087) PySpark ML JavaModel does not properly own params after being fit

2016-03-22 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15207555#comment-15207555 ] Bryan Cutler commented on SPARK-14087: -- I can post a PR for this > PySpark ML JavaM

[jira] [Commented] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-06-15 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14587081#comment-14587081 ] Bryan Cutler commented on SPARK-7127: - Hi [~josephkb], I added some commits that allo

[jira] [Comment Edited] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-06-15 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14587081#comment-14587081 ] Bryan Cutler edited comment on SPARK-7127 at 6/16/15 3:47 AM: --

[jira] [Commented] (SPARK-8400) ml.ALS doesn't handle -1 block size

2015-06-16 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14589039#comment-14589039 ] Bryan Cutler commented on SPARK-8400: - I could do this. Just to clarify, if the user

[jira] [Created] (SPARK-8444) Add Python example in streaming for queueStream usage

2015-06-18 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-8444: --- Summary: Add Python example in streaming for queueStream usage Key: SPARK-8444 URL: https://issues.apache.org/jira/browse/SPARK-8444 Project: Spark Issue Type:

[jira] [Commented] (SPARK-8400) ml.ALS doesn't handle -1 block size

2015-06-18 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592593#comment-14592593 ] Bryan Cutler commented on SPARK-8400: - Ok, sounds good. > ml.ALS doesn't handle -1 bl

[jira] [Commented] (SPARK-8400) ml.ALS doesn't handle -1 block size

2015-07-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14617359#comment-14617359 ] Bryan Cutler commented on SPARK-8400: - Hi [~mengxr], yes I am. I'll hopefully have a

[jira] [Commented] (SPARK-8919) Add @since tags to mllib.recommendation

2015-07-09 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14620834#comment-14620834 ] Bryan Cutler commented on SPARK-8919: - I'll take this one [~mengxr], thanks! > Add @s

[jira] [Commented] (SPARK-8400) ml.ALS doesn't handle -1 block size

2015-07-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14625483#comment-14625483 ] Bryan Cutler commented on SPARK-8400: - Hi [~mengxr], just in case you missed my commen

[jira] [Commented] (SPARK-8924) Add @since tags to mllib.tree

2015-07-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14625619#comment-14625619 ] Bryan Cutler commented on SPARK-8924: - I can knock this one out > Add @since tags to

[jira] [Commented] (SPARK-10158) ALS should print better errors when given Long IDs

2015-10-29 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980920#comment-14980920 ] Bryan Cutler commented on SPARK-10158: -- I made a quick fix for this. When using Lon

[jira] [Commented] (SPARK-10158) ALS should print better errors when given Long IDs

2015-10-30 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14983644#comment-14983644 ] Bryan Cutler commented on SPARK-10158: -- The only way I can see handling this from th

[jira] [Comment Edited] (SPARK-10158) ALS should print better errors when given Long IDs

2015-10-31 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14983644#comment-14983644 ] Bryan Cutler edited comment on SPARK-10158 at 10/31/15 7:05 AM: ---

[jira] [Commented] (SPARK-4557) Spark Streaming' foreachRDD method should accept a VoidFunction<...>, not a Function<..., Void>

2015-11-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14986101#comment-14986101 ] Bryan Cutler commented on SPARK-4557: - Hi [~somi...@us.ibm.com], the right way to mak

[jira] [Commented] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2015-11-06 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14994096#comment-14994096 ] Bryan Cutler commented on SPARK-11219: -- Good point about the default value placement

[jira] [Updated] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2015-11-06 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-11219: - Description: There are several different formats for describing params in PySpark.MLlib, making

[jira] [Updated] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2015-11-06 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-11219: - Description: There are several different formats for describing params in PySpark.MLlib, making

[jira] [Commented] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2015-11-06 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14994615#comment-14994615 ] Bryan Cutler commented on SPARK-10086: -- I've been able to reproduce this locally, bu

[jira] [Commented] (SPARK-11713) Initial RDD for updateStateByKey for pyspark

2015-11-16 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15007004#comment-15007004 ] Bryan Cutler commented on SPARK-11713: -- I could work on this > Initial RDD for upda

[jira] [Commented] (SPARK-12062) Master rebuilding historical SparkUI should be asynchronous

2015-12-01 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15034395#comment-15034395 ] Bryan Cutler commented on SPARK-12062: -- Hi [~andrewor14], I'd like work on this > M

[jira] [Commented] (SPARK-11928) Master retry deadlock

2015-12-01 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15034797#comment-15034797 ] Bryan Cutler commented on SPARK-11928: -- I was able to reproduce the {{RejectedExecut

[jira] [Updated] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2015-12-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-11219: - Description: There are several different formats for describing params in PySpark.MLlib, making

[jira] [Commented] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2015-12-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15036470#comment-15036470 ] Bryan Cutler commented on SPARK-11219: -- I added an assessment of the current state o

[jira] [Commented] (SPARK-11928) Master retry deadlock

2015-12-09 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15049078#comment-15049078 ] Bryan Cutler commented on SPARK-11928: -- I believe the {{RejectedExecutionException}}

[jira] [Commented] (SPARK-11928) Master retry deadlock

2015-12-09 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15049118#comment-15049118 ] Bryan Cutler commented on SPARK-11928: -- Confirmed, I no longer see that exception an

[jira] [Commented] (SPARK-12062) Master rebuilding historical SparkUI should be asynchronous

2015-12-11 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15053963#comment-15053963 ] Bryan Cutler commented on SPARK-12062: -- Hi [~andrewor14], I have a PR ready for this

[jira] [Commented] (SPARK-12062) Master rebuilding historical SparkUI should be asynchronous

2015-12-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15055205#comment-15055205 ] Bryan Cutler commented on SPARK-12062: -- I read the past conversations discussing thi

[jira] [Created] (SPARK-12630) Make Parameter Descriptions Consistent for PySpark MLlib Classification

2016-01-04 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-12630: Summary: Make Parameter Descriptions Consistent for PySpark MLlib Classification Key: SPARK-12630 URL: https://issues.apache.org/jira/browse/SPARK-12630 Project: Spar

[jira] [Created] (SPARK-12631) Make Parameter Descriptions Consistent for PySpark MLlib Clustering

2016-01-04 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-12631: Summary: Make Parameter Descriptions Consistent for PySpark MLlib Clustering Key: SPARK-12631 URL: https://issues.apache.org/jira/browse/SPARK-12631 Project: Spark

[jira] [Created] (SPARK-12632) Make Parameter Descriptions Consistent for PySpark MLlib FPM and Recommendation

2016-01-04 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-12632: Summary: Make Parameter Descriptions Consistent for PySpark MLlib FPM and Recommendation Key: SPARK-12632 URL: https://issues.apache.org/jira/browse/SPARK-12632 Proje

[jira] [Created] (SPARK-12633) Make Parameter Descriptions Consistent for PySpark MLlib Regression

2016-01-04 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-12633: Summary: Make Parameter Descriptions Consistent for PySpark MLlib Regression Key: SPARK-12633 URL: https://issues.apache.org/jira/browse/SPARK-12633 Project: Spark

[jira] [Created] (SPARK-12634) Make Parameter Descriptions Consistent for PySpark MLlib Tree

2016-01-04 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-12634: Summary: Make Parameter Descriptions Consistent for PySpark MLlib Tree Key: SPARK-12634 URL: https://issues.apache.org/jira/browse/SPARK-12634 Project: Spark

[jira] [Updated] (SPARK-12632) Make Parameter Descriptions Consistent for PySpark MLlib FPM and Recommendation

2016-01-04 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-12632: - Remaining Estimate: 1h Original Estimate: 1h > Make Parameter Descriptions Consistent for Py

[jira] [Updated] (SPARK-12631) Make Parameter Descriptions Consistent for PySpark MLlib Clustering

2016-01-04 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-12631: - Remaining Estimate: 1h Original Estimate: 1h > Make Parameter Descriptions Consistent for Py

[jira] [Updated] (SPARK-12634) Make Parameter Descriptions Consistent for PySpark MLlib Tree

2016-01-04 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-12634: - Remaining Estimate: 1h Original Estimate: 1h > Make Parameter Descriptions Consistent for Py

[jira] [Updated] (SPARK-12633) Make Parameter Descriptions Consistent for PySpark MLlib Regression

2016-01-04 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-12633: - Remaining Estimate: 1h Original Estimate: 1h > Make Parameter Descriptions Consistent for Py

[jira] [Commented] (SPARK-12631) Make Parameter Descriptions Consistent for PySpark MLlib Clustering

2016-01-04 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15081902#comment-15081902 ] Bryan Cutler commented on SPARK-12631: -- I'll start on this one, so if anyone else wa

[jira] [Commented] (SPARK-12632) Make Parameter Descriptions Consistent for PySpark MLlib FPM and Recommendation

2016-01-05 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15084461#comment-15084461 ] Bryan Cutler commented on SPARK-12632: -- [~somi...@us.ibm.com] recommendation.py is m

[jira] [Commented] (SPARK-9844) File appender race condition during SparkWorker shutdown

2016-01-06 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15086202#comment-15086202 ] Bryan Cutler commented on SPARK-9844: - I came across this error recently too as of 1.6

[jira] [Created] (SPARK-12701) Logging FileAppender should use join to ensure thread is finished

2016-01-07 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-12701: Summary: Logging FileAppender should use join to ensure thread is finished Key: SPARK-12701 URL: https://issues.apache.org/jira/browse/SPARK-12701 Project: Spark

[jira] [Commented] (SPARK-12701) Logging FileAppender should use join to ensure thread is finished

2016-01-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15088429#comment-15088429 ] Bryan Cutler commented on SPARK-12701: -- I can submit a PR for this. > Logging FileA

[jira] [Updated] (SPARK-12701) Logging FileAppender should use join to ensure thread is finished

2016-01-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-12701: - Issue Type: Improvement (was: Bug) > Logging FileAppender should use join to ensure thread is fi

[jira] [Comment Edited] (SPARK-12701) Logging FileAppender should use join to ensure thread is finished

2016-01-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15088429#comment-15088429 ] Bryan Cutler edited comment on SPARK-12701 at 1/8/16 12:07 AM:

[jira] [Commented] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2016-01-08 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15090292#comment-15090292 ] Bryan Cutler commented on SPARK-11219: -- That's my fault [~josephkb], I was instructi

[jira] [Commented] (SPARK-12299) Remove history serving functionality from standalone Master

2016-01-18 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15105845#comment-15105845 ] Bryan Cutler commented on SPARK-12299: -- I'd be happy to work on this since I recentl

[jira] [Commented] (SPARK-11219) Make Parameter Description Format Consistent in PySpark.MLlib

2016-01-25 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15115756#comment-15115756 ] Bryan Cutler commented on SPARK-11219: -- Regarding overall style in PySpark, I genera

[jira] [Commented] (SPARK-12986) Fix pydoc warnings in mllib/regression.py

2016-01-25 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15116153#comment-15116153 ] Bryan Cutler commented on SPARK-12986: -- It looks like this is caused by an indented

[jira] [Commented] (SPARK-12731) PySpark docstring cleanup

2016-02-01 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15126886#comment-15126886 ] Bryan Cutler commented on SPARK-12731: -- Just to add my 2cents since I've been workin

[jira] [Commented] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2016-02-12 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145625#comment-15145625 ] Bryan Cutler commented on SPARK-10086: -- I was able to track down the cause of these

[jira] [Updated] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2016-02-12 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-10086: - Attachment: flakyRepro.py Simple script with similar operations to this StreamingKMeans test, use

[jira] [Comment Edited] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2016-02-12 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15145628#comment-15145628 ] Bryan Cutler edited comment on SPARK-10086 at 2/13/16 12:44 AM: ---

[jira] [Commented] (SPARK-8400) ml.ALS doesn't handle -1 block size

2015-08-21 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14707409#comment-14707409 ] Bryan Cutler commented on SPARK-8400: - No problem! It does LocalIndexEncoder once tra

[jira] [Commented] (SPARK-6931) python: struct.pack('!q', value) in write_long(value, stream) in serializers.py require int(but doesn't raise exceptions in common cases)

2015-09-03 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729939#comment-14729939 ] Bryan Cutler commented on SPARK-6931: - I can backport the fix for this > python: stru

[jira] [Commented] (SPARK-14087) PySpark ML JavaModel does not properly own params after being fit

2016-04-04 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15225326#comment-15225326 ] Bryan Cutler commented on SPARK-14087: -- I don't think this would completely solve it

[jira] [Created] (SPARK-14472) Cleanup PySpark-ML Java wrapper classes so that JavaWrapper will inherit from JavaCallable

2016-04-07 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-14472: Summary: Cleanup PySpark-ML Java wrapper classes so that JavaWrapper will inherit from JavaCallable Key: SPARK-14472 URL: https://issues.apache.org/jira/browse/SPARK-14472

[jira] [Commented] (SPARK-14472) Cleanup PySpark-ML Java wrapper classes so that JavaWrapper will inherit from JavaCallable

2016-04-07 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15231320#comment-15231320 ] Bryan Cutler commented on SPARK-14472: -- I'm working on it :D > Cleanup PySpark-ML J

[jira] [Commented] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2016-04-08 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15232888#comment-15232888 ] Bryan Cutler commented on SPARK-10086: -- The changes to the test I proposed earlier a

[jira] [Created] (SPARK-14779) Incorrect log message in Worker while handling KillExecutor message

2016-04-20 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-14779: Summary: Incorrect log message in Worker while handling KillExecutor message Key: SPARK-14779 URL: https://issues.apache.org/jira/browse/SPARK-14779 Project: Spark

[jira] [Commented] (SPARK-28482) Data incomplete when using pandas udf in Python 3

2019-08-21 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16912763#comment-16912763 ] Bryan Cutler commented on SPARK-28482: -- [~jiangyu1211] I was not able to reproduce.

[jira] [Commented] (SPARK-28482) Data incomplete when using pandas udf in Python 3

2019-08-22 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16913611#comment-16913611 ] Bryan Cutler commented on SPARK-28482: -- I'm not really sure what you are doing abov

[jira] [Resolved] (SPARK-28482) Data incomplete when using pandas udf in Python 3

2019-08-23 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-28482. -- Resolution: Not A Problem No problem [~jiangyu1211] ! I will resolve this then. In general, I

[jira] [Assigned] (SPARK-28858) add tree-based transformation in the py side

2019-08-23 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler reassigned SPARK-28858: Assignee: zhengruifeng > add tree-based transformation in the py side > -

[jira] [Resolved] (SPARK-28858) add tree-based transformation in the py side

2019-08-23 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-28858. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25566 [https://gi

[jira] [Created] (SPARK-29040) Support pyspark.createDataFrame from a pyarrow.Table

2019-09-10 Thread Bryan Cutler (Jira)
Bryan Cutler created SPARK-29040: Summary: Support pyspark.createDataFrame from a pyarrow.Table Key: SPARK-29040 URL: https://issues.apache.org/jira/browse/SPARK-29040 Project: Spark Issue Ty

<    1   2   3   4   5   6   7   8   >