[jira] [Created] (SPARK-25124) VectorSizeHint.size is buggy, breaking streaming pipeline

2018-08-15 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-25124: -- Summary: VectorSizeHint.size is buggy, breaking streaming pipeline Key: SPARK-25124 URL: https://issues.apache.org/jira/browse/SPARK-25124 Project: Spark

[jira] [Commented] (SPARK-23996) Implement the optimal KLL algorithms for quantiles in streams

2018-04-23 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16447781#comment-16447781 ] Timothy Hunter commented on SPARK-23996: [~wm624] yes this is the implementation:

[jira] [Created] (SPARK-23996) Implement the optimal KLL algorithms for quantiles in streams

2018-04-16 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-23996: -- Summary: Implement the optimal KLL algorithms for quantiles in streams Key: SPARK-23996 URL: https://issues.apache.org/jira/browse/SPARK-23996 Project: Spark

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-11-30 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16273575#comment-16273575 ] Timothy Hunter commented on SPARK-21866: [~josephkb] I have created a separate ticket to continue

[jira] [Created] (SPARK-22666) Spark reader source for image format

2017-11-30 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-22666: -- Summary: Spark reader source for image format Key: SPARK-22666 URL: https://issues.apache.org/jira/browse/SPARK-22666 Project: Spark Issue Type:

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-11-11 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16248628#comment-16248628 ] Timothy Hunter commented on SPARK-21866: [~josephkb] if I am not mistaken, the image code is

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-11-03 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237731#comment-16237731 ] Timothy Hunter commented on SPARK-21866: Adding {{spark.read.image}} is going to create a (soft)

[jira] [Commented] (SPARK-8515) Improve ML attribute API

2017-10-16 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16206111#comment-16206111 ] Timothy Hunter commented on SPARK-8515: --- Before we commit to an implementation, we should think

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-09-21 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16175158#comment-16175158 ] Timothy Hunter commented on SPARK-21866: Putting this code under {{org.apache.spark.ml.image}}

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-09-05 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16154132#comment-16154132 ] Timothy Hunter commented on SPARK-21866: [~yanboliang] thanks you for the comments. Regarding

[jira] [Updated] (SPARK-21866) SPIP: Image support in Spark

2017-08-31 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-21866: --- Attachment: (was: SPIP - Image support for Apache Spark.pdf) > SPIP: Image support in

[jira] [Updated] (SPARK-21866) SPIP: Image support in Spark

2017-08-31 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-21866: --- Attachment: SPIP - Image support for Apache Spark V1.1.pdf Updated authors' list. > SPIP:

[jira] [Commented] (SPARK-21184) QuantileSummaries implementation is wrong and QuantileSummariesSuite fails with larger n

2017-08-31 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149510#comment-16149510 ] Timothy Hunter commented on SPARK-21184: [~a1ray] thank you for the report, someone should

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-08-31 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149186#comment-16149186 ] Timothy Hunter commented on SPARK-21866: [~srowen] thank you for the comments. Indeed, this

[jira] [Updated] (SPARK-21866) SPIP: Image support in Spark

2017-08-29 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-21866: --- Attachment: SPIP - Image support for Apache Spark.pdf > SPIP: Image support in Spark >

[jira] [Created] (SPARK-21866) SPIP: Image support in Spark

2017-08-29 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-21866: -- Summary: SPIP: Image support in Spark Key: SPARK-21866 URL: https://issues.apache.org/jira/browse/SPARK-21866 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-03-27 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15944215#comment-15944215 ] Timothy Hunter commented on SPARK-19634: [~sethah], yes, thanks for bringing up these concerns.

[jira] [Commented] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-03-27 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15944019#comment-15944019 ] Timothy Hunter commented on SPARK-19634: [~dongjin] [~wm624] sorry it looks like I missed your

[jira] [Commented] (SPARK-20111) codegen bug surfaced by GraphFrames issue 165

2017-03-27 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15944004#comment-15944004 ] Timothy Hunter commented on SPARK-20111: As Spark SQL is making more and more forays into code

[jira] [Created] (SPARK-20077) Documentation for ml.stats.Correlation

2017-03-23 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-20077: -- Summary: Documentation for ml.stats.Correlation Key: SPARK-20077 URL: https://issues.apache.org/jira/browse/SPARK-20077 Project: Spark Issue Type:

[jira] [Created] (SPARK-20076) Python interface for ml.stats.Correlation

2017-03-23 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-20076: -- Summary: Python interface for ml.stats.Correlation Key: SPARK-20076 URL: https://issues.apache.org/jira/browse/SPARK-20076 Project: Spark Issue Type:

[jira] [Commented] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-03-13 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15923119#comment-15923119 ] Timothy Hunter commented on SPARK-19634: I was not able to finish it in time, but the bulk of the

[jira] [Commented] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-02-27 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886900#comment-15886900 ] Timothy Hunter commented on SPARK-19634: [~wm624] were you able to start to work on this task? I

[jira] [Commented] (SPARK-19635) Feature parity for Chi-square hypothesis testing in MLlib

2017-02-23 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15881471#comment-15881471 ] Timothy Hunter commented on SPARK-19635: After working on it, I realized that Column operations

[jira] [Commented] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-23 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15881457#comment-15881457 ] Timothy Hunter commented on SPARK-19636: After working on it, I realized that Column operations

[jira] [Commented] (SPARK-19573) Make NaN/null handling consistent in approxQuantile

2017-02-22 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15879387#comment-15879387 ] Timothy Hunter commented on SPARK-19573: I do not have too strong an opinion, as long as: 1. we

[jira] [Commented] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-21 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15877128#comment-15877128 ] Timothy Hunter commented on SPARK-19636: Looking more closely at the code, it makes sense to

[jira] [Commented] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-21 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15876929#comment-15876929 ] Timothy Hunter commented on SPARK-19636: Unless someone has started to work on this task, I will

[jira] [Created] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-16 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-19636: -- Summary: Feature parity for correlation statistics in MLlib Key: SPARK-19636 URL: https://issues.apache.org/jira/browse/SPARK-19636 Project: Spark Issue

[jira] [Created] (SPARK-19635) Feature parity for Chi-square hypothesis testing in MLlib

2017-02-16 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-19635: -- Summary: Feature parity for Chi-square hypothesis testing in MLlib Key: SPARK-19635 URL: https://issues.apache.org/jira/browse/SPARK-19635 Project: Spark

[jira] [Created] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-02-16 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-19634: -- Summary: Feature parity for descriptive statistics in MLlib Key: SPARK-19634 URL: https://issues.apache.org/jira/browse/SPARK-19634 Project: Spark Issue

[jira] [Commented] (SPARK-19208) MultivariateOnlineSummarizer performance optimization

2017-02-16 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15870655#comment-15870655 ] Timothy Hunter commented on SPARK-19208: I put together the ideas in this thread into a document.

[jira] [Commented] (SPARK-19208) MultivariateOnlineSummarizer performance optimization

2017-02-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15866768#comment-15866768 ] Timothy Hunter commented on SPARK-19208: Yes, I meant returning a struct and then projecting this

[jira] [Comment Edited] (SPARK-19208) MultivariateOnlineSummarizer performance optimization

2017-02-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15866714#comment-15866714 ] Timothy Hunter edited comment on SPARK-19208 at 2/14/17 9:24 PM: - Thanks

[jira] [Commented] (SPARK-19208) MultivariateOnlineSummarizer performance optimization

2017-02-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15866714#comment-15866714 ] Timothy Hunter commented on SPARK-19208: Thanks for the clarification [~mlnick]. I was a bit

[jira] [Comment Edited] (SPARK-19208) MultivariateOnlineSummarizer performance optimization

2017-02-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15866535#comment-15866535 ] Timothy Hunter edited comment on SPARK-19208 at 2/14/17 8:04 PM: - I am

[jira] [Commented] (SPARK-19208) MultivariateOnlineSummarizer performance optimization

2017-02-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15866535#comment-15866535 ] Timothy Hunter commented on SPARK-19208: I am not sure if we should follow the Estimator API for

[jira] [Commented] (SPARK-14523) Feature parity for Statistics ML with MLlib

2017-02-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15866295#comment-15866295 ] Timothy Hunter commented on SPARK-14523: Also, the correlation is missing the multivariate case.

[jira] [Commented] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2017-02-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15866288#comment-15866288 ] Timothy Hunter commented on SPARK-4591: --- [~josephkb] do you also want some subtasks for

[jira] [Commented] (SPARK-8884) 1-sample Anderson-Darling Goodness-of-Fit test

2016-11-11 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15657825#comment-15657825 ] Timothy Hunter commented on SPARK-8884: --- I do not have a strong preference either way. We should

[jira] [Commented] (SPARK-8884) 1-sample Anderson-Darling Goodness-of-Fit test

2016-11-10 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15655563#comment-15655563 ] Timothy Hunter commented on SPARK-8884: --- [~srowen] this ticket should still be open I believe?

[jira] [Commented] (SPARK-17845) Improve window function frame boundary API in DataFrame

2016-10-11 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15566380#comment-15566380 ] Timothy Hunter commented on SPARK-17845: I like the {{Window.rowsBetween(Long.MinValue, -3)}}

[jira] [Commented] (SPARK-17219) QuantileDiscretizer does strange things with NaN values

2016-10-06 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15553490#comment-15553490 ] Timothy Hunter commented on SPARK-17219: If I understand correctly the PR, I am concerned by this

[jira] [Commented] (SPARK-17074) generate histogram information for column

2016-09-30 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15537295#comment-15537295 ] Timothy Hunter commented on SPARK-17074: We have discussed this through email and either is fine.

[jira] [Updated] (SPARK-16485) Additional fixes to Mllib 2.0 documentation

2016-07-11 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-16485: --- Description: While reviewing the documentation of MLlib, I found some additional issues.

[jira] [Commented] (SPARK-14816) Update MLlib, GraphX, SparkR websites for 2.0

2016-07-11 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15371504#comment-15371504 ] Timothy Hunter commented on SPARK-14816: Also, in `mllib-guide.md`, let's switch the order

[jira] [Created] (SPARK-16485) Additional fixes to Mllib 2.0 documentation

2016-07-11 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-16485: -- Summary: Additional fixes to Mllib 2.0 documentation Key: SPARK-16485 URL: https://issues.apache.org/jira/browse/SPARK-16485 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12922) Implement gapply() on DataFrame in SparkR

2016-06-28 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15353374#comment-15353374 ] Timothy Hunter commented on SPARK-12922: I opened a separate JIRA for that issue: SPARK-16258 >

[jira] [Created] (SPARK-16258) Automatically append the grouping keys in SparkR's gapply

2016-06-28 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-16258: -- Summary: Automatically append the grouping keys in SparkR's gapply Key: SPARK-16258 URL: https://issues.apache.org/jira/browse/SPARK-16258 Project: Spark

[jira] [Commented] (SPARK-12922) Implement gapply() on DataFrame in SparkR

2016-06-27 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15351311#comment-15351311 ] Timothy Hunter commented on SPARK-12922: [~Narine] while working on a similar function for python

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-06-21 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15342674#comment-15342674 ] Timothy Hunter commented on SPARK-15581: With respect to deep learning, I think it depends on

[jira] [Comment Edited] (SPARK-14816) Update MLlib, GraphX, SparkR websites for 2.0

2016-04-29 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264370#comment-15264370 ] Timothy Hunter edited comment on SPARK-14816 at 4/29/16 5:21 PM: - Also,

[jira] [Commented] (SPARK-14816) Update MLlib, GraphX, SparkR websites for 2.0

2016-04-29 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264370#comment-15264370 ] Timothy Hunter commented on SPARK-14816: Also, add a comment about the {{doparallel}} API >

[jira] [Commented] (SPARK-14571) Log instrumentation in ALS

2016-04-19 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249058#comment-15249058 ] Timothy Hunter commented on SPARK-14571: Yes, please feel free to take this task. Thanks! >

[jira] [Commented] (SPARK-7264) SparkR API for parallel functions

2016-04-15 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243584#comment-15243584 ] Timothy Hunter commented on SPARK-7264: --- I will have a PR for this soon. > SparkR API for parallel

[jira] [Commented] (SPARK-14569) Log instrumentation in KMeans

2016-04-15 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243238#comment-15243238 ] Timothy Hunter commented on SPARK-14569: [~iamshrek] thanks for taking a look! > Log

[jira] [Commented] (SPARK-14571) Log instrumentation in ALS

2016-04-13 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15239794#comment-15239794 ] Timothy Hunter commented on SPARK-14571: SPARK-14568 has been merged, so it should easy to follow

[jira] [Commented] (SPARK-14570) Log instrumentation in Random forests

2016-04-13 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15239775#comment-15239775 ] Timothy Hunter commented on SPARK-14570: SPARK-14568 has been merged, so it should easy to follow

[jira] [Commented] (SPARK-14569) Log instrumentation in KMeans

2016-04-13 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15239751#comment-15239751 ] Timothy Hunter commented on SPARK-14569: SPARK-14568 has been merged, so it should easy to follow

[jira] [Updated] (SPARK-14567) Add instrumentation logs to MLlib training algorithms

2016-04-12 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-14567: --- Description: In order to debug performance issues when training mllib algorithms, it is

[jira] [Updated] (SPARK-14567) Add instrumentation logs to MLlib training algorithms

2016-04-12 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-14567: --- Description: In order to debug performance issues when training mllib algorithms, it is

[jira] [Created] (SPARK-14571) Log instrumentation in ALS

2016-04-12 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-14571: -- Summary: Log instrumentation in ALS Key: SPARK-14571 URL: https://issues.apache.org/jira/browse/SPARK-14571 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-14570) Log instrumentation in Random forests

2016-04-12 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-14570: -- Summary: Log instrumentation in Random forests Key: SPARK-14570 URL: https://issues.apache.org/jira/browse/SPARK-14570 Project: Spark Issue Type:

[jira] [Created] (SPARK-14569) Log instrumentation in KMeans

2016-04-12 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-14569: -- Summary: Log instrumentation in KMeans Key: SPARK-14569 URL: https://issues.apache.org/jira/browse/SPARK-14569 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-14568) Log instrumentation in logistic regression as a first task

2016-04-12 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-14568: -- Summary: Log instrumentation in logistic regression as a first task Key: SPARK-14568 URL: https://issues.apache.org/jira/browse/SPARK-14568 Project: Spark

[jira] [Created] (SPARK-14567) Add instrumentation logs to MLlib training algorithms

2016-04-12 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-14567: -- Summary: Add instrumentation logs to MLlib training algorithms Key: SPARK-14567 URL: https://issues.apache.org/jira/browse/SPARK-14567 Project: Spark

[jira] [Created] (SPARK-14100) Merge StringIndexer and StringIndexerModel

2016-03-23 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-14100: -- Summary: Merge StringIndexer and StringIndexerModel Key: SPARK-14100 URL: https://issues.apache.org/jira/browse/SPARK-14100 Project: Spark Issue Type:

[jira] [Commented] (SPARK-13986) Make `DeveloperApi`-annotated things public

2016-03-19 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200525#comment-15200525 ] Timothy Hunter commented on SPARK-13986: [~dongjoon] how did you find the conflicting annotation?

[jira] [Commented] (SPARK-10931) PySpark ML Models should contain Param values

2016-03-10 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15190093#comment-15190093 ] Timothy Hunter commented on SPARK-10931: Using python decorators, it is fairly easy to

[jira] [Commented] (SPARK-12566) GLM model family, link function support in SparkR:::glm

2016-03-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15188057#comment-15188057 ] Timothy Hunter commented on SPARK-12566: [~yuhaoyan] I took a look at the current code, and it

[jira] [Commented] (SPARK-11569) StringIndexer transform fails when column contains nulls

2016-03-08 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15185444#comment-15185444 ] Timothy Hunter commented on SPARK-11569: Also, I suggest to look at Pandas' indexers, which have

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-23 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15070500#comment-15070500 ] Timothy Hunter commented on SPARK-12247: Sorry for the delay. That sounds great! Let me know when

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-22 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15068908#comment-15068908 ] Timothy Hunter commented on SPARK-12247: It seems to me that the calculation of false positives

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-21 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15066857#comment-15066857 ] Timothy Hunter commented on SPARK-12247: Thanks for working on it, [~BenFradet]! > Documentation

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-21 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15066854#comment-15066854 ] Timothy Hunter commented on SPARK-12247: If we could import all the code that builds the ratings

[jira] [Created] (SPARK-12324) The documentation sidebar does not collapse properly

2015-12-14 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-12324: -- Summary: The documentation sidebar does not collapse properly Key: SPARK-12324 URL: https://issues.apache.org/jira/browse/SPARK-12324 Project: Spark

[jira] [Updated] (SPARK-12324) The documentation sidebar does not collapse properly

2015-12-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-12324: --- Attachment: Screen Shot 2015-12-14 at 12.29.57 PM.png > The documentation sidebar does not

[jira] [Commented] (SPARK-12324) The documentation sidebar does not collapse properly

2015-12-14 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056677#comment-15056677 ] Timothy Hunter commented on SPARK-12324: I am creating a PR with a fix. cc [~josephkb] > The

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15049521#comment-15049521 ] Timothy Hunter commented on SPARK-12247: [~srowen] would you be interested in this task? >

[jira] [Created] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-09 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-12247: -- Summary: Documentation for spark.ml's ALS and collaborative filtering in general Key: SPARK-12247 URL: https://issues.apache.org/jira/browse/SPARK-12247 Project:

[jira] [Created] (SPARK-12246) Add documentation for spark.ml.clustering.kmeans

2015-12-09 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-12246: -- Summary: Add documentation for spark.ml.clustering.kmeans Key: SPARK-12246 URL: https://issues.apache.org/jira/browse/SPARK-12246 Project: Spark Issue

[jira] [Updated] (SPARK-12212) Clarify the distinction between spark.mllib and spark.ml

2015-12-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-12212: --- Target Version/s: (was: 1.6.0) > Clarify the distinction between spark.mllib and spark.ml

[jira] [Updated] (SPARK-12210) Small example that shows how to integrate spark.mllib with spark.ml

2015-12-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-12210: --- Target Version/s: (was: 1.6.0) > Small example that shows how to integrate spark.mllib

[jira] [Closed] (SPARK-12246) Add documentation for spark.ml.clustering.kmeans

2015-12-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter closed SPARK-12246. -- Resolution: Duplicate > Add documentation for spark.ml.clustering.kmeans >

[jira] [Commented] (SPARK-12246) Add documentation for spark.ml.clustering.kmeans

2015-12-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15049779#comment-15049779 ] Timothy Hunter commented on SPARK-12246: It does, thanks [~yuhaoyan] > Add documentation for

[jira] [Updated] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-12247: --- Target Version/s: (was: 1.6.0) > Documentation for spark.ml's ALS and collaborative

[jira] [Updated] (SPARK-12246) Add documentation for spark.ml.clustering.kmeans

2015-12-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-12246: --- Target Version/s: (was: 1.6.0) > Add documentation for spark.ml.clustering.kmeans >

[jira] [Updated] (SPARK-8517) Improve the organization and style of MLlib's user guide

2015-12-09 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-8517: -- Target Version/s: (was: 1.6.0) > Improve the organization and style of MLlib's user guide >

[jira] [Created] (SPARK-12208) Abstract the examples into a common place

2015-12-08 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-12208: -- Summary: Abstract the examples into a common place Key: SPARK-12208 URL: https://issues.apache.org/jira/browse/SPARK-12208 Project: Spark Issue Type:

[jira] [Created] (SPARK-12210) Small example that shows how to integrate spark.mllib with spark.ml

2015-12-08 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-12210: -- Summary: Small example that shows how to integrate spark.mllib with spark.ml Key: SPARK-12210 URL: https://issues.apache.org/jira/browse/SPARK-12210 Project:

[jira] [Created] (SPARK-12212) Clarify the distinction between spark.mllib and spark.ml

2015-12-08 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-12212: -- Summary: Clarify the distinction between spark.mllib and spark.ml Key: SPARK-12212 URL: https://issues.apache.org/jira/browse/SPARK-12212 Project: Spark

[jira] [Closed] (SPARK-11601) ML 1.6 QA: API: Binary incompatible changes

2015-12-01 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter closed SPARK-11601. -- Resolution: Done > ML 1.6 QA: API: Binary incompatible changes >

[jira] [Commented] (SPARK-12000) `sbt publishLocal` hits a Scala compiler bug caused by `Since` annotation

2015-11-30 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15032058#comment-15032058 ] Timothy Hunter commented on SPARK-12000: Yes, I have this branch with some fixes, but I would

[jira] [Commented] (SPARK-8517) Improve the organization and style of MLlib's user guide

2015-11-25 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027375#comment-15027375 ] Timothy Hunter commented on SPARK-8517: --- Here is a few comments I have at a high level: - branding

[jira] [Commented] (SPARK-8517) Improve the organization and style of MLlib's user guide

2015-11-25 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027491#comment-15027491 ] Timothy Hunter commented on SPARK-8517: --- - We need to make a whole page about how best practices

[jira] [Commented] (SPARK-8517) Improve the organization and style of MLlib's user guide

2015-11-25 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027512#comment-15027512 ] Timothy Hunter commented on SPARK-8517: --- - A couple of pages such as {{ml-ensembles}} and

[jira] [Created] (SPARK-2762) SparkILoop leaks memory in multi-repl configurations

2014-07-30 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-2762: - Summary: SparkILoop leaks memory in multi-repl configurations Key: SPARK-2762 URL: https://issues.apache.org/jira/browse/SPARK-2762 Project: Spark Issue

[jira] [Commented] (SPARK-2452) Multi-statement input to spark repl does not work

2014-07-22 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14070841#comment-14070841 ] Timothy Hunter commented on SPARK-2452: --- Excellent, thanks Patrick.

[jira] [Created] (SPARK-2452) Multi-statement input to spark repl does not work

2014-07-11 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-2452: - Summary: Multi-statement input to spark repl does not work Key: SPARK-2452 URL: https://issues.apache.org/jira/browse/SPARK-2452 Project: Spark Issue