[jira] [Commented] (SPARK-21152) Use level 3 BLAS operations in LogisticAggregator

2017-06-20 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16056025#comment-16056025 ] Seth Hendrickson commented on SPARK-21152: -- cc [~dbtsai] [~mlnick] [~srowen] BT

[jira] [Created] (SPARK-21152) Use level 3 BLAS operations in LogisticAggregator

2017-06-20 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-21152: Summary: Use level 3 BLAS operations in LogisticAggregator Key: SPARK-21152 URL: https://issues.apache.org/jira/browse/SPARK-21152 Project: Spark Iss

[jira] [Commented] (SPARK-21152) Use level 3 BLAS operations in LogisticAggregator

2017-06-23 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061158#comment-16061158 ] Seth Hendrickson commented on SPARK-21152: -- [~yanboliang] I can do performance t

[jira] [Created] (SPARK-21245) Resolve code duplication for classification/regression summarizers

2017-06-28 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-21245: Summary: Resolve code duplication for classification/regression summarizers Key: SPARK-21245 URL: https://issues.apache.org/jira/browse/SPARK-21245 Project: S

[jira] [Created] (SPARK-21405) Add LBFGS solver for GeneralizedLinearRegression

2017-07-13 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-21405: Summary: Add LBFGS solver for GeneralizedLinearRegression Key: SPARK-21405 URL: https://issues.apache.org/jira/browse/SPARK-21405 Project: Spark Issu

[jira] [Commented] (SPARK-21405) Add LBFGS solver for GeneralizedLinearRegression

2017-07-13 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16086071#comment-16086071 ] Seth Hendrickson commented on SPARK-21405: -- cc [~yanboliang] [~actuaryzhang] I'

[jira] [Created] (SPARK-21406) Add logLikelihood to GLR families

2017-07-13 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-21406: Summary: Add logLikelihood to GLR families Key: SPARK-21406 URL: https://issues.apache.org/jira/browse/SPARK-21406 Project: Spark Issue Type: Sub-tas

[jira] [Commented] (SPARK-21405) Add LBFGS solver for GeneralizedLinearRegression

2017-07-14 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16087418#comment-16087418 ] Seth Hendrickson commented on SPARK-21405: -- Good point, Nick. Though convenientl

[jira] [Updated] (SPARK-21245) Resolve code duplication for classification/regression summarizers

2017-07-26 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seth Hendrickson updated SPARK-21245: - Labels: starter (was: ) Priority: Minor (was: Major) > Resolve code duplication f

[jira] [Commented] (SPARK-4240) Refine Tree Predictions in Gradient Boosting to Improve Prediction Accuracy.

2016-07-01 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15358983#comment-15358983 ] Seth Hendrickson commented on SPARK-4240: - I had done some work on this in the pas

[jira] [Commented] (SPARK-16235) "evaluateEachIteration" is returning wrong results when calculated for classification model.

2016-07-01 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15359014#comment-15359014 ] Seth Hendrickson commented on SPARK-16235: -- To be clear, we are talking about ML

[jira] [Created] (SPARK-16404) LeastSquaresAggregator in Linear Regression serializes unnecessary data

2016-07-06 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-16404: Summary: LeastSquaresAggregator in Linear Regression serializes unnecessary data Key: SPARK-16404 URL: https://issues.apache.org/jira/browse/SPARK-16404 Proje

[jira] [Commented] (SPARK-16404) LeastSquaresAggregator in Linear Regression serializes unnecessary data

2016-07-06 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15365227#comment-15365227 ] Seth Hendrickson commented on SPARK-16404: -- cc [~dbtsai] I looked in to using th

[jira] [Created] (SPARK-18036) Decision Trees do not handle edge cases

2016-10-20 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18036: Summary: Decision Trees do not handle edge cases Key: SPARK-18036 URL: https://issues.apache.org/jira/browse/SPARK-18036 Project: Spark Issue Type: B

[jira] [Created] (SPARK-18060) Avoid unnecessary standardization in multinomial logistic regression training

2016-10-21 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18060: Summary: Avoid unnecessary standardization in multinomial logistic regression training Key: SPARK-18060 URL: https://issues.apache.org/jira/browse/SPARK-18060

[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2016-10-31 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15623661#comment-15623661 ] Seth Hendrickson commented on SPARK-15784: -- This seems like it fits the framewor

[jira] [Created] (SPARK-18253) ML Instrumentation logging requires too much manual implementation

2016-11-03 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18253: Summary: ML Instrumentation logging requires too much manual implementation Key: SPARK-18253 URL: https://issues.apache.org/jira/browse/SPARK-18253 Project: S

[jira] [Commented] (SPARK-17138) Python API for multinomial logistic regression

2016-11-03 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15633627#comment-15633627 ] Seth Hendrickson commented on SPARK-17138: -- [~yanboliang] Can you mark this as r

[jira] [Comment Edited] (SPARK-15581) MLlib 2.1 Roadmap

2016-11-03 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15634354#comment-15634354 ] Seth Hendrickson edited comment on SPARK-15581 at 11/3/16 9:28 PM:

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-11-03 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15634354#comment-15634354 ] Seth Hendrickson commented on SPARK-15581: -- I think the points you mention are v

[jira] [Commented] (SPARK-18081) Locality Sensitive Hashing (LSH) User Guide

2016-11-04 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15636757#comment-15636757 ] Seth Hendrickson commented on SPARK-18081: -- [~yunn] Do you have a status update

[jira] [Commented] (SPARK-18081) Locality Sensitive Hashing (LSH) User Guide

2016-11-04 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15637424#comment-15637424 ] Seth Hendrickson commented on SPARK-18081: -- No worries, just wanted to check in

[jira] [Created] (SPARK-18276) Some ML training summaries are not copied when {{copy()}} is called.

2016-11-04 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18276: Summary: Some ML training summaries are not copied when {{copy()}} is called. Key: SPARK-18276 URL: https://issues.apache.org/jira/browse/SPARK-18276 Project:

[jira] [Created] (SPARK-18282) Add model summaries for Python GMM and BisectingKMeans

2016-11-04 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18282: Summary: Add model summaries for Python GMM and BisectingKMeans Key: SPARK-18282 URL: https://issues.apache.org/jira/browse/SPARK-18282 Project: Spark

[jira] [Commented] (SPARK-18316) Spark MLlib, GraphX 2.1 QA umbrella

2016-11-07 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15645454#comment-15645454 ] Seth Hendrickson commented on SPARK-18316: -- Much appreciated [~josephkb]! > Spa

[jira] [Commented] (SPARK-18321) ML 2.1 QA: API: Java compatibility, docs

2016-11-08 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15648529#comment-15648529 ] Seth Hendrickson commented on SPARK-18321: -- I've taken a look at the new LSH add

[jira] [Updated] (SPARK-18366) Add handleInvalid to Pyspark for QuantileDiscretizer and Bucketizer

2016-11-08 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seth Hendrickson updated SPARK-18366: - Component/s: PySpark ML > Add handleInvalid to Pyspark for QuantileDiscr

[jira] [Created] (SPARK-18366) Add handleInvalid to Pyspark for QuantileDiscretizer and Bucketizer

2016-11-08 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18366: Summary: Add handleInvalid to Pyspark for QuantileDiscretizer and Bucketizer Key: SPARK-18366 URL: https://issues.apache.org/jira/browse/SPARK-18366 Project:

[jira] [Created] (SPARK-18369) Deprecate runs in Pyspark mllib KMeans

2016-11-08 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18369: Summary: Deprecate runs in Pyspark mllib KMeans Key: SPARK-18369 URL: https://issues.apache.org/jira/browse/SPARK-18369 Project: Spark Issue Type: Im

[jira] [Commented] (SPARK-18320) ML 2.1 QA: API: Python API coverage

2016-11-08 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15649171#comment-15649171 ] Seth Hendrickson commented on SPARK-18320: -- I scanned through the {{@Since("2.1.

[jira] [Comment Edited] (SPARK-18320) ML 2.1 QA: API: Python API coverage

2016-11-08 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15649171#comment-15649171 ] Seth Hendrickson edited comment on SPARK-18320 at 11/8/16 11:23 PM: ---

[jira] [Resolved] (SPARK-18369) Deprecate runs in Pyspark mllib KMeans

2016-11-10 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seth Hendrickson resolved SPARK-18369. -- Resolution: Not A Problem > Deprecate runs in Pyspark mllib KMeans > --

[jira] [Commented] (SPARK-18369) Deprecate runs in Pyspark mllib KMeans

2016-11-10 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15654450#comment-15654450 ] Seth Hendrickson commented on SPARK-18369: -- There is a deprecation note for Pyth

[jira] [Commented] (SPARK-18321) ML 2.1 QA: API: Java compatibility, docs

2016-11-10 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15654524#comment-15654524 ] Seth Hendrickson commented on SPARK-18321: -- In the current Spark Java docs here:

[jira] [Commented] (SPARK-18392) LSH API, algorithm, and documentation follow-ups

2016-11-11 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15658063#comment-15658063 ] Seth Hendrickson commented on SPARK-18392: -- [~josephkb] I wasn't sure where to a

[jira] [Commented] (SPARK-18321) ML 2.1 QA: API: Java compatibility, docs

2016-11-11 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15658288#comment-15658288 ] Seth Hendrickson commented on SPARK-18321: -- So I generated the API docs between

[jira] [Commented] (SPARK-18392) LSH API, algorithm, and documentation follow-ups

2016-11-14 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15665233#comment-15665233 ] Seth Hendrickson commented on SPARK-18392: -- Thank you for clarifying, I see it n

[jira] [Created] (SPARK-18456) Use matrix abstraction for LogisitRegression coefficients during training

2016-11-15 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18456: Summary: Use matrix abstraction for LogisitRegression coefficients during training Key: SPARK-18456 URL: https://issues.apache.org/jira/browse/SPARK-18456 Pro

[jira] [Updated] (SPARK-9478) Add sample weights to Random Forest

2016-11-17 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seth Hendrickson updated SPARK-9478: Summary: Add sample weights to Random Forest (was: Add class weights to Random Forest) > A

[jira] [Commented] (SPARK-9478) Add sample weights to Random Forest

2016-11-17 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15675770#comment-15675770 ] Seth Hendrickson commented on SPARK-9478: - I'm going to work on submitting a PR fo

[jira] [Commented] (SPARK-17772) Add helper testing methods for instance weighting

2016-11-22 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15687500#comment-15687500 ] Seth Hendrickson commented on SPARK-17772: -- Please do, thanks! > Add helper tes

[jira] [Commented] (SPARK-8971) Support balanced class labels when splitting train/cross validation sets

2016-04-27 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15260399#comment-15260399 ] Seth Hendrickson commented on SPARK-8971: - I've got an improved version of the ori

[jira] [Commented] (SPARK-7129) Add generic boosting algorithm to spark.ml

2016-04-28 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15262991#comment-15262991 ] Seth Hendrickson commented on SPARK-7129: - Creating this initially as a Spark pack

[jira] [Commented] (SPARK-8971) Support balanced class labels when splitting train/cross validation sets

2016-04-29 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15264224#comment-15264224 ] Seth Hendrickson commented on SPARK-8971: - I meant label column. Sorry for the con

[jira] [Commented] (SPARK-15181) Python API for Generalized Linear Regression Summary

2016-05-06 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15274187#comment-15274187 ] Seth Hendrickson commented on SPARK-15181: -- I will submit a PR for this soon. >

[jira] [Created] (SPARK-15181) Python API for Generalized Linear Regression Summary

2016-05-06 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-15181: Summary: Python API for Generalized Linear Regression Summary Key: SPARK-15181 URL: https://issues.apache.org/jira/browse/SPARK-15181 Project: Spark

[jira] [Created] (SPARK-15186) Add user guide for Generalized Linear Regression.

2016-05-06 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-15186: Summary: Add user guide for Generalized Linear Regression. Key: SPARK-15186 URL: https://issues.apache.org/jira/browse/SPARK-15186 Project: Spark Iss

[jira] [Commented] (SPARK-15186) Add user guide for Generalized Linear Regression.

2016-05-06 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15274527#comment-15274527 ] Seth Hendrickson commented on SPARK-15186: -- I will work on this. > Add user gui

[jira] [Updated] (SPARK-15186) Add user guide for Generalized Linear Regression.

2016-05-06 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seth Hendrickson updated SPARK-15186: - Priority: Minor (was: Major) > Add user guide for Generalized Linear Regression. > -

[jira] [Commented] (SPARK-14815) ML, Graph, R 2.0 QA: Update user guide for new features & APIs

2016-05-06 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15274531#comment-15274531 ] Seth Hendrickson commented on SPARK-14815: -- SGTM > ML, Graph, R 2.0 QA: Update

[jira] [Commented] (SPARK-15186) Add user guide for Generalized Linear Regression.

2016-05-09 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15277293#comment-15277293 ] Seth Hendrickson commented on SPARK-15186: -- I have a PR for this once [SPARK-14

[jira] [Commented] (SPARK-15243) Binarizer.explainParam(u"...") raises ValueError

2016-05-10 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15279091#comment-15279091 ] Seth Hendrickson commented on SPARK-15243: -- I'll submit a PR shortly. > Binariz

[jira] [Commented] (SPARK-15243) Binarizer.explainParam(u"...") raises ValueError

2016-05-11 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15280227#comment-15280227 ] Seth Hendrickson commented on SPARK-15243: -- Thanks, I'll take a look at those!

[jira] [Commented] (SPARK-15181) Python API for Generalized Linear Regression Summary

2016-05-11 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15280274#comment-15280274 ] Seth Hendrickson commented on SPARK-15181: -- Thanks, I thought I had searched for

[jira] [Created] (SPARK-15394) ML user guide typos and grammar audit

2016-05-18 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-15394: Summary: ML user guide typos and grammar audit Key: SPARK-15394 URL: https://issues.apache.org/jira/browse/SPARK-15394 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7159) Support multiclass logistic regression in spark.ml

2016-05-20 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15294372#comment-15294372 ] Seth Hendrickson commented on SPARK-7159: - [~dbtsai][~josephkb] I'd like to take t

[jira] [Comment Edited] (SPARK-7159) Support multiclass logistic regression in spark.ml

2016-05-20 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15294372#comment-15294372 ] Seth Hendrickson edited comment on SPARK-7159 at 5/20/16 10:17 PM: -

[jira] [Commented] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-03 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16238137#comment-16238137 ] Seth Hendrickson commented on SPARK-22433: -- The main problem I see is that we pu

[jira] [Created] (SPARK-22461) Move Spark ML model summaries into a dedicated package

2017-11-06 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-22461: Summary: Move Spark ML model summaries into a dedicated package Key: SPARK-22461 URL: https://issues.apache.org/jira/browse/SPARK-22461 Project: Spark

[jira] [Commented] (SPARK-23704) PySpark access of individual trees in random forest is slow

2018-06-22 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16520832#comment-16520832 ] Seth Hendrickson commented on SPARK-23704: -- Instead of {code:java} model.trees[

[jira] [Commented] (SPARK-24579) SPIP: Standardize Optimized Data Exchange between Spark and DL/AI frameworks

2018-07-02 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16530633#comment-16530633 ] Seth Hendrickson commented on SPARK-24579: -- Hmm... Am I the only one who cannot

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2017-01-11 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15819062#comment-15819062 ] Seth Hendrickson commented on SPARK-17136: -- I'm interested in working on this ta

[jira] [Created] (SPARK-19313) GaussianMixture throws cryptic error when number of features is too high

2017-01-20 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-19313: Summary: GaussianMixture throws cryptic error when number of features is too high Key: SPARK-19313 URL: https://issues.apache.org/jira/browse/SPARK-19313 Proj

[jira] [Commented] (SPARK-17139) Add model summary for MultinomialLogisticRegression

2017-02-07 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15857230#comment-15857230 ] Seth Hendrickson commented on SPARK-17139: -- [~josephkb] Is [this more or less wh

[jira] [Commented] (SPARK-17139) Add model summary for MultinomialLogisticRegression

2017-02-08 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15858438#comment-15858438 ] Seth Hendrickson commented on SPARK-17139: -- Seems like a reasonable way to solve

[jira] [Created] (SPARK-19591) Add sample weights to decision trees

2017-02-13 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-19591: Summary: Add sample weights to decision trees Key: SPARK-19591 URL: https://issues.apache.org/jira/browse/SPARK-19591 Project: Spark Issue Type: Sub-

[jira] [Commented] (SPARK-9478) Add sample weights to Random Forest

2017-02-13 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15865079#comment-15865079 ] Seth Hendrickson commented on SPARK-9478: - [~josephkb] Done. Thanks for your feedb

[jira] [Commented] (SPARK-18392) LSH API, algorithm, and documentation follow-ups

2017-02-14 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15867049#comment-15867049 ] Seth Hendrickson commented on SPARK-18392: -- I would pretty strongly prefer to fo

[jira] [Created] (SPARK-19745) SVCAggregator serializes coefficients

2017-02-26 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-19745: Summary: SVCAggregator serializes coefficients Key: SPARK-19745 URL: https://issues.apache.org/jira/browse/SPARK-19745 Project: Spark Issue Type: Imp

[jira] [Created] (SPARK-19746) LogisticAggregator is inefficient in indexing

2017-02-26 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-19746: Summary: LogisticAggregator is inefficient in indexing Key: SPARK-19746 URL: https://issues.apache.org/jira/browse/SPARK-19746 Project: Spark Issue T

[jira] [Created] (SPARK-19747) Consolidate code in ML aggregators

2017-02-26 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-19747: Summary: Consolidate code in ML aggregators Key: SPARK-19747 URL: https://issues.apache.org/jira/browse/SPARK-19747 Project: Spark Issue Type: Improv

[jira] [Commented] (SPARK-19747) Consolidate code in ML aggregators

2017-02-26 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15885185#comment-15885185 ] Seth Hendrickson commented on SPARK-19747: -- BTW, I have a rough prototype which

[jira] [Created] (SPARK-19762) Implement aggregator/loss function hierarchy and apply to linear regression

2017-02-27 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-19762: Summary: Implement aggregator/loss function hierarchy and apply to linear regression Key: SPARK-19762 URL: https://issues.apache.org/jira/browse/SPARK-19762 P

[jira] [Commented] (SPARK-17471) Add compressed method for Matrix class

2016-09-09 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477841#comment-15477841 ] Seth Hendrickson commented on SPARK-17471: -- [~yanboliang] I guess it can be seen

[jira] [Created] (SPARK-17476) Proper handling for unseen labels in logistic regression training.

2016-09-09 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-17476: Summary: Proper handling for unseen labels in logistic regression training. Key: SPARK-17476 URL: https://issues.apache.org/jira/browse/SPARK-17476 Project: S

[jira] [Commented] (SPARK-17471) Add compressed method for Matrix class

2016-09-12 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15484775#comment-15484775 ] Seth Hendrickson commented on SPARK-17471: -- [~yanboliang] Do you have any update

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-09-21 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15510198#comment-15510198 ] Seth Hendrickson commented on SPARK-17134: -- Hmm, it would be nice to see this vs

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-09-22 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515529#comment-15515529 ] Seth Hendrickson commented on SPARK-17134: -- This makes sense. In my initial test

[jira] [Comment Edited] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-09-22 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515529#comment-15515529 ] Seth Hendrickson edited comment on SPARK-17134 at 9/23/16 6:09 AM:

[jira] [Created] (SPARK-17748) One-pass algorithm for linear regression with L1 and elastic-net penalties

2016-09-30 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-17748: Summary: One-pass algorithm for linear regression with L1 and elastic-net penalties Key: SPARK-17748 URL: https://issues.apache.org/jira/browse/SPARK-17748 Pr

[jira] [Commented] (SPARK-17748) One-pass algorithm for linear regression with L1 and elastic-net penalties

2016-09-30 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15536781#comment-15536781 ] Seth Hendrickson commented on SPARK-17748: -- I am working on this currently. The

[jira] [Comment Edited] (SPARK-17748) One-pass algorithm for linear regression with L1 and elastic-net penalties

2016-09-30 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15536781#comment-15536781 ] Seth Hendrickson edited comment on SPARK-17748 at 9/30/16 7:16 PM:

[jira] [Comment Edited] (SPARK-17748) One-pass algorithm for linear regression with L1 and elastic-net penalties

2016-09-30 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15536781#comment-15536781 ] Seth Hendrickson edited comment on SPARK-17748 at 9/30/16 7:16 PM:

[jira] [Created] (SPARK-17772) Add helper testing methods for instance weighting

2016-10-03 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-17772: Summary: Add helper testing methods for instance weighting Key: SPARK-17772 URL: https://issues.apache.org/jira/browse/SPARK-17772 Project: Spark Iss

[jira] [Created] (SPARK-17789) Don't force users to set k for KMeans if initial model is set

2016-10-05 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-17789: Summary: Don't force users to set k for KMeans if initial model is set Key: SPARK-17789 URL: https://issues.apache.org/jira/browse/SPARK-17789 Project: Spark

[jira] [Commented] (SPARK-17792) L-BFGS solver for linear regression does not accept general numeric label column types

2016-10-05 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15550249#comment-15550249 ] Seth Hendrickson commented on SPARK-17792: -- I'll have a PR shortly. > L-BFGS so

[jira] [Created] (SPARK-17792) L-BFGS solver for linear regression does not accept general numeric label column types

2016-10-05 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-17792: Summary: L-BFGS solver for linear regression does not accept general numeric label column types Key: SPARK-17792 URL: https://issues.apache.org/jira/browse/SPARK-17792

[jira] [Updated] (SPARK-17789) Don't force users to set k for KMeans if initial model is set

2016-10-05 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seth Hendrickson updated SPARK-17789: - Description: In the initial implementation of initalModel, we allow users to set the init

[jira] [Commented] (SPARK-17789) Don't force users to set k for KMeans if initial model is set

2016-10-05 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15550601#comment-15550601 ] Seth Hendrickson commented on SPARK-17789: -- When the model is fit, the initial m

[jira] [Commented] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1385#comment-1385 ] Seth Hendrickson commented on SPARK-17824: -- [~yanboliang] Can you please post yo

[jira] [Comment Edited] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1385#comment-1385 ] Seth Hendrickson edited comment on SPARK-17824 at 10/7/16 3:42 PM:

[jira] [Commented] (SPARK-17824) QR solver for WeightedLeastSquares

2016-10-07 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15556237#comment-15556237 ] Seth Hendrickson commented on SPARK-17824: -- Thank you for clarifying > QR solve

[jira] [Resolved] (SPARK-17140) Add initial model to MultinomialLogisticRegression

2016-10-10 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seth Hendrickson resolved SPARK-17140. -- Resolution: Invalid MultinomialLogisticRegression was elminated in SPARK-[17163|https:

[jira] [Commented] (SPARK-17139) Add model summary for MultinomialLogisticRegression

2016-10-10 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15563729#comment-15563729 ] Seth Hendrickson commented on SPARK-17139: -- [~WeichenXu123] Status? > Add model

[jira] [Commented] (SPARK-9478) Add class weights to Random Forest

2016-10-10 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15563919#comment-15563919 ] Seth Hendrickson commented on SPARK-9478: - I'm going to revive this, and hopefully

[jira] [Commented] (SPARK-17772) Add helper testing methods for instance weighting

2016-10-10 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15564174#comment-15564174 ] Seth Hendrickson commented on SPARK-17772: -- I'm working on this. > Add helper t

[jira] [Commented] (SPARK-17906) MulticlassClassificationEvaluator support target label

2016-10-13 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15572154#comment-15572154 ] Seth Hendrickson commented on SPARK-17906: -- We are adding model summaries that w

[jira] [Created] (SPARK-17941) Logistic regression test suites should use weights when comparing to glmnet

2016-10-14 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-17941: Summary: Logistic regression test suites should use weights when comparing to glmnet Key: SPARK-17941 URL: https://issues.apache.org/jira/browse/SPARK-17941 P

[jira] [Created] (SPARK-18019) Log instrumentation in GBTs

2016-10-19 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-18019: Summary: Log instrumentation in GBTs Key: SPARK-18019 URL: https://issues.apache.org/jira/browse/SPARK-18019 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-23437) [ML] Distributed Gaussian Process Regression for MLlib

2018-02-16 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16368109#comment-16368109 ] Seth Hendrickson commented on SPARK-23437: -- TBH, this seems like a pretty reason

  1   2   3   >