[jira] [Updated] (SPARK-7262) LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7262: --- Issue Type: New Feature (was: Bug) > LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML packag

[jira] [Created] (SPARK-7262) LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-04-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7262: -- Summary: LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package Key: SPARK-7262 URL: https://issues.apache.org/jira/browse/SPARK-7262 Project: Spark

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Issue Type: Improvement (was: Documentation) > Added mathematical derivation in comment and compressed the mo

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Summary: Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

[jira] [Created] (SPARK-7222) Added mathematical derivation in comment to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7222: -- Summary: Added mathematical derivation in comment to LinearRegression with ElasticNet Key: SPARK-7222 URL: https://issues.apache.org/jira/browse/SPARK-7222 Project: Spark

[jira] [Updated] (SPARK-7222) Added mathematical derivation in comment and compressed the model to LinearRegression with ElasticNet

2015-04-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7222: --- Description: Added detailed mathematical derivation of how scaling and LeastSquaresAggregator work. Also refac

[jira] [Closed] (SPARK-7191) SharedParamsCodeGen doesn't import org.apache.spark.util.Utils

2015-04-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-7191. -- Resolution: Not A Problem sorry, it's my rebase issue. not a bug. > SharedParamsCodeGen doesn't import org.apac

[jira] [Created] (SPARK-7191) SharedParamsCodeGen doesn't import org.apache.spark.util.Utils

2015-04-28 Thread DB Tsai (JIRA)
DB Tsai created SPARK-7191: -- Summary: SharedParamsCodeGen doesn't import org.apache.spark.util.Utils Key: SPARK-7191 URL: https://issues.apache.org/jira/browse/SPARK-7191 Project: Spark Issue Type:

[jira] [Updated] (SPARK-7191) SharedParamsCodeGen doesn't import org.apache.spark.util.Utils

2015-04-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7191: --- Description: When we run `build/sbt "mllib/runMain org.apache.spark.ml.param.shared.SharedParamsCodeGen"`, the

[jira] [Commented] (SPARK-2505) Weighted Regularizer

2015-04-15 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14497103#comment-14497103 ] DB Tsai commented on SPARK-2505: For example, in GLMNET package, it allows users to regula

[jira] [Comment Edited] (SPARK-6683) Handling feature scaling properly for GLMs

2015-04-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395147#comment-14395147 ] DB Tsai edited comment on SPARK-6683 at 4/3/15 10:11 PM: - PS, do w

[jira] [Commented] (SPARK-6683) Handling feature scaling properly for GLMs

2015-04-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395147#comment-14395147 ] DB Tsai commented on SPARK-6683: PS, do we still want to make it work properly with gradie

[jira] [Commented] (SPARK-6683) Handling feature scaling properly for GLMs

2015-04-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395131#comment-14395131 ] DB Tsai commented on SPARK-6683: The squared error will have slightly more work than logis

[jira] [Comment Edited] (SPARK-6683) Handling feature scaling properly for GLMs

2015-04-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395131#comment-14395131 ] DB Tsai edited comment on SPARK-6683 at 4/3/15 9:44 PM: The square

[jira] [Commented] (SPARK-6683) Handling feature scaling properly for GLMs

2015-04-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395090#comment-14395090 ] DB Tsai commented on SPARK-6683: I think we should hide the scaling api, and the bottom li

[jira] [Commented] (SPARK-6683) Handling feature scaling properly for GLMs

2015-04-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395088#comment-14395088 ] DB Tsai commented on SPARK-6683: I have this implemented in our lab including handling the

[jira] [Created] (SPARK-6141) Upgrade Breeze to 0.11 to fix convergence bug

2015-03-03 Thread DB Tsai (JIRA)
DB Tsai created SPARK-6141: -- Summary: Upgrade Breeze to 0.11 to fix convergence bug Key: SPARK-6141 URL: https://issues.apache.org/jira/browse/SPARK-6141 Project: Spark Issue Type: Bug Com

[jira] [Created] (SPARK-5253) LinearRegression with L1/L2 (elastic net) using OWLQN in new ML pacakge

2015-01-14 Thread DB Tsai (JIRA)
DB Tsai created SPARK-5253: -- Summary: LinearRegression with L1/L2 (elastic net) using OWLQN in new ML pacakge Key: SPARK-5253 URL: https://issues.apache.org/jira/browse/SPARK-5253 Project: Spark Is

[jira] [Commented] (SPARK-5207) StandardScalerModel mean and variance re-use

2015-01-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14274802#comment-14274802 ] DB Tsai commented on SPARK-5207: [~mengxr]'s idea sounds great for me. Specifically, let's

[jira] [Commented] (SPARK-5128) Add stable log1pExp impl

2015-01-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267392#comment-14267392 ] DB Tsai commented on SPARK-5128: https://github.com/apache/spark/pull/3915/commits > Add

[jira] [Commented] (SPARK-5127) Fixed overflow when there are outliers in data in Logistic Regression

2015-01-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14267390#comment-14267390 ] DB Tsai commented on SPARK-5127: Not an issue in binary logistic regression. Problem only

[jira] [Closed] (SPARK-5127) Fixed overflow when there are outliers in data in Logistic Regression

2015-01-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-5127. -- Resolution: Not a Problem > Fixed overflow when there are outliers in data in Logistic Regression >

[jira] [Updated] (SPARK-5127) Fixed overflow when there are outliers in data in Logistic Regression

2015-01-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-5127: --- Description: gradientMultiplier = (1.0 / (1.0 + math.exp(margin))) - label However, the first part of gradien

[jira] [Updated] (SPARK-5127) Fixed overflow when there are outliers in data in Logistic Regression

2015-01-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-5127: --- Description: gradientMultiplier = (1.0 / (1.0 + math.exp(margin))) - label However, the first part of gradien

[jira] [Created] (SPARK-5127) Fixed overflow when there are outliers in data in Logistic Regression

2015-01-06 Thread DB Tsai (JIRA)
DB Tsai created SPARK-5127: -- Summary: Fixed overflow when there are outliers in data in Logistic Regression Key: SPARK-5127 URL: https://issues.apache.org/jira/browse/SPARK-5127 Project: Spark Issu

[jira] [Created] (SPARK-4972) Updated the scala doc for lasso and ridge regression for the change of LeastSquaresGradient

2014-12-26 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4972: -- Summary: Updated the scala doc for lasso and ridge regression for the change of LeastSquaresGradient Key: SPARK-4972 URL: https://issues.apache.org/jira/browse/SPARK-4972 Project

[jira] [Commented] (SPARK-4907) Inconsistent loss and gradient in LeastSquaresGradient compared with R

2014-12-25 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14258981#comment-14258981 ] DB Tsai commented on SPARK-4907: [~sowen] It seems that the existing document has 1/2 fact

[jira] [Created] (SPARK-4907) Inconsistent loss and gradient in LeastSquaresGradient compared with R

2014-12-19 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4907: -- Summary: Inconsistent loss and gradient in LeastSquaresGradient compared with R Key: SPARK-4907 URL: https://issues.apache.org/jira/browse/SPARK-4907 Project: Spark Iss

[jira] [Created] (SPARK-4887) Fix a bad unittest in LogisticRegressionSuite

2014-12-18 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4887: -- Summary: Fix a bad unittest in LogisticRegressionSuite Key: SPARK-4887 URL: https://issues.apache.org/jira/browse/SPARK-4887 Project: Spark Issue Type: Bug Com

[jira] [Created] (SPARK-4717) Optimize BLAS library to avoid de-reference multiple times in loop

2014-12-03 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4717: -- Summary: Optimize BLAS library to avoid de-reference multiple times in loop Key: SPARK-4717 URL: https://issues.apache.org/jira/browse/SPARK-4717 Project: Spark Issue T

[jira] [Updated] (SPARK-4708) Make k-mean runs two/three times faster with dense/sparse sample

2014-12-02 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-4708: --- Summary: Make k-mean runs two/three times faster with dense/sparse sample (was: k-mean runs two/three times f

[jira] [Created] (SPARK-4708) k-mean runs two/three times faster with dense/sparse sample

2014-12-02 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4708: -- Summary: k-mean runs two/three times faster with dense/sparse sample Key: SPARK-4708 URL: https://issues.apache.org/jira/browse/SPARK-4708 Project: Spark Issue Type: Im

[jira] [Updated] (SPARK-4708) Make k-mean runs two/three times faster with dense/sparse sample

2014-12-02 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-4708: --- Component/s: MLlib > Make k-mean runs two/three times faster with dense/sparse sample > --

[jira] [Created] (SPARK-4611) Implement the efficient vector norm

2014-11-25 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4611: -- Summary: Implement the efficient vector norm Key: SPARK-4611 URL: https://issues.apache.org/jira/browse/SPARK-4611 Project: Spark Issue Type: Improvement Compo

[jira] [Created] (SPARK-4596) Refactorize Normalizer to make code cleaner

2014-11-24 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4596: -- Summary: Refactorize Normalizer to make code cleaner Key: SPARK-4596 URL: https://issues.apache.org/jira/browse/SPARK-4596 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-4581) Refactorize StandardScaler to improve the transformation performance

2014-11-24 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4581: -- Summary: Refactorize StandardScaler to improve the transformation performance Key: SPARK-4581 URL: https://issues.apache.org/jira/browse/SPARK-4581 Project: Spark Issue

[jira] [Updated] (SPARK-4431) Implement efficient activeIterator for dense and sparse vector

2014-11-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-4431: --- Description: Previously, we were using Breeze's activeIterator to access the non-zero elements in dense/spar

[jira] [Created] (SPARK-4431) Implement efficient activeIterator for dense and sparse vector

2014-11-15 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4431: -- Summary: Implement efficient activeIterator for dense and sparse vector Key: SPARK-4431 URL: https://issues.apache.org/jira/browse/SPARK-4431 Project: Spark Issue Type:

[jira] [Created] (SPARK-4129) Performance tuning in MultivariateOnlineSummarizer

2014-10-28 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4129: -- Summary: Performance tuning in MultivariateOnlineSummarizer Key: SPARK-4129 URL: https://issues.apache.org/jira/browse/SPARK-4129 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-2493) SBT gen-idea doesn't generate correct Intellij project

2014-10-13 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14170080#comment-14170080 ] DB Tsai commented on SPARK-2493: sbt gen-idea will add extra meso dependency other than me

[jira] [Closed] (SPARK-2493) SBT gen-idea doesn't generate correct Intellij project

2014-10-13 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-2493. -- Resolution: Won't Fix > SBT gen-idea doesn't generate correct Intellij project > ---

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-10-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14163458#comment-14163458 ] DB Tsai commented on SPARK-3630: I think there is something else going wrong in the curren

[jira] [Commented] (SPARK-1239) Don't fetch all map output statuses at each reducer during shuffles

2014-10-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14163423#comment-14163423 ] DB Tsai commented on SPARK-1239: +1, we run into this issue as well. > Don't fetch all ma

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-10-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14163391#comment-14163391 ] DB Tsai commented on SPARK-3630: I think there are some issues in the shuffle manger with

[jira] [Created] (SPARK-3832) Upgrade Breeze dependency to 0.10

2014-10-07 Thread DB Tsai (JIRA)
DB Tsai created SPARK-3832: -- Summary: Upgrade Breeze dependency to 0.10 Key: SPARK-3832 URL: https://issues.apache.org/jira/browse/SPARK-3832 Project: Spark Issue Type: Task Components: ML

[jira] [Comment Edited] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-10-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161900#comment-14161900 ] DB Tsai edited comment on SPARK-3630 at 10/7/14 2:07 PM: - We also

[jira] [Comment Edited] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-10-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161900#comment-14161900 ] DB Tsai edited comment on SPARK-3630 at 10/7/14 2:08 PM: - We also

[jira] [Comment Edited] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-10-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161900#comment-14161900 ] DB Tsai edited comment on SPARK-3630 at 10/7/14 2:07 PM: - We also

[jira] [Commented] (SPARK-3630) Identify cause of Kryo+Snappy PARSING_ERROR

2014-10-07 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161900#comment-14161900 ] DB Tsai commented on SPARK-3630: We also see similar issue when we perform map -> reduceBy

[jira] [Closed] (SPARK-3317) The loss of regularization in Updater should use the oldWeights

2014-08-31 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-3317. -- Resolution: Won't Fix > The loss of regularization in Updater should use the oldWeights > --

[jira] [Created] (SPARK-3317) The loss of regularization in Updater should use the oldWeights

2014-08-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-3317: -- Summary: The loss of regularization in Updater should use the oldWeights Key: SPARK-3317 URL: https://issues.apache.org/jira/browse/SPARK-3317 Project: Spark Issue Type

[jira] [Updated] (SPARK-2979) Improve the convergence rate by minimizing the condition number in LOR with LBFGS

2014-08-11 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-2979: --- Summary: Improve the convergence rate by minimizing the condition number in LOR with LBFGS (was: Improve the

[jira] [Created] (SPARK-2979) Improve the convergence rate by minimize the condition number in LOR with LBFGS

2014-08-11 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2979: -- Summary: Improve the convergence rate by minimize the condition number in LOR with LBFGS Key: SPARK-2979 URL: https://issues.apache.org/jira/browse/SPARK-2979 Project: Spark

[jira] [Created] (SPARK-2934) Adding LogisticRegressionWithLBFGS for training with LBFGS Optimizer

2014-08-08 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2934: -- Summary: Adding LogisticRegressionWithLBFGS for training with LBFGS Optimizer Key: SPARK-2934 URL: https://issues.apache.org/jira/browse/SPARK-2934 Project: Spark Iss

[jira] [Commented] (SPARK-2599) almostEquals mllib.util.TestingUtils does not behave as expected when comparing against 0.0

2014-07-22 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070929#comment-14070929 ] DB Tsai commented on SPARK-2599: I'm the original guy implementing `almostEquals` for my u

[jira] [Created] (SPARK-2505) Weighted Regularizer

2014-07-15 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2505: -- Summary: Weighted Regularizer Key: SPARK-2505 URL: https://issues.apache.org/jira/browse/SPARK-2505 Project: Spark Issue Type: New Feature Components: MLlib

[jira] [Closed] (SPARK-1451) Multinomial Logistic Regression Support

2014-07-15 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-1451. -- Resolution: Duplicate Duplicate of SPARK-2309 > Multinomial Logistic Regression Support >

[jira] [Updated] (SPARK-2309) Generalize the binary logistic regression into multinomial logistic regression

2014-07-15 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-2309: --- Affects Version/s: 1.1.0 > Generalize the binary logistic regression into multinomial logistic regression > -

[jira] [Updated] (SPARK-2479) Comparing floating-point numbers using relative error in UnitTests

2014-07-15 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-2479: --- Description: Floating point math is not exact, and most floating-point numbers end up being slightly impreci

[jira] [Created] (SPARK-2493) SBT gen-idea doesn't generate correct Intellij project

2014-07-15 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2493: -- Summary: SBT gen-idea doesn't generate correct Intellij project Key: SPARK-2493 URL: https://issues.apache.org/jira/browse/SPARK-2493 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-2479) Comparing floating-point numbers using relative error in UnitTests

2014-07-14 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-2479: --- Description: Due to rounding errors, most floating-point numbers end up being slightly imprecise. As long as

[jira] [Created] (SPARK-2479) Comparing floating-point numbers using relative error in UnitTests

2014-07-14 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2479: -- Summary: Comparing floating-point numbers using relative error in UnitTests Key: SPARK-2479 URL: https://issues.apache.org/jira/browse/SPARK-2479 Project: Spark Issue T

[jira] [Created] (SPARK-2477) Using appendBias for adding intercept in GeneralizedLinearAlgorithm

2014-07-14 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2477: -- Summary: Using appendBias for adding intercept in GeneralizedLinearAlgorithm Key: SPARK-2477 URL: https://issues.apache.org/jira/browse/SPARK-2477 Project: Spark Issue

[jira] [Closed] (SPARK-2413) Upgrade junit_xml_listener to 0.5.1

2014-07-11 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-2413. -- > Upgrade junit_xml_listener to 0.5.1 > --- > > Key: SPARK-2413 >

[jira] [Closed] (SPARK-2281) Simplify the duplicate code in Gradient.scala

2014-07-09 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-2281. -- Resolution: Not a Problem > Simplify the duplicate code in Gradient.scala > ---

[jira] [Created] (SPARK-2413) Upgrade junit_xml_listener to 0.5.1

2014-07-08 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2413: -- Summary: Upgrade junit_xml_listener to 0.5.1 Key: SPARK-2413 URL: https://issues.apache.org/jira/browse/SPARK-2413 Project: Spark Issue Type: Improvement Rep

[jira] [Created] (SPARK-2309) Generalize the binary logistic regression into multinomial logistic regression

2014-06-27 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2309: -- Summary: Generalize the binary logistic regression into multinomial logistic regression Key: SPARK-2309 URL: https://issues.apache.org/jira/browse/SPARK-2309 Project: Spark

[jira] [Updated] (SPARK-2309) Generalize the binary logistic regression into multinomial logistic regression

2014-06-27 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-2309: --- Description: Currently, there is no multi-class classifier in mllib. Logistic regression can be extended to

[jira] [Created] (SPARK-2281) Simplify the duplicate code in Gradient.scala

2014-06-25 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2281: -- Summary: Simplify the duplicate code in Gradient.scala Key: SPARK-2281 URL: https://issues.apache.org/jira/browse/SPARK-2281 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-2272) Feature scaling which standardizes the range of independent variables or features of data.

2014-06-24 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2272: -- Summary: Feature scaling which standardizes the range of independent variables or features of data. Key: SPARK-2272 URL: https://issues.apache.org/jira/browse/SPARK-2272 Project:

[jira] [Commented] (SPARK-2100) Allow users to disable Jetty Spark UI in local mode

2014-06-16 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14033068#comment-14033068 ] DB Tsai commented on SPARK-2100: Thanks. I think there are some dependency issue we have,

[jira] [Comment Edited] (SPARK-2100) Allow users to disable Jetty Spark UI in local mode

2014-06-16 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14033049#comment-14033049 ] DB Tsai edited comment on SPARK-2100 at 6/16/14 9:50 PM: - [~sowen]

[jira] [Commented] (SPARK-2100) Allow users to disable Jetty Spark UI in local mode

2014-06-16 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14033049#comment-14033049 ] DB Tsai commented on SPARK-2100: [~sowen] You are right. The servlet api is pulled by jett

[jira] [Commented] (SPARK-2100) Allow users to disable Jetty Spark UI in local mode

2014-06-16 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14032786#comment-14032786 ] DB Tsai commented on SPARK-2100: Jar file conflict. If we only include spark-core in our t

[jira] [Updated] (SPARK-2100) Allow users to disable Jetty Spark UI in local mode

2014-06-10 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-2100: --- Description: Since we want to use Spark hadoop APIs in local mode for design time to explore the first coupl

[jira] [Created] (SPARK-2100) Allow users to disable Jetty Spark UI in local mode

2014-06-10 Thread DB Tsai (JIRA)
DB Tsai created SPARK-2100: -- Summary: Allow users to disable Jetty Spark UI in local mode Key: SPARK-2100 URL: https://issues.apache.org/jira/browse/SPARK-2100 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-1969) Public available online summarizer for mean, variance, min, and max

2014-06-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-1969: --- Description: Basically, it moves the private ColumnStatisticsAggregator class from RowMatrix to public avail

[jira] [Updated] (SPARK-1969) Public available online summarizer for mean, variance, min, and max

2014-06-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-1969: --- Description: It basically moved the private ColumnStatisticsAggregator class from RowMatrix to public availa

[jira] [Updated] (SPARK-1969) Public available online summarizer for mean, variance, min, and max

2014-06-03 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-1969: --- Description: Basically, it moves the private ColumnStatisticsAggregator class from RowMatrix to public avail

[jira] [Created] (SPARK-1969) Online Summarizer for mean, variance, min, max, and quartile

2014-05-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-1969: -- Summary: Online Summarizer for mean, variance, min, max, and quartile Key: SPARK-1969 URL: https://issues.apache.org/jira/browse/SPARK-1969 Project: Spark Issue Type: N

[jira] [Commented] (SPARK-1870) Jars specified via --jars in spark-submit are not added to executor classpath for YARN

2014-05-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14002597#comment-14002597 ] DB Tsai commented on SPARK-1870: This is not Yarn issue, and it's classloader issue. It ha

[jira] [Updated] (SPARK-1516) Yarn Client should not call System.exit, should throw exception instead.

2014-04-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-1516: --- Assignee: (was: DB Tsai) > Yarn Client should not call System.exit, should throw exception instead. > ---

[jira] [Assigned] (SPARK-1516) Yarn Client should not call System.exit, should throw exception instead.

2014-04-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-1516: -- Assignee: DB Tsai > Yarn Client should not call System.exit, should throw exception instead. >

[jira] [Updated] (SPARK-1457) Change APIs for training algorithms to take optimizer as parameter

2014-04-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-1457: --- Assignee: (was: DB Tsai) > Change APIs for training algorithms to take optimizer as parameter >

[jira] [Assigned] (SPARK-1457) Change APIs for training algorithms to take optimizer as parameter

2014-04-21 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-1457: -- Assignee: DB Tsai > Change APIs for training algorithms to take optimizer as parameter > -

[jira] [Updated] (SPARK-1516) Yarn Client should not call System.exit, should throw exception instead.

2014-04-16 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-1516: --- Assignee: (was: DB Tsai) > Yarn Client should not call System.exit, should throw exception instead. > ---

[jira] [Created] (SPARK-1516) Yarn Client should not call System.exit, should throw exception instead.

2014-04-16 Thread DB Tsai (JIRA)
DB Tsai created SPARK-1516: -- Summary: Yarn Client should not call System.exit, should throw exception instead. Key: SPARK-1516 URL: https://issues.apache.org/jira/browse/SPARK-1516 Project: Spark I

[jira] [Created] (SPARK-1457) Change APIs for training algorithms to take optimizer as parameter

2014-04-09 Thread DB Tsai (JIRA)
DB Tsai created SPARK-1457: -- Summary: Change APIs for training algorithms to take optimizer as parameter Key: SPARK-1457 URL: https://issues.apache.org/jira/browse/SPARK-1457 Project: Spark Issue

[jira] [Created] (SPARK-1451) Multinomial Logistic Regression Support

2014-04-08 Thread DB Tsai (JIRA)
DB Tsai created SPARK-1451: -- Summary: Multinomial Logistic Regression Support Key: SPARK-1451 URL: https://issues.apache.org/jira/browse/SPARK-1451 Project: Spark Issue Type: New Feature C

[jira] [Commented] (SPARK-1157) L-BFGS Optimizer

2014-04-08 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13963803#comment-13963803 ] DB Tsai commented on SPARK-1157: PR: https://github.com/apache/spark/pull/353 > L-BFGS Op

[jira] [Updated] (SPARK-1157) L-BFGS Optimizer

2014-04-02 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-1157: --- Description: L-BFGS (Limited-memory BFGS) is an optimization algorithm like BFGS which uses an approximation

[jira] [Closed] (SPARK-1401) Use mapParitions instead of map to avoid creating expensive object in GradientDescent optimizer

2014-04-02 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai closed SPARK-1401. -- Resolution: Duplicate Fix Version/s: 0.9.1 > Use mapParitions instead of map to avoid creating expensive

[jira] [Commented] (SPARK-1401) Use mapParitions instead of map to avoid creating expensive object in GradientDescent optimizer

2014-04-02 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13958438#comment-13958438 ] DB Tsai commented on SPARK-1401: In SPARK-1212, this issue is addressed by aggregate. Gonn

[jira] [Created] (SPARK-1401) Use mapParitions instead of map to avoid creating expensive object in GradientDescent optimizer

2014-04-02 Thread DB Tsai (JIRA)
DB Tsai created SPARK-1401: -- Summary: Use mapParitions instead of map to avoid creating expensive object in GradientDescent optimizer Key: SPARK-1401 URL: https://issues.apache.org/jira/browse/SPARK-1401 Pro

<    2   3   4   5   6   7