[jira] [Comment Edited] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2017-02-21 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15876202#comment-15876202 ] RJ Nowling edited comment on SPARK-14174 at 2/21/17 4:08 PM: - I did the

[jira] [Commented] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2017-02-21 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15876202#comment-15876202 ] RJ Nowling commented on SPARK-14174: I did the initial implementation for SPARK-2308. re: the random

[jira] [Comment Edited] (SPARK-16365) Ideas for moving "mllib-local" forward

2016-07-13 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15375928#comment-15375928 ] RJ Nowling edited comment on SPARK-16365 at 7/13/16 10:40 PM: -- I'm really

[jira] [Commented] (SPARK-16365) Ideas for moving "mllib-local" forward

2016-07-13 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15375928#comment-15375928 ] RJ Nowling commented on SPARK-16365: I'm really looking forward to this feature. Spark is great where

[jira] [Commented] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2016-04-01 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15221806#comment-15221806 ] RJ Nowling commented on SPARK-14174: This is a dupe of [SPARK-2308] but that needs someone to take it

[jira] [Created] (SPARK-12450) Un-persist broadcasted variables in KMeans

2015-12-21 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-12450: -- Summary: Un-persist broadcasted variables in KMeans Key: SPARK-12450 URL: https://issues.apache.org/jira/browse/SPARK-12450 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12450) Un-persist broadcasted variables in KMeans

2015-12-21 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15066629#comment-15066629 ] RJ Nowling commented on SPARK-12450: File a PR here: [https://github.com/apache/spark/pull/10415] >

[jira] [Comment Edited] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056115#comment-15056115 ] RJ Nowling edited comment on SPARK-4816 at 12/14/15 3:42 PM: - I tested it

[jira] [Commented] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056168#comment-15056168 ] RJ Nowling commented on SPARK-4816: --- I think issue [SPARK-9507] fixed the issue. I checked out git

[jira] [Comment Edited] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056168#comment-15056168 ] RJ Nowling edited comment on SPARK-4816 at 12/14/15 4:16 PM: - I think

[jira] [Comment Edited] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056168#comment-15056168 ] RJ Nowling edited comment on SPARK-4816 at 12/14/15 4:19 PM: - I think

[jira] [Reopened] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] RJ Nowling reopened SPARK-4816: --- > Maven profile netlib-lgpl does not work > --- > >

[jira] [Commented] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056115#comment-15056115 ] RJ Nowling commented on SPARK-4816: --- I tested it again to make sure and ran into the same issue: {code}

[jira] [Comment Edited] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056168#comment-15056168 ] RJ Nowling edited comment on SPARK-4816 at 12/14/15 3:57 PM: - I think issue

[jira] [Comment Edited] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056168#comment-15056168 ] RJ Nowling edited comment on SPARK-4816 at 12/14/15 3:58 PM: - I think

[jira] [Commented] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056630#comment-15056630 ] RJ Nowling commented on SPARK-4816: --- Tried with Maven 3.3.9. I see no issues with the newer version of

[jira] [Commented] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056414#comment-15056414 ] RJ Nowling commented on SPARK-4816: --- Happy to try Maven 3.3.x and report back. Would certainly confirm

[jira] [Commented] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056368#comment-15056368 ] RJ Nowling commented on SPARK-4816: --- I want to push for two things (a) some sort of documentation for

[jira] [Commented] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056375#comment-15056375 ] RJ Nowling commented on SPARK-4816: --- Also, what version of Maven are you running? > Maven profile

[jira] [Commented] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15056675#comment-15056675 ] RJ Nowling commented on SPARK-4816: --- Agreed. Thanks! > Maven profile netlib-lgpl does not work >

[jira] [Comment Edited] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-10 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15051056#comment-15051056 ] RJ Nowling edited comment on SPARK-4816 at 12/10/15 2:52 PM: - Hi [~srowen], I

[jira] [Commented] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-10 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15051056#comment-15051056 ] RJ Nowling commented on SPARK-4816: --- Hi [~srowen], I haven't tried master yet but that wouldn't address

[jira] [Reopened] (SPARK-4816) Maven profile netlib-lgpl does not work

2015-12-09 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] RJ Nowling reopened SPARK-4816: --- I ran into the same issue with Spark 1.4. If I download the tarball from {{spark.apache.org}} and build

[jira] [Commented] (SPARK-3644) REST API for Spark application info (jobs / stages / tasks / storage info)

2015-07-10 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14622342#comment-14622342 ] RJ Nowling commented on SPARK-3644: --- [~joshrosen] Thanks for pointing to the new JIRA!

[jira] [Commented] (SPARK-3644) REST API for Spark application info (jobs / stages / tasks / storage info)

2015-07-08 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619144#comment-14619144 ] RJ Nowling commented on SPARK-3644: --- [~joshrosen] The issue and corresponding PR you

[jira] [Commented] (SPARK-3644) REST API for Spark application info (jobs / stages / tasks / storage info)

2015-07-08 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619239#comment-14619239 ] RJ Nowling commented on SPARK-3644: --- [~joshrosen] Several users commented above about

[jira] [Commented] (SPARK-4729) Add time series subsampling to MLlib

2015-07-06 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615016#comment-14615016 ] RJ Nowling commented on SPARK-4729: --- Hi [~yalamart], I haven't looked at this in quite

[jira] [Created] (SPARK-6522) Standardize Random Number Generation

2015-03-24 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-6522: - Summary: Standardize Random Number Generation Key: SPARK-6522 URL: https://issues.apache.org/jira/browse/SPARK-6522 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2015-03-11 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357367#comment-14357367 ] RJ Nowling commented on SPARK-2429: --- Hi [~yuu.ishik...@gmail.com] I think the new

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2015-03-11 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357389#comment-14357389 ] RJ Nowling commented on SPARK-2429: --- [~josephkb] I think it would be great to get the

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2015-03-11 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357410#comment-14357410 ] RJ Nowling commented on SPARK-2429: --- I'm familiar with the community interest but I'm

[jira] [Created] (SPARK-6167) Previous Commit Broke BroadcastTest

2015-03-04 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-6167: - Summary: Previous Commit Broke BroadcastTest Key: SPARK-6167 URL: https://issues.apache.org/jira/browse/SPARK-6167 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6167) Previous Commit Broke BroadcastTest

2015-03-04 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347607#comment-14347607 ] RJ Nowling commented on SPARK-6167: --- This PR fixes the issue in master and the 1.3

[jira] [Commented] (SPARK-6167) Previous Commit Broke BroadcastTest

2015-03-04 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14347773#comment-14347773 ] RJ Nowling commented on SPARK-6167: --- Great! Thanks! Previous Commit Broke

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2015-03-02 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343829#comment-14343829 ] RJ Nowling commented on SPARK-2308: --- Ok, we should mark the status of the JIRA as won't

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2015-03-02 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343804#comment-14343804 ] RJ Nowling commented on SPARK-2308: --- [~derrickburns] and [~mengxr] Is work still being

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2015-03-02 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14343806#comment-14343806 ] RJ Nowling commented on SPARK-2429: --- [~yuu.ishik...@gmail.com] are you still working on

[jira] [Commented] (SPARK-2430) Standarized Clustering Algorithm API and Framework

2015-03-01 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342382#comment-14342382 ] RJ Nowling commented on SPARK-2430: --- I think we can close this JIRA. It's been

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-21 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14285980#comment-14285980 ] RJ Nowling commented on SPARK-4894: --- [~mengxr] Since [~lmcguire] has submitted the

[jira] [Commented] (SPARK-5328) Update PySpark MLlib NaiveBayes API to take model type parameter for Bernoulli fit

2015-01-20 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283887#comment-14283887 ] RJ Nowling commented on SPARK-5328: --- The Python API for Naive Bayes is located in

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-15 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279250#comment-14279250 ] RJ Nowling commented on SPARK-4894: --- Thanks, [~josephkb]! I'd be happy to help with the

[jira] [Commented] (SPARK-5272) Refactor NaiveBayes to support discrete and continuous labels,features

2015-01-15 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14279258#comment-14279258 ] RJ Nowling commented on SPARK-5272: --- Hi [~josephkb], I can see benefits to your

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277631#comment-14277631 ] RJ Nowling commented on SPARK-4894: --- Thanks [~lmcguire]! I'll wait until next week in

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1420#comment-1420 ] RJ Nowling commented on SPARK-4894: --- Hi [~josephkb], lots to think about! In general,

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278228#comment-14278228 ] RJ Nowling commented on SPARK-4894: --- [~josephkb], after some thought, I've come around

[jira] [Comment Edited] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278228#comment-14278228 ] RJ Nowling edited comment on SPARK-4894 at 1/15/15 4:21 AM:

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-13 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276380#comment-14276380 ] RJ Nowling commented on SPARK-4894: --- Hi @lmcguire, Always happy to have more help! :)

[jira] [Comment Edited] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-13 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276380#comment-14276380 ] RJ Nowling edited comment on SPARK-4894 at 1/14/15 2:06 AM: Hi

[jira] [Commented] (SPARK-4728) Add exponential, log normal, and gamma distributions to data generator to MLlib

2014-12-18 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252224#comment-14252224 ] RJ Nowling commented on SPARK-4728: --- [~mengxr] can you assign this JIRA to me since I've

[jira] [Created] (SPARK-4891) Add exponential, log normal, and gamma distributions to data generator to PySpark's MLlib

2014-12-18 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-4891: - Summary: Add exponential, log normal, and gamma distributions to data generator to PySpark's MLlib Key: SPARK-4891 URL: https://issues.apache.org/jira/browse/SPARK-4891

[jira] [Commented] (SPARK-4891) Add exponential, log normal, and gamma distributions to data generator to PySpark's MLlib

2014-12-18 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252534#comment-14252534 ] RJ Nowling commented on SPARK-4891: --- [~mengxr] Could you assign this to me? Thanks! :)

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2014-12-18 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252860#comment-14252860 ] RJ Nowling commented on SPARK-4894: --- [~mengxr] Could you assign this to me? Thanks!

[jira] [Commented] (SPARK-4728) Add exponential, log normal, and gamma distributions to data generator to MLlib

2014-12-11 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14242924#comment-14242924 ] RJ Nowling commented on SPARK-4728: --- I posted a PR for this issue:

[jira] [Issue Comment Deleted] (SPARK-4728) Add exponential, log normal, and gamma distributions to data generator to MLlib

2014-12-11 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] RJ Nowling updated SPARK-4728: -- Comment: was deleted (was: I posted a PR for this issue: https://github.com/apache/spark/pull/3680)

[jira] [Commented] (SPARK-4727) Add dimensional RDDs (time series, spatial)

2014-12-04 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14234399#comment-14234399 ] RJ Nowling commented on SPARK-4727: --- Thanks, Jeremy! Your work may cover my needs, and

[jira] [Created] (SPARK-4727) Add dimensional RDDs (time series, spatial)

2014-12-03 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-4727: - Summary: Add dimensional RDDs (time series, spatial) Key: SPARK-4727 URL: https://issues.apache.org/jira/browse/SPARK-4727 Project: Spark Issue Type:

[jira] [Created] (SPARK-4728) Add exponential, log normal, and gamma distributions to data generator to MLlib

2014-12-03 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-4728: - Summary: Add exponential, log normal, and gamma distributions to data generator to MLlib Key: SPARK-4728 URL: https://issues.apache.org/jira/browse/SPARK-4728 Project:

[jira] [Created] (SPARK-4729) Add time series subsampling to MLlib

2014-12-03 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-4729: - Summary: Add time series subsampling to MLlib Key: SPARK-4729 URL: https://issues.apache.org/jira/browse/SPARK-4729 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-11-16 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14213945#comment-14213945 ] RJ Nowling commented on SPARK-2429: --- Hi Yu, I'm having trouble finding the function to

[jira] [Commented] (SPARK-4158) Spark throws exception when Mesos resources are missing

2014-10-31 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14192448#comment-14192448 ] RJ Nowling commented on SPARK-4158: --- I verified that the associated patch fixes this

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-10-29 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14188325#comment-14188325 ] RJ Nowling commented on SPARK-2429: --- The sparsity tests look good. Have you compared

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-10-23 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181183#comment-14181183 ] RJ Nowling commented on SPARK-2429: --- I added a couple comments to the PR. I would say

[jira] [Commented] (SPARK-4040) calling count() on RDD's emitted from a DStream blocks forEachRDD progress.

2014-10-22 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179882#comment-14179882 ] RJ Nowling commented on SPARK-4040: --- I don't think you can access a RDD from with an

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-10-22 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179890#comment-14179890 ] RJ Nowling commented on SPARK-2429: --- A 6x performance improvement is great improvement!

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-10-15 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172486#comment-14172486 ] RJ Nowling commented on SPARK-2429: --- Great to know! I'm glad that isn't a bottleneck.

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-10-09 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14165480#comment-14165480 ] RJ Nowling commented on SPARK-2429: --- Great work, Yu! Ok, first off, let me make sure I

[jira] [Commented] (SPARK-3785) Support off-loading computations to a GPU

2014-10-08 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14163712#comment-14163712 ] RJ Nowling commented on SPARK-3785: --- Part of my graduate work involved implementing

[jira] [Comment Edited] (SPARK-3614) Filter on minimum occurrences of a term in IDF

2014-09-22 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142860#comment-14142860 ] RJ Nowling edited comment on SPARK-3614 at 9/22/14 5:52 PM:

[jira] [Commented] (SPARK-3614) Filter on minimum occurrences of a term in IDF

2014-09-22 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14143898#comment-14143898 ] RJ Nowling commented on SPARK-3614: --- It could lead to over-fitting and thus

[jira] [Commented] (SPARK-3614) Filter on minimum occurrences of a term in IDF

2014-09-21 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142631#comment-14142631 ] RJ Nowling commented on SPARK-3614: --- I would like to work on this. Filter on minimum

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-09-16 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14136671#comment-14136671 ] RJ Nowling commented on SPARK-2429: --- Great! I look forward to seeing your

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2014-09-15 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14134264#comment-14134264 ] RJ Nowling commented on SPARK-2308: --- It is true that we will save on the distance

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2014-09-15 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14134301#comment-14134301 ] RJ Nowling commented on SPARK-2308: --- I'm not a committer but [~mengxr] is. That said,

[jira] [Commented] (SPARK-3250) More Efficient Sampling

2014-09-11 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14130880#comment-14130880 ] RJ Nowling commented on SPARK-3250: --- Great work! If these performance improvements hold

[jira] [Commented] (SPARK-2966) Add an approximation algorithm for hierarchical clustering to MLlib

2014-09-05 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14123077#comment-14123077 ] RJ Nowling commented on SPARK-2966: --- Wonderful! If I can help or when you're ready for

[jira] [Commented] (SPARK-2966) Add an approximation algorithm for hierarchical clustering to MLlib

2014-09-04 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14122266#comment-14122266 ] RJ Nowling commented on SPARK-2966: --- No worries. Based on my reading of the Spark

[jira] [Commented] (SPARK-2430) Standarized Clustering Algorithm API and Framework

2014-09-04 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14122273#comment-14122273 ] RJ Nowling commented on SPARK-2430: --- Hi Yu, The community had suggested looking into

[jira] [Created] (SPARK-3384) Potential thread unsafe Breeze vector addition in KMeans

2014-09-03 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-3384: - Summary: Potential thread unsafe Breeze vector addition in KMeans Key: SPARK-3384 URL: https://issues.apache.org/jira/browse/SPARK-3384 Project: Spark Issue Type:

[jira] [Updated] (SPARK-3384) Potential thread unsafe Breeze vector addition in KMeans

2014-09-03 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] RJ Nowling updated SPARK-3384: -- Description: In the KMeans clustering implementation, the Breeze vectors are accumulated using +=.

[jira] [Commented] (SPARK-3384) Potential thread unsafe Breeze vector addition in KMeans

2014-09-03 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14120294#comment-14120294 ] RJ Nowling commented on SPARK-3384: --- Xiangrui Meng I'll try to get a code example

[jira] [Commented] (SPARK-3250) More Efficient Sampling

2014-08-29 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14115213#comment-14115213 ] RJ Nowling commented on SPARK-3250: --- Very clever! Once it's verified to sample

[jira] [Created] (SPARK-3250) More Efficient Sampling

2014-08-27 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-3250: - Summary: More Efficient Sampling Key: SPARK-3250 URL: https://issues.apache.org/jira/browse/SPARK-3250 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-3250) More Efficient Sampling

2014-08-27 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] RJ Nowling updated SPARK-3250: -- Description: Sampling, as currently implemented in Spark, is an O(n) operation. A number of

[jira] [Updated] (SPARK-3250) More Efficient Sampling

2014-08-27 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] RJ Nowling updated SPARK-3250: -- Description: Sampling, as currently implemented in Spark, is an O\(n\) operation. A number of

[jira] [Commented] (SPARK-2429) Hierarchical Implementation of KMeans

2014-08-27 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1411#comment-1411 ] RJ Nowling commented on SPARK-2429: --- Discussion on the dev list mentioned a community

[jira] [Commented] (SPARK-2966) Add an approximation algorithm for hierarchical clustering to MLlib

2014-08-27 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14112223#comment-14112223 ] RJ Nowling commented on SPARK-2966: --- This is a duplicate of SPARK-2429. Please see the

[jira] [Created] (SPARK-3263) PR #720 broke GraphGenerator.logNormal

2014-08-27 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-3263: - Summary: PR #720 broke GraphGenerator.logNormal Key: SPARK-3263 URL: https://issues.apache.org/jira/browse/SPARK-3263 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2014-08-26 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14111224#comment-14111224 ] RJ Nowling commented on SPARK-2308: --- Xiangrui, I realized that sampling in Spark is

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2014-07-30 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079260#comment-14079260 ] RJ Nowling commented on SPARK-2308: --- Thanks for the clarification. :) I'll run the

[jira] [Updated] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2014-07-16 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] RJ Nowling updated SPARK-2308: -- Attachment: uneven_centers.pdf many_small_centers.pdf Add KMeans MiniBatch clustering

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2014-07-16 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14063601#comment-14063601 ] RJ Nowling commented on SPARK-2308: --- I tested kmeans vs minibatch kmeans under 2

[jira] [Created] (SPARK-2429) Hierarchical Implementation of KMeans

2014-07-10 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-2429: - Summary: Hierarchical Implementation of KMeans Key: SPARK-2429 URL: https://issues.apache.org/jira/browse/SPARK-2429 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-2430) Standarized Clustering Algorithm API and Framework

2014-07-10 Thread RJ Nowling (JIRA)
RJ Nowling created SPARK-2430: - Summary: Standarized Clustering Algorithm API and Framework Key: SPARK-2430 URL: https://issues.apache.org/jira/browse/SPARK-2430 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-2308) Add KMeans MiniBatch clustering algorithm to MLlib

2014-07-10 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14057513#comment-14057513 ] RJ Nowling commented on SPARK-2308: --- That sounds like a good idea for a test. I'll