[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277561#comment-14277561 ] Reynold Xin commented on SPARK-5235: I will merge your PR and we can continue having

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277611#comment-14277611 ] Alex Baretta commented on SPARK-5235: - [~rxin] I see your point. Well, listen, I

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277631#comment-14277631 ] RJ Nowling commented on SPARK-4894: --- Thanks [~lmcguire]! I'll wait until next week in

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2015-01-14 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277646#comment-14277646 ] Imran Rashid commented on SPARK-4746: - submitted a PR:

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277475#comment-14277475 ] Sean Owen commented on SPARK-5235: -- [~alexbaretta] Well at least that explains why tests

[jira] [Commented] (SPARK-5193) Make Spark SQL API usable in Java and remove the Java-specific API

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277656#comment-14277656 ] Apache Spark commented on SPARK-5193: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277555#comment-14277555 ] Alex Baretta commented on SPARK-5235: - [~rxin] Could be. All I'm saying is that your

[jira] [Commented] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277553#comment-14277553 ] Sean Owen commented on SPARK-5235: -- [~alexbaretta] I suppose my point is that no code can

[jira] [Commented] (SPARK-5206) Accumulators are not re-registered during recovering from checkpoint

2015-01-14 Thread vincent ye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277552#comment-14277552 ] vincent ye commented on SPARK-5206: --- Hi Tathagata, Accumulator object is created after

[jira] [Updated] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5234: - Fix Version/s: 1.2.1 examples for ml don't have sparkContext.stop

[jira] [Updated] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5234: - Target Version/s: 1.3.0, 1.2.1 (was: 1.3.0) examples for ml don't have sparkContext.stop

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277702#comment-14277702 ] Joseph K. Bradley commented on SPARK-4894: -- [~rnowling] Thanks for looking into

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277533#comment-14277533 ] Alex Baretta commented on SPARK-5235: - [~sowen] I would much rather the decision of

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277578#comment-14277578 ] Xiangrui Meng commented on SPARK-4894: -- [~rnowling] I've assigned this to you. Let's

[jira] [Commented] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-14 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277643#comment-14277643 ] Manoj Kumar commented on SPARK-3726: [~josephkb] You seem to report issues that I

[jira] [Created] (SPARK-5255) Use python doc note for experimental tags in tree.py

2015-01-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5255: Summary: Use python doc note for experimental tags in tree.py Key: SPARK-5255 URL: https://issues.apache.org/jira/browse/SPARK-5255 Project: Spark

[jira] [Updated] (SPARK-5255) Use python doc note for experimental tags in tree.py

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5255: - Description: spark/python/pyspark/mllib/tree.py currently has several EXPERIMENTAL tags

[jira] [Updated] (SPARK-5228) Hide tables for Active Jobs/Completed Jobs/Failed Jobs when they are empty

2015-01-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5228: -- Assignee: Kousuke Saruta Hide tables for Active Jobs/Completed Jobs/Failed Jobs when they are empty

[jira] [Resolved] (SPARK-5228) Hide tables for Active Jobs/Completed Jobs/Failed Jobs when they are empty

2015-01-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-5228. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4028

[jira] [Resolved] (SPARK-4014) TaskContext.attemptId returns taskId

2015-01-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4014. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3849

[jira] [Updated] (SPARK-4014) TaskContext.attemptId returns taskId

2015-01-14 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4014: -- Target Version/s: 1.0.3, 1.1.2, 1.2.1 Assignee: Josh Rosen Labels:

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2015-01-14 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277651#comment-14277651 ] Imran Rashid commented on SPARK-4746: - btw if anybody else wants to futz around with

[jira] [Updated] (SPARK-3726) RandomForest: Support for bootstrap options

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3726: - Assignee: Manoj Kumar RandomForest: Support for bootstrap options

[jira] [Commented] (SPARK-4585) Spark dynamic executor allocation shouldn't use maxExecutors as initial number

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277716#comment-14277716 ] Apache Spark commented on SPARK-4585: - User 'sryza' has created a pull request for

[jira] [Commented] (SPARK-3717) DecisionTree, RandomForest: Partition by feature

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277485#comment-14277485 ] Joseph K. Bradley commented on SPARK-3717: -- [~bbnsumanth] Please do not

[jira] [Updated] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5235: --- Summary: Determine serializability of SQLContext (was: java.io.NotSerializableException:

[jira] [Resolved] (SPARK-2909) Indexing for SparseVector in pyspark

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-2909. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4025

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277460#comment-14277460 ] Alex Baretta commented on SPARK-5235: - [~rxin] [~sowen] My bad! Indeed the SQLContext

[jira] [Created] (SPARK-5253) LinearRegression with L1/L2 (elastic net) using OWLQN in new ML pacakge

2015-01-14 Thread DB Tsai (JIRA)
DB Tsai created SPARK-5253: -- Summary: LinearRegression with L1/L2 (elastic net) using OWLQN in new ML pacakge Key: SPARK-5253 URL: https://issues.apache.org/jira/browse/SPARK-5253 Project: Spark

[jira] [Updated] (SPARK-5235) Determine serializability of SQLContext

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5235: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-5166 Determine serializability of SQLContext

[jira] [Issue Comment Deleted] (SPARK-5206) Accumulators are not re-registered during recovering from checkpoint

2015-01-14 Thread vincent ye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vincent ye updated SPARK-5206: -- Comment: was deleted (was: Hi Tathagata, Accumulator object is created after the StreamingContext (ssc)

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Leah McGuire (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277590#comment-14277590 ] Leah McGuire commented on SPARK-4894: - Thanks [~rnowling]! I can take a look at it

[jira] [Updated] (SPARK-5234) examples for ml don't have sparkContext.stop

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5234: - Assignee: yuhao yang examples for ml don't have sparkContext.stop

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277633#comment-14277633 ] Apache Spark commented on SPARK-4746: - User 'squito' has created a pull request for

[jira] [Commented] (SPARK-5199) Input metrics should show up for InputFormats that return CombineFileSplits

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277704#comment-14277704 ] Apache Spark commented on SPARK-5199: - User 'sryza' has created a pull request for

[jira] [Created] (SPARK-5254) Update the user guide to make clear that spark.mllib is not being deprecated

2015-01-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5254: Summary: Update the user guide to make clear that spark.mllib is not being deprecated Key: SPARK-5254 URL: https://issues.apache.org/jira/browse/SPARK-5254 Project:

[jira] [Issue Comment Deleted] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5256: - Comment: was deleted (was: Generalization: grouped optimization) Improving MLlib

[jira] [Commented] (SPARK-5254) Update the user guide to make clear that spark.mllib is not being deprecated

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277768#comment-14277768 ] Apache Spark commented on SPARK-5254: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1420#comment-1420 ] RJ Nowling commented on SPARK-4894: --- Hi [~josephkb], lots to think about! In general,

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277858#comment-14277858 ] Joseph K. Bradley commented on SPARK-5256: -- Generalization: grouped optimization

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277857#comment-14277857 ] Joseph K. Bradley commented on SPARK-5256: -- Improving Updater concept Improving

[jira] [Updated] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-01-14 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-3655: -- Priority: Major (was: Minor) Support sorting of values in addition to keys (i.e. secondary sort)

[jira] [Created] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5256: Summary: Improving MLlib optimization APIs Key: SPARK-5256 URL: https://issues.apache.org/jira/browse/SPARK-5256 Project: Spark Issue Type: Umbrella

[jira] [Commented] (SPARK-4906) Spark master OOMs with exception stack trace stored in JobProgressListener

2015-01-14 Thread Mingyu Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277849#comment-14277849 ] Mingyu Kim commented on SPARK-4906: --- typically once a few tasks have failed the stage

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277933#comment-14277933 ] Joseph K. Bradley commented on SPARK-4894: -- +1 for small changes, but

[jira] [Commented] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277932#comment-14277932 ] Xiangrui Meng commented on SPARK-5226: -- [~angellandros] Before you start coding,

[jira] [Updated] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5226: - Target Version/s: (was: 1.2.0) Add DBSCAN Clustering Algorithm to MLlib

[jira] [Updated] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5226: - Affects Version/s: (was: 1.2.0) Add DBSCAN Clustering Algorithm to MLlib

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277986#comment-14277986 ] Alexander Ulanov commented on SPARK-5256: - I would like to improve Gradient

[jira] [Commented] (SPARK-5256) Improving MLlib optimization APIs

2015-01-14 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277988#comment-14277988 ] Alexander Ulanov commented on SPARK-5256: - Also, asynchronous gradient update

[jira] [Updated] (SPARK-5260) Expose JsonRDD.allKeysWithValueTypes() in a utility class

2015-01-14 Thread Corey J. Nolet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Corey J. Nolet updated SPARK-5260: -- Description: I have found this method extremely useful when implementing my own strategy for

[jira] [Created] (SPARK-5258) Clean up exposed classes in sql.hive package

2015-01-14 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5258: -- Summary: Clean up exposed classes in sql.hive package Key: SPARK-5258 URL: https://issues.apache.org/jira/browse/SPARK-5258 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread SuYan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SuYan updated SPARK-5259: - Description: 1. while shuffle stage was retry, there may have 2 taskSet running. we call the 2

[jira] [Commented] (SPARK-5193) Make Spark SQL API usable in Java and remove the Java-specific API

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278249#comment-14278249 ] Apache Spark commented on SPARK-5193: - User 'rxin' has created a pull request for this

[jira] [Created] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread SuYan (JIRA)
SuYan created SPARK-5259: Summary: Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry Key: SPARK-5259 URL: https://issues.apache.org/jira/browse/SPARK-5259

[jira] [Resolved] (SPARK-5254) Update the user guide to make clear that spark.mllib is not being deprecated

2015-01-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5254. -- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull

[jira] [Commented] (SPARK-5226) Add DBSCAN Clustering Algorithm to MLlib

2015-01-14 Thread Muhammad-Ali A'rabi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278305#comment-14278305 ] Muhammad-Ali A'rabi commented on SPARK-5226: Yeah, of course. It will take me

[jira] [Created] (SPARK-5257) SparseVector indices must be non-negative

2015-01-14 Thread Derrick Burns (JIRA)
Derrick Burns created SPARK-5257: Summary: SparseVector indices must be non-negative Key: SPARK-5257 URL: https://issues.apache.org/jira/browse/SPARK-5257 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278222#comment-14278222 ] Apache Spark commented on SPARK-5259: - User 'suyanNone' has created a pull request for

[jira] [Updated] (SPARK-5259) Add task equal() and hashcode() to avoid stage.pendingTasks not accurate while stage was retry

2015-01-14 Thread SuYan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SuYan updated SPARK-5259: - Description: 1. while shuffle stage was retry, there may have 2 taskSet running. we call the 2

[jira] [Commented] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278228#comment-14278228 ] RJ Nowling commented on SPARK-4894: --- [~josephkb], after some thought, I've come around

[jira] [Comment Edited] (SPARK-4894) Add Bernoulli-variant of Naive Bayes

2015-01-14 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278228#comment-14278228 ] RJ Nowling edited comment on SPARK-4894 at 1/15/15 4:21 AM:

[jira] [Commented] (SPARK-5193) Make Spark SQL API usable in Java and remove the Java-specific API

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278163#comment-14278163 ] Apache Spark commented on SPARK-5193: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5254) Update the user guide to make clear that spark.mllib is not being deprecated

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278160#comment-14278160 ] Apache Spark commented on SPARK-5254: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-14 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276943#comment-14276943 ] Travis Galoppo commented on SPARK-5019: --- I have a patch prepared for this; it is

[jira] [Commented] (SPARK-5220) keepPushingBlocks in BlockGenerator terminated when an exception occurs, which causes the block pushing thread to terminate and blocks receiver

2015-01-14 Thread Max Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276974#comment-14276974 ] Max Xu commented on SPARK-5220: --- I believe with https://github.com/apache/spark/pull/3655,

[jira] [Commented] (SPARK-5251) Using `tableIdentifier` in hive metastore

2015-01-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276980#comment-14276980 ] Apache Spark commented on SPARK-5251: - User 'scwf' has created a pull request for this

[jira] [Created] (SPARK-5250) EOFException in when reading gzipped files from S3 with wholeTextFiles

2015-01-14 Thread Mojmir Vinkler (JIRA)
Mojmir Vinkler created SPARK-5250: - Summary: EOFException in when reading gzipped files from S3 with wholeTextFiles Key: SPARK-5250 URL: https://issues.apache.org/jira/browse/SPARK-5250 Project:

[jira] [Comment Edited] (SPARK-5220) keepPushingBlocks in BlockGenerator terminated when an exception occurs, which causes the block pushing thread to terminate and blocks receiver

2015-01-14 Thread Max Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276974#comment-14276974 ] Max Xu edited comment on SPARK-5220 at 1/14/15 2:49 PM: I believe

[jira] [Commented] (SPARK-5250) EOFException in when reading gzipped files from S3 with wholeTextFiles

2015-01-14 Thread Mojmir Vinkler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277051#comment-14277051 ] Mojmir Vinkler commented on SPARK-5250: --- Just tested with Scala and got the same

[jira] [Created] (SPARK-5251) Using `tableIdentifier` in hive metastore

2015-01-14 Thread wangfei (JIRA)
wangfei created SPARK-5251: -- Summary: Using `tableIdentifier` in hive metastore Key: SPARK-5251 URL: https://issues.apache.org/jira/browse/SPARK-5251 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5147) write ahead logs from streaming receiver are not purged because cleanupOldBlocks in WriteAheadLogBasedBlockHandler is never called

2015-01-14 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-5147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276988#comment-14276988 ] François Garillot commented on SPARK-5147: -- 1. Yes, you're right I had forgotten

[jira] [Updated] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with large size

2015-01-14 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Derrick Burns updated SPARK-5186: - Summary: Vector.equals and Vector.hashCode are very inefficient and fail on SparseVectors with

[jira] [Commented] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-14 Thread Pedro Rodriguez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278381#comment-14278381 ] Pedro Rodriguez commented on SPARK-1405: Worked on some preliminary testing

[jira] [Commented] (SPARK-5186) Vector.equals and Vector.hashCode are very inefficient

2015-01-14 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278374#comment-14278374 ] Derrick Burns commented on SPARK-5186: -- The aforementioned pull request does fix part

[jira] [Commented] (SPARK-5147) write ahead logs from streaming receiver are not purged because cleanupOldBlocks in WriteAheadLogBasedBlockHandler is never called

2015-01-14 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278161#comment-14278161 ] Saisai Shao commented on SPARK-5147: 1. Currently detecting whether to delete the WAL

[jira] [Created] (SPARK-5261) In some cases ,The value of word's vector representation is too big

2015-01-14 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-5261: -- Summary: In some cases ,The value of word's vector representation is too big Key: SPARK-5261 URL: https://issues.apache.org/jira/browse/SPARK-5261 Project: Spark

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277114#comment-14277114 ] Alex Baretta commented on SPARK-5235: - [~sowen] My SQL queries are failing due to

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-14 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277104#comment-14277104 ] Nicholas Chammas commented on SPARK-3821: - Hmm, I doubt that was intentional since

[jira] [Updated] (SPARK-5179) Spark UI history job duration is wrong

2015-01-14 Thread Olivier Toupin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olivier Toupin updated SPARK-5179: -- Target Version/s: (was: 1.2.1) Spark UI history job duration is wrong

[jira] [Updated] (SPARK-5179) Spark UI history job duration is wrong

2015-01-14 Thread Olivier Toupin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Olivier Toupin updated SPARK-5179: -- Fix Version/s: 1.2.1 Spark UI history job duration is wrong

[jira] [Commented] (SPARK-3821) Develop an automated way of creating Spark images (AMI, Docker, and others)

2015-01-14 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277334#comment-14277334 ] Shivaram Venkataraman commented on SPARK-3821: -- Regarding the pre-built

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277217#comment-14277217 ] Sean Owen commented on SPARK-5235: -- [~alexbaretta] It certainly may not be your code of

[jira] [Created] (SPARK-5252) Streaming StatefulNetworkWordCount example hangs

2015-01-14 Thread Lutz Buech (JIRA)
Lutz Buech created SPARK-5252: - Summary: Streaming StatefulNetworkWordCount example hangs Key: SPARK-5252 URL: https://issues.apache.org/jira/browse/SPARK-5252 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277240#comment-14277240 ] Alex Baretta commented on SPARK-5235: - [~sowen] I agree with you that contexts have no

[jira] [Updated] (SPARK-5252) Streaming StatefulNetworkWordCount example hangs

2015-01-14 Thread Lutz Buech (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lutz Buech updated SPARK-5252: -- Attachment: debug.txt log at DEBUG level Streaming StatefulNetworkWordCount example hangs

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277258#comment-14277258 ] Alex Baretta commented on SPARK-5235: - Yes, there is a need for a hotfix. [~rxin]

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277270#comment-14277270 ] Reynold Xin commented on SPARK-5235: I can merge your PR soon, but [~alexbaretta] can

[jira] [Commented] (SPARK-5211) Restore HiveMetastoreTypes.toDataType

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277290#comment-14277290 ] Reynold Xin commented on SPARK-5211: BTW who are the developers using it? Restore

[jira] [Commented] (SPARK-5211) Restore HiveMetastoreTypes.toDataType

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277292#comment-14277292 ] Reynold Xin commented on SPARK-5211: I'm under the impression that everything in the

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277251#comment-14277251 ] Sean Owen commented on SPARK-5235: -- @Alex Baretta what version worked? If you're saying a

[jira] [Updated] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5235: --- Description: The SQLConf field in SQLContext is neither Serializable nor transient. Here's the stack

[jira] [Updated] (SPARK-5245) Move Decimal from types.decimal to types package

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5245: --- Summary: Move Decimal from types.decimal to types package (was: Move Decimal and Date into

[jira] [Resolved] (SPARK-5248) moving Decimal from types.decimal to types package

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5248. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Adrian Wang Fixed in

[jira] [Updated] (SPARK-5245) Move Decimal from types.decimal to types package

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5245: --- Assignee: Adrian Wang Move Decimal from types.decimal to types package

[jira] [Resolved] (SPARK-5245) Move Decimal from types.decimal to types package

2015-01-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5245. Resolution: Fixed Fix Version/s: 1.3.0 Fixed in https://github.com/apache/spark/pull/4041

[jira] [Commented] (SPARK-5235) java.io.NotSerializableException: org.apache.spark.sql.SQLConf

2015-01-14 Thread Alex Baretta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277414#comment-14277414 ] Alex Baretta commented on SPARK-5235: - [~rxin] I'm sorry to say it's not that easy,

[jira] [Created] (SPARK-5244) add parser for COALESCE()

2015-01-14 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-5244: -- Summary: add parser for COALESCE() Key: SPARK-5244 URL: https://issues.apache.org/jira/browse/SPARK-5244 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-3276) Provide a API to specify whether the old files need to be ignored in file input text DStream

2015-01-14 Thread Jem Tucker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276641#comment-14276641 ] Jem Tucker commented on SPARK-3276: --- This can be achieved using

[jira] [Created] (SPARK-5245) Move Decimal and Date into o.a.s.sql.types

2015-01-14 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5245: -- Summary: Move Decimal and Date into o.a.s.sql.types Key: SPARK-5245 URL: https://issues.apache.org/jira/browse/SPARK-5245 Project: Spark Issue Type: Improvement

  1   2   >