[jira] [Resolved] (SPARK-6053) Support model save/load in Python's ALS.

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6053. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4811

[jira] [Resolved] (SPARK-6083) Make Python API example consistent in NaiveBayes

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6083. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4834

[jira] [Commented] (SPARK-6077) Multiple spark streaming tabs on UI when reuse the same sparkcontext

2015-03-01 Thread zhichao-li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342595#comment-14342595 ] zhichao-li commented on SPARK-6077: --- Yeah. It would fix the SPARK-2463 as well. Almost

[jira] [Updated] (SPARK-6085) Increase default value for memory overhead

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6085: - Component/s: Spark Core Priority: Minor (was: Major) Increase default value for memory overhead

[jira] [Updated] (SPARK-6089) Size of task result fetched can't be found in UI

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6089: - Priority: Minor (was: Major) Affects Version/s: (was: 1.3.0)

[jira] [Commented] (SPARK-6082) SparkSQL should fail gracefully when input data format doesn't match expectations

2015-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342094#comment-14342094 ] Apache Spark commented on SPARK-6082: - User 'liancheng' has created a pull request for

[jira] [Resolved] (SPARK-1122) Collect the RDD and send to each partition to form a new RDD

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1122. -- Resolution: Won't Fix You can also accomplish this with {{mapPartitions}} and simply convert the

[jira] [Resolved] (SPARK-3023) SIGINT to driver with yarn-client should release containers on the cluster

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3023. -- Resolution: Cannot Reproduce ... or perhaps fixed along the way. I tried spark-shell in yarn-client

[jira] [Commented] (SPARK-2430) Standarized Clustering Algorithm API and Framework

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342178#comment-14342178 ] Sean Owen commented on SPARK-2430: -- There are a number of outstanding JIRAs about

[jira] [Updated] (SPARK-4800) RDD Preview Feature in WebUI

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4800: - Component/s: (was: Spark Core) Priority: Minor (was: Major) Preview meaning a few items from

[jira] [Resolved] (SPARK-5628) Add option to return spark-ec2 version

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5628. -- Resolution: Fixed Fix Version/s: 1.2.2 Target Version/s: (was: 1.2.2) Add option to

[jira] [Updated] (SPARK-5628) Add option to return spark-ec2 version

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5628: - Labels: (was: backport-needed) Add option to return spark-ec2 version

[jira] [Comment Edited] (SPARK-5499) iterative computing with 1000 iterations causes stage failure

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342115#comment-14342115 ] Sean Owen edited comment on SPARK-5499 at 3/1/15 10:12 AM: --- To

[jira] [Resolved] (SPARK-5499) iterative computing with 1000 iterations causes stage failure

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5499. -- Resolution: Not a Problem To narrow this down, I tried: {code:scala}

[jira] [Commented] (SPARK-3621) Provide a way to broadcast an RDD (instead of just a variable made of the RDD) so that a job can access

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342119#comment-14342119 ] Sean Owen commented on SPARK-3621: -- I'd like to resolve what the use case is here: Is

[jira] [Commented] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342134#comment-14342134 ] Apache Spark commented on SPARK-5390: - User 'srowen' has created a pull request for

[jira] [Resolved] (SPARK-3558) Throw exception for concurrently-running SparkContexts / StreamingContexts in the same JVM

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3558. -- Resolution: Duplicate Target Version/s: (was: 1.2.0) Pardon, this looks like exactly the

[jira] [Resolved] (SPARK-3678) Yarn app name reported in RM is different between cluster and client mode

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3678. -- Resolution: Duplicate Target Version/s: (was: 1.2.0) Resolving in favor of the later one

[jira] [Resolved] (SPARK-2372) Grouped Optimization/Learning

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2372. -- Resolution: Won't Fix Sounds like a WontFix given the PR discussion Grouped Optimization/Learning

[jira] [Commented] (SPARK-1593) Add status command to Spark Daemons(master/worker)

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342158#comment-14342158 ] Sean Owen commented on SPARK-1593: -- [~pradeepbaji] if you want to take this forward, can

[jira] [Resolved] (SPARK-2946) Allow specifying * for --num-executors in YARN

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2946. -- Resolution: Duplicate Looks like a subset of the larger request to be able to ask for all of

[jira] [Commented] (SPARK-6018) NoSuchMethodError in Spark app is swallowed by YARN AM

2015-03-01 Thread Tarek Abouzeid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342184#comment-14342184 ] Tarek Abouzeid commented on SPARK-6018: --- the problem was that i was using spark

[jira] [Commented] (SPARK-6042) spark-submit giving Exception in thread main java.lang.NoSuchMethodError: org.apache.spark.sql.hive.HiveContext.sql(Ljava/lang/String;)Lorg/apache/spark/sql/SchemaRDD

2015-03-01 Thread Tarek Abouzeid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342188#comment-14342188 ] Tarek Abouzeid commented on SPARK-6042: --- you were right , thanks so much , the

[jira] [Commented] (SPARK-1346) Backport SPARK-1210 into 0.9 branch

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342100#comment-14342100 ] Sean Owen commented on SPARK-1346: -- Q: is anything being backported to 0.9 at this point?

[jira] [Resolved] (SPARK-4392) Event proration based on event timestamps

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4392. -- Resolution: Won't Fix WontFix, per PR Event proration based on event timestamps

[jira] [Resolved] (SPARK-6074) Assembly doesn't include pyspark sql files

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6074. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4822

[jira] [Updated] (SPARK-6074) Assembly doesn't include pyspark sql files

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6074: - Component/s: (was: YARN) (was: Spark Core) SQL

[jira] [Updated] (SPARK-6074) Assembly doesn't include pyspark sql files

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6074: - Affects Version/s: (was: 1.2.1) (These pyspark/sql files didn't exist in 1.2) Assembly doesn't

[jira] [Resolved] (SPARK-1054) Get Cassandra support in Spark Core/Spark Cassandra Module

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1054. -- Resolution: Won't Fix Like HBase, there are already examples of using Cassandra from Spark via

[jira] [Commented] (SPARK-1881) Executor caching

2015-03-01 Thread nigel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342181#comment-14342181 ] nigel commented on SPARK-1881: -- Hello; This was a query for Mesos, not Yarn. I haven't used

[jira] [Commented] (SPARK-6014) java.io.IOException: Filesystem is thrown when ctrl+c or ctrl+d spark-sql on YARN

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342077#comment-14342077 ] Sean Owen commented on SPARK-6014: -- Although a fix is possible for Hadoop 2.2+, it is not

[jira] [Resolved] (SPARK-5376) [Mesos] MesosExecutor should have correct resources

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5376. -- Resolution: Not a Problem Per request from reporter. [Mesos] MesosExecutor should have correct

[jira] [Commented] (SPARK-5390) Encourage users to post on Stack Overflow in Community Docs

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342132#comment-14342132 ] Sean Owen commented on SPARK-5390: -- I'd like to remove the Community section from

[jira] [Commented] (SPARK-1881) Executor caching

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342163#comment-14342163 ] Sean Owen commented on SPARK-1881: -- Are you asking for something like {{spark.yarn.jar}}

[jira] [Updated] (SPARK-4011) tighten the visibility of the members in Master/Worker class

2015-03-01 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nan Zhu updated SPARK-4011: --- Description: Currently, most of the members in Master/Worker are with public accessibility we might wish to

[jira] [Commented] (SPARK-4011) tighten the visibility of the members in Master/Worker class

2015-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342239#comment-14342239 ] Apache Spark commented on SPARK-4011: - User 'CodingCat' has created a pull request for

[jira] [Updated] (SPARK-4011) tighten the visibility of the members in Master/Worker class

2015-03-01 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nan Zhu updated SPARK-4011: --- Summary: tighten the visibility of the members in Master/Worker class (was: tighten the visibility of the

[jira] [Comment Edited] (SPARK-2620) case class cannot be used as key for reduce

2015-03-01 Thread Marko Bonaci (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14341895#comment-14341895 ] Marko Bonaci edited comment on SPARK-2620 at 3/1/15 1:21 PM: -

[jira] [Commented] (SPARK-4011) tighten the visibility of the members in Master/Worker class

2015-03-01 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342240#comment-14342240 ] Nan Zhu commented on SPARK-4011: [~srowen] I just submitted the patch, pinged you in

[jira] [Updated] (SPARK-6083) Make Python API example consistent in NaiveBayes

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6083: - Assignee: Manoj Kumar Make Python API example consistent in NaiveBayes

[jira] [Updated] (SPARK-6083) Make Python API example consistent in NaiveBayes

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6083: - Target Version/s: 1.3.0 Make Python API example consistent in NaiveBayes

[jira] [Commented] (SPARK-6103) remove unused class to import in EdgeRDDImpl

2015-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342651#comment-14342651 ] Apache Spark commented on SPARK-6103: - User 'lianhuiwang' has created a pull request

[jira] [Commented] (SPARK-6077) Multiple spark streaming tabs on UI when reuse the same sparkcontext

2015-03-01 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342704#comment-14342704 ] Nicholas Chammas commented on SPARK-6077: - Please disregard the comments on

[jira] [Commented] (SPARK-2463) Creating then stopping StreamingContext multiple times from shell generates duplicate Streaming tabs in UI

2015-03-01 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342714#comment-14342714 ] Nicholas Chammas commented on SPARK-2463: - For people reading through these

[jira] [Updated] (SPARK-2463) Creating then stopping StreamingContext multiple times from shell generates duplicate Streaming tabs in UI

2015-03-01 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-2463: Description: Start a {{StreamingContext}} from the interactive shell and then stop it. Go

[jira] [Created] (SPARK-6105) enhance spark-ganglia to support redundant gmond addresses setting in ganglia unicast mode

2015-03-01 Thread Fuqing Yang (JIRA)
Fuqing Yang created SPARK-6105: -- Summary: enhance spark-ganglia to support redundant gmond addresses setting in ganglia unicast mode Key: SPARK-6105 URL: https://issues.apache.org/jira/browse/SPARK-6105

[jira] [Updated] (SPARK-6104) spark sql shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6104: - Affects Version/s: 1.2.0 spark sql shuffle OOM - Key: SPARK-6104

[jira] [Updated] (SPARK-6104) spark sql shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6104: - Description: Currently, spark shuffle can use ExternalAppendOnlyMap to combine data that have fetched,

[jira] [Updated] (SPARK-6104) spark sql shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6104: - Description: Currently, spark shuffle can use ExternalAppendOnlyMap to combine data that have fetched,

[jira] [Updated] (SPARK-6104) spark SQL shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6104: - Summary: spark SQL shuffle OOM (was: spark sql shuffle OOM) spark SQL shuffle OOM

[jira] [Commented] (SPARK-6104) spark SQL shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342863#comment-14342863 ] Hong Shen commented on SPARK-6104: -- [~rxin] [~sandyryza], can you pay attention to this

[jira] [Updated] (SPARK-6104) spark SQL shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6104: - Description: Currently, spark shuffle can use ExternalAppendOnlyMap to combine data that have fetched,

[jira] [Commented] (SPARK-6104) spark SQL shuffle OOM

2015-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342869#comment-14342869 ] Reynold Xin commented on SPARK-6104: Yes, large aggregation with auto spilling to disk

[jira] [Closed] (SPARK-6104) spark SQL shuffle OOM

2015-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-6104. -- Resolution: Duplicate spark SQL shuffle OOM - Key: SPARK-6104

[jira] [Commented] (SPARK-6104) spark SQL shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342872#comment-14342872 ] Hong Shen commented on SPARK-6104: -- Can you link to the Duplicate issue? spark SQL

[jira] [Comment Edited] (SPARK-6104) spark SQL shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342872#comment-14342872 ] Hong Shen edited comment on SPARK-6104 at 3/2/15 7:25 AM: -- Can

[jira] [Updated] (SPARK-2873) Support disk spilling in Spark SQL aggregation / join

2015-03-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-2873: --- Summary: Support disk spilling in Spark SQL aggregation / join (was: OOM happens when group by and

[jira] [Created] (SPARK-6106) Support user group mapping and groups in view, modify and admin acls

2015-03-01 Thread Jerry Chen (JIRA)
Jerry Chen created SPARK-6106: - Summary: Support user group mapping and groups in view, modify and admin acls Key: SPARK-6106 URL: https://issues.apache.org/jira/browse/SPARK-6106 Project: Spark

[jira] [Created] (SPARK-6107) event log file ends with .inprogress should be able to display on webUI for standalone mode

2015-03-01 Thread Zhang, Liye (JIRA)
Zhang, Liye created SPARK-6107: -- Summary: event log file ends with .inprogress should be able to display on webUI for standalone mode Key: SPARK-6107 URL: https://issues.apache.org/jira/browse/SPARK-6107

[jira] [Created] (SPARK-6104) spark sql shuffle OOM

2015-03-01 Thread Hong Shen (JIRA)
Hong Shen created SPARK-6104: Summary: spark sql shuffle OOM Key: SPARK-6104 URL: https://issues.apache.org/jira/browse/SPARK-6104 Project: Spark Issue Type: Improvement Reporter:

[jira] [Commented] (SPARK-6067) Spark sql hive dynamic partitions job will fail if task fails

2015-03-01 Thread baishuo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342642#comment-14342642 ] baishuo commented on SPARK-6067: Hi Jason, can I get more information about the problem?

[jira] [Updated] (SPARK-2463) Creating then stopping StreamingContext multiple times from shell generates duplicate Streaming tabs in UI

2015-03-01 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-2463: Description: Start a {{StreamingContext}} from the interactive shell and then stop it. Go

[jira] [Created] (SPARK-6103) remove unused class to import in EdgeRDDImpl

2015-03-01 Thread Lianhui Wang (JIRA)
Lianhui Wang created SPARK-6103: --- Summary: remove unused class to import in EdgeRDDImpl Key: SPARK-6103 URL: https://issues.apache.org/jira/browse/SPARK-6103 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-6108) No application number limit in spark history server

2015-03-01 Thread Xia Hu (JIRA)
Xia Hu created SPARK-6108: - Summary: No application number limit in spark history server Key: SPARK-6108 URL: https://issues.apache.org/jira/browse/SPARK-6108 Project: Spark Issue Type: Wish

[jira] [Created] (SPARK-6092) Add RankingMetrics in PySpark/MLlib

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6092: Summary: Add RankingMetrics in PySpark/MLlib Key: SPARK-6092 URL: https://issues.apache.org/jira/browse/SPARK-6092 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-6091) Add MulticlassMetrics in PySpark/MLlib

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6091: - Summary: Add MulticlassMetrics in PySpark/MLlib (was: Add MulticlassMetrics to PySpark/MLlib)

[jira] [Updated] (SPARK-5694) Python API for evaluation metrics

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5694: - Target Version/s: (was: 1.4.0) Python API for evaluation metrics

[jira] [Created] (SPARK-6091) Add MulticlassMetrics to PySpark/MLlib

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6091: Summary: Add MulticlassMetrics to PySpark/MLlib Key: SPARK-6091 URL: https://issues.apache.org/jira/browse/SPARK-6091 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-6090) Add BinaryClassificationMetrics in PySpark/MLlib

2015-03-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6090: - Summary: Add BinaryClassificationMetrics in PySpark/MLlib (was: Add

[jira] [Created] (SPARK-6094) Add MultilabelMetrics in PySpark/MLlib

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6094: Summary: Add MultilabelMetrics in PySpark/MLlib Key: SPARK-6094 URL: https://issues.apache.org/jira/browse/SPARK-6094 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-6093) Add RegressionMetrics in PySpark/MLlib

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6093: Summary: Add RegressionMetrics in PySpark/MLlib Key: SPARK-6093 URL: https://issues.apache.org/jira/browse/SPARK-6093 Project: Spark Issue Type: Sub-task

[jira] [Issue Comment Deleted] (SPARK-5991) Python API for ML model import/export

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5991: - Comment: was deleted (was: User 'mengxr' has created a pull request for this issue:

[jira] [Commented] (SPARK-6085) Increase default value for memory overhead

2015-03-01 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342398#comment-14342398 ] Ted Yu commented on SPARK-6085: --- In my opinion, priority for this JIRA should be Major.

[jira] [Updated] (SPARK-5991) Python API for ML model import/export

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5991: - Assignee: Joseph K. Bradley Python API for ML model import/export

[jira] [Commented] (SPARK-6053) Support model save/load in Python's ALS.

2015-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342396#comment-14342396 ] Apache Spark commented on SPARK-6053: - User 'mengxr' has created a pull request for

[jira] [Created] (SPARK-6096) Support model save/load in Python's naive Bayes

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6096: Summary: Support model save/load in Python's naive Bayes Key: SPARK-6096 URL: https://issues.apache.org/jira/browse/SPARK-6096 Project: Spark Issue Type:

[jira] [Created] (SPARK-6095) Support model save/load in Python's linear models

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6095: Summary: Support model save/load in Python's linear models Key: SPARK-6095 URL: https://issues.apache.org/jira/browse/SPARK-6095 Project: Spark Issue Type:

[jira] [Created] (SPARK-6097) Support model save/load in Python's tree models

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6097: Summary: Support model save/load in Python's tree models Key: SPARK-6097 URL: https://issues.apache.org/jira/browse/SPARK-6097 Project: Spark Issue Type:

[jira] [Created] (SPARK-6098) Propagate Experimental tag to child classes

2015-03-01 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6098: Summary: Propagate Experimental tag to child classes Key: SPARK-6098 URL: https://issues.apache.org/jira/browse/SPARK-6098 Project: Spark Issue

[jira] [Created] (SPARK-6099) Stabilize mllib ClassificationModel, RegressionModel APIs

2015-03-01 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6099: Summary: Stabilize mllib ClassificationModel, RegressionModel APIs Key: SPARK-6099 URL: https://issues.apache.org/jira/browse/SPARK-6099 Project: Spark

[jira] [Updated] (SPARK-6099) Stabilize mllib ClassificationModel, RegressionModel APIs

2015-03-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6099: - Target Version/s: 1.4.0 (was: 1.3.0) Stabilize mllib ClassificationModel,

[jira] [Updated] (SPARK-4587) Model export/import

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4587: - Issue Type: Umbrella (was: New Feature) Model export/import ---

[jira] [Created] (SPARK-6100) Distributed linear algebra in PySpark/MLlib

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6100: Summary: Distributed linear algebra in PySpark/MLlib Key: SPARK-6100 URL: https://issues.apache.org/jira/browse/SPARK-6100 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6100) Distributed linear algebra in PySpark/MLlib

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6100: - Description: This is an umbrella JIRA for the Python API of distributed linear algebra in MLlib.

[jira] [Commented] (SPARK-4705) Driver retries in cluster mode always fail if event logging is enabled

2015-03-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342279#comment-14342279 ] Apache Spark commented on SPARK-4705: - User 'twinkle-sachdeva' has created a pull

[jira] [Comment Edited] (SPARK-5564) Support sparse LDA solutions

2015-03-01 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342311#comment-14342311 ] Debasish Das edited comment on SPARK-5564 at 3/1/15 4:41 PM: -

[jira] [Comment Edited] (SPARK-5564) Support sparse LDA solutions

2015-03-01 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342311#comment-14342311 ] Debasish Das edited comment on SPARK-5564 at 3/1/15 4:51 PM: -

[jira] [Commented] (SPARK-5564) Support sparse LDA solutions

2015-03-01 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342311#comment-14342311 ] Debasish Das commented on SPARK-5564: - I am right now using the following PR to do

[jira] [Comment Edited] (SPARK-5564) Support sparse LDA solutions

2015-03-01 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342311#comment-14342311 ] Debasish Das edited comment on SPARK-5564 at 3/1/15 4:20 PM: -

[jira] [Comment Edited] (SPARK-5564) Support sparse LDA solutions

2015-03-01 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342311#comment-14342311 ] Debasish Das edited comment on SPARK-5564 at 3/1/15 4:19 PM: -

[jira] [Commented] (SPARK-5564) Support sparse LDA solutions

2015-03-01 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342312#comment-14342312 ] Debasish Das commented on SPARK-5564: - By the way the following step is an

[jira] [Commented] (SPARK-2430) Standarized Clustering Algorithm API and Framework

2015-03-01 Thread RJ Nowling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342382#comment-14342382 ] RJ Nowling commented on SPARK-2430: --- I think we can close this JIRA. It's been

[jira] [Updated] (SPARK-3357) Internal log messages should be set at DEBUG level instead of INFO

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3357: - Assignee: Sean Owen Internal log messages should be set at DEBUG level instead of INFO

[jira] [Commented] (SPARK-5960) Allow AWS credentials to be passed to KinesisUtils.createStream()

2015-03-01 Thread Chris Fregly (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342359#comment-14342359 ] Chris Fregly commented on SPARK-5960: - linking to an old jira where this was

[jira] [Updated] (SPARK-5694) Python API for evaluation metrics

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5694: - Assignee: Xiangrui Meng Issue Type: Umbrella (was: New Feature) Python API for evaluation

[jira] [Updated] (SPARK-5694) Python API for evaluation metrics

2015-03-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5694: - Description: This is an umbrella JIRA for evaluation metrics in Python. They should be defined

[jira] [Created] (SPARK-6090) Add BinaryClassfiicationMetrics in PySpark/MLlib

2015-03-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6090: Summary: Add BinaryClassfiicationMetrics in PySpark/MLlib Key: SPARK-6090 URL: https://issues.apache.org/jira/browse/SPARK-6090 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-2430) Standarized Clustering Algorithm API and Framework

2015-03-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2430. -- Resolution: Won't Fix OK. Also wondering about things like

[jira] [Created] (SPARK-6101) Create a SparkSQL DataSource API implementation for DynamoDB

2015-03-01 Thread Chris Fregly (JIRA)
Chris Fregly created SPARK-6101: --- Summary: Create a SparkSQL DataSource API implementation for DynamoDB Key: SPARK-6101 URL: https://issues.apache.org/jira/browse/SPARK-6101 Project: Spark

[jira] [Created] (SPARK-6102) Create a SparkSQL DataSource API implementation for Redshift

2015-03-01 Thread Chris Fregly (JIRA)
Chris Fregly created SPARK-6102: --- Summary: Create a SparkSQL DataSource API implementation for Redshift Key: SPARK-6102 URL: https://issues.apache.org/jira/browse/SPARK-6102 Project: Spark

  1   2   >